I help AI systems understand questions better by evaluating, refining, and improving how large language models (LLMs) interpret and respond to human input.
I specialize in Human-in-the-Loop (HITL) AI training, focusing on structured evaluation, data quality, and prompt refinement to improve model accuracy, reasoning, and instruction adherence.
My core services include:
• LLM response evaluation and grading
• AI output quality assurance (QA)
• Data annotation and text labeling
• Prompt evaluation and prompt optimization
• NLP content review
• Conversational AI testing
• AI alignment and instruction-following checks
• Structured feedback documentation
I work with detailed rubrics and evaluation frameworks to assess:
Accuracy and factual consistency
Logical reasoning and coherence
Instruction compliance
Tone and contextual alignment
Bias, ambiguity, and edge cases
With a background in procurement and operations, I bring analytical thinking, structured documentation, and high attention to detail into AI model evaluation workflows. I am comfortable identifying subtle reasoning gaps, constraint violations, and real-world context issues that automated systems often miss.
If you are building or improving:
Large Language Models (LLMs)
Conversational AI systems
Chatbots
Generative AI applications
AI-powered tools
I can support your team by delivering consistent, high-quality evaluation work that strengthens training data and improves model performance.
Reliable. Structured. Detail-oriented.
Focused on clarity, alignment, and measurable improvement.
Experience: 2 - 5 years
Workflow Optimization & Process Improvement - I design structured workflows and simple systems that reduce friction, improve clarity, and support scalable operations for founder-led teams.
Experience: 5 - 10 years
Cross-Functional Communication & Stakeholder Coordination - Experienced in aligning multiple parties across time zones, managing expectations, and ensuring clear, consistent communication.
Experience: 2 - 5 years
AI-Assisted Drafting with Human Refinement - I use AI strategically to accelerate documentation, summaries, and communication — while applying careful human review to maintain tone, accuracy, and credibility.
Experience: 2 - 5 years
Administrative & Calendar Coordination - Organized and dependable in managing schedules, logistics, and recurring operational tasks across time zones.
Experience: 2 - 5 years
SOP & Documentation Development - Able to create and refine internal guides, checklists, and templates that maintain consistency and reduce repeat questions.
Experience: 2 - 5 years
Transcript Review & Document Cleanup - Comfortable reviewing long-form transcripts and structured documents to ensure clarity, formatting consistency, and accuracy.
Experience: 2 - 5 years
Client Onboarding & Communication Support - Skilled at creating structured onboarding flows and clear messaging that helps clients feel welcomed and informed from the start.
Experience: Less than 6 months
Email Marketing Platforms (e.g., Mailchimp – learning stage) - Comfortable learning new marketing platforms quickly and applying structured testing to ensure accurate execution.
Experience: Less than 6 months
Basic Automation Tools - Familiar with lightweight automation concepts and continuously expanding knowledge of workflow tools.
“I have found someone who is smart, has a great work ethic and is easy to work with.”
Sara Brumfield
SEE MORE REAL RESULTS“They're not only loyal and hardworking, they're super detail oriented!”
- Travis OVAAnswers
Onlinejobs.ph "ID Proof" indicates if "they are who they say they are".
It DOES NOT indicate skill level.
ID Proof scores are 0 - 99 with 99 being the best. It is calculated based on dozens of data points.
It's intended to help employers know who they're talking to is real, and not a fake identity.