AI Evaluation Specialist · Prompt Engineer
Building evaluation infrastructure and training data for frontier AI systems.
Computer Engineering graduate with 1+ year of hands-on experience in LLM evaluation, prompt engineering, and AI training data creation.
Currently contracted across three AI platforms: running structured side-by-side (SxS) evaluations and hallucination detection at Mercor AI, authoring SWE-bench-style tasks and gold-standard datasets at AfterQuery, and performing large-scale bilingual data annotation at Innodata.
Skilled in RLHF pipelines, generative AI output assessment, AI alignment, bias detection, and PII identification. Combines strong analytical evaluation skills with full-stack development expertise.
San Francisco, USA — Remote
San Francisco, CA — Remote
Remote
Anand, Gujarat
Anand
Full-Stack Retail System
Retail dashboard with POS billing, real-time inventory tracking, and automated GST reporting. Reduced build time by 40% with Turbopack. Includes secure API routes and KPI-driven analytics.
Face Recognition Pipeline
Real-time face recognition for automated attendance. Achieves over 90% detection accuracy with live video processing and CSV-based logging.
B.E. Computer Engineering · Aug 2021 – May 2025
CGPA 8.35 / 10
English (Professional) · Hindi (Native) · Gujarati (Native)