We're Hiring

Forward Deployed Eval Engineer

Be the bridge between cutting-edge AI evaluation technology and real-world customer success. Help organizations build trust in their AI systems through hands-on implementation and expertise.

Apply Now Learn More

A Note on This Role

The "Forward Deployed Eval Engineer" role was coined by Shen Pandi, Founder of TowardsEval, based on his extensive experience in AI and his hypothesis about why 95% of AI projects fail (according to MIT Research and Gartner). Through years of working with organizations struggling to deploy trustworthy AI systems, Shen identified a critical gap: the lack of specialized engineers who can bridge the divide between AI evaluation theory and practical implementation. This role was created to address that gap and fundamentally change how organizations approach AI trust and safety.

What is a Forward Deployed Eval Engineer?

A Forward Deployed Eval Engineer is a specialized technical role that combines deep expertise in AI/ML evaluation with hands-on customer implementation. Unlike traditional software engineers who work from headquarters, Forward Deployed Eval Engineers work directly alongside customer teams—either on-site or remotely—to ensure their AI systems are rigorously tested, compliant, and trustworthy.

This role sits at the intersection of technical excellence, customer success, and AI safety. You'll be the expert who translates complex evaluation methodologies into practical, actionable implementations that help organizations confidently deploy AI systems.

Core Mission

Empower organizations to build and maintain trust in their AI systems by implementing comprehensive evaluation frameworks, ensuring regulatory compliance, and establishing continuous monitoring practices that catch issues before they impact users.

Why This Role is Super Important

AI Safety & Trust

As AI systems become more prevalent in critical applications (healthcare, finance, hiring), ensuring they're safe, unbiased, and reliable isn't optional—it's essential. You'll be the guardian helping organizations avoid costly mistakes and reputational damage.

Regulatory Compliance

With regulations like the EU AI Act, organizations face legal requirements for AI evaluation and documentation. You'll ensure customers meet compliance standards while maintaining operational efficiency.

Customer Success

Many organizations want to evaluate their AI properly but lack the expertise. You'll be the trusted advisor who translates technical complexity into practical solutions, ensuring customers achieve their AI trust goals.

Business Impact

Your work directly impacts customer retention, expansion, and satisfaction. By ensuring successful implementations, you'll drive TowardsEval's growth while helping organizations build better AI systems.

How is This Different from a Forward Deployed Engineer?

Traditional Forward Deployed Engineer

Focuses on general software implementation and integration

Works across various product features and use cases

Primarily concerned with technical deployment and uptime

May not require deep domain expertise in AI/ML

Forward Deployed Eval Engineer (You!)

Specialized in AI evaluation: Deep expertise in LLM testing, bias detection, statistical analysis, and evaluation methodologies

Trust & safety focus: Ensures AI systems are safe, unbiased, compliant, and reliable—not just functional

Regulatory expertise: Understands EU AI Act, GDPR, and other AI compliance requirements

Evaluation methodology: Designs custom test suites, statistical experiments, and monitoring dashboards specific to each customer's AI use case

Advisory role: Acts as a trusted consultant on AI best practices, not just a technical implementer

What You'll Do

Customer Implementation

•Work directly with customer teams to implement TowardsEval's evaluation platform
•Design custom evaluation workflows tailored to each customer's AI use case
•Configure bias detection, toxicity checks, and compliance monitoring
•Set up statistical A/B testing and experiment frameworks

Training & Enablement

•Train customer teams on AI evaluation best practices and methodologies
•Conduct workshops on bias detection, fairness testing, and compliance
•Create documentation and playbooks for ongoing evaluation processes
•Empower customers to independently maintain and expand their evaluation practices

Compliance & Risk Management

•Ensure customer AI systems meet EU AI Act and other regulatory requirements
•Implement documentation and audit trails for compliance reporting
•Identify and mitigate AI risks before they impact production systems
•Advise on industry-specific compliance requirements (healthcare, finance, etc.)

Strategic Advisory

•Act as a trusted advisor on AI evaluation strategy and best practices
•Help customers define success metrics and KPIs for their AI systems
•Provide feedback to product team on customer needs and feature requests
•Build long-term relationships that drive customer success and expansion

Who You Are

Technical Skills

Strong background in AI/ML, particularly LLMs and generative AI
Experience with Python, statistical analysis, and evaluation frameworks
Understanding of bias detection, fairness metrics, and AI safety
Familiarity with A/B testing, experiment design, and statistical significance
Knowledge of AI regulations (EU AI Act, GDPR, etc.)

Soft Skills

Excellent communication skills—can explain complex concepts simply
Customer-focused mindset with strong relationship-building abilities
Problem-solving orientation—thrives on tackling unique challenges
Comfortable with ambiguity and rapidly changing environments
Willingness to travel and work flexible hours across time zones

Ready to Make AI Trustworthy?

Join TowardsEval as a Forward Deployed Eval Engineer and help organizations around the world build AI systems they can trust. Your work will directly impact the safety, fairness, and reliability of AI systems used by millions.

Apply Now Learn About TowardsEval