Founding AI Engineer

Permanent
Technology
Experienced
AI, LLM, Agent Development
San Francisco, California
California
US$120000 - US$180000 per annum + Equity

Tired of LLMs breaking in the wild? Come build the platform that prevents it.

One of the most exciting early-stage startups in the AI infrastructure space is hiring a Founding Machine Learning Engineer to help shape the future of LLM observability, testing, and evaluation.

They're backed by Y Combinator and founded by IIT Bombay alumni with experience at ETH Zurich and top-tier quant trading firms. The mission? To make sure LLM-powered voice agents actually work - before they go live.

Their platform automatically simulates thousands of real-world conversations - from ordering food to handling job interviews - to stress-test agents with scale and depth. Think load testing meets GPT, with full evaluation, benchmarking, and monitoring.

🛠 What you'll build:

  • AI tools to test, evaluate and benchmark large language models (LLMs)
  • Scalable pipelines for real-time agent monitoring and performance feedback
  • Core infrastructure for LLM agent reliability
  • Customer-facing features, working directly with users and founders

🙋‍♂️ Who they're looking for:

  • Strong Python and ML engineering experience
  • Hands-on background in LLM product development or deployment
  • Interest in agent infrastructure, evaluation frameworks, or LLM testing
  • Bonus if you've worked in early-stage startups or on AI tooling

💡 This is your chance to be the technical co-founder of a product every AI team will need building at the edge of what's possible in AI reliability.

If you (or someone you rate highly) is excited by the intersection of AI agents, LLM infrastructure, and startup ownership - drop me a message. Happy to share more.

Similar Jobs

£125000 - £150000 per annum + + double OTE
New York
Series A devtools startup ($60M raised) is hiring Enterprise AEs to sell a high-ACV, AI-powered platform that helps engineers rapidly build internal apps-selling into technical stakeholders (CTOs, VPs Eng, etc).
US$120000 - US$180000 per annum + Equity
California
Join a Y Combinator-backed AI startup as a Founding ML Engineer to build tools that test and monitor LLM-powered voice agents. You'll develop core infrastructure, real-time evaluation pipelines, and user-facing features. Ideal for engineers with strong Python, ML, and LLM experience who want to shape critical AI reliability tools.
US$125000 - US$150000 per annum + + double OTE
Texas
We're hiring a Founding Account Executive to lead GTM at a stealth AI startup making internal data conversational through LLMs. Work directly with a proven, multi-time founder to shape the sales motion from day one.
£350 - £450 per day
West Midlands
Job Title: Network Support Engineer Location: Birmingham - Hybrid Job Type: Contract (Inside IR35) - 9-12 months Rate: £350-450 p/day
£800 - £900 per day
Berkshire
We are working on behalf of a leading organisation seeking an experienced Workday Extend & Integration Specialist to take a hands-on role in the design, build, and delivery of bespoke Workday applications. This is a fantastic opportunity to work across multiple business areas, applying modern integration techniques and enhancing enterprise systems through intelligent automation and secure, scalable design.
£40000 - £60000 per annum
Norfolk
This is a unique opportunity to join a close-knit team as they rebuild their data-driven platform from the ground up. You'll be responsible for improving the system that ingests and processes large volumes of real-time data, presents it in a user-friendly interface, and powers external integrations via robust APIs. Tech stack includes Laravel (PHP) and Vue.js - so proven experience in both is essential.
£100000 - £110000 per annum + 10% bonus
London
Key Responsibilities: Collaborate with stakeholders, engineers, and product managers to define architectural solutions. Design scalable, secure backend systems using microservices and Google Cloud. Ensure seamless integration of APIs and backend services into Unity-based game environments. Provide architectural oversight on AI enablement and automation initiatives. Create and maintain architecture diagrams, API specs (OpenAPI 3.0), and data flows.
£77000 - £116000 per annum
London
Job Title: Elasticsearch Platform Engineer Location: London - full time on-site Clearance Requirement: UK-Highest Level of Government Clearance Salary Range: £77,000 - £116,000 + Benefits
Job Title: NOC Architect / NOC SME - Network Operations Centre Location: Remote Job Type: Contract (Inside IR35) - 3 months initially w/ likely extensions Rate: £700-750 p/day
£60000 - £90000 per annum
Gloucestershire
We are working on behalf of a leading defence and national security organisation to identify experienced Agile Coaches and Scrum Masters to support a high-impact, mission-critical programme. This is an opportunity to contribute to complex and sensitive technology projects with real-world implications.
£100000 - £115000 per annum
London
TEC Partners are working with a world leading defence and security company who work in collaboration with governments across the world to ensure international security across land, sea and air. They are leaders in highly sensitive international protection and continually seek to lead the way for innovation in their domain.
£100000 - £110000 per annum
Gloucestershire
TEC Partners are representing one of the leaders in international defence and security who specialise across land, sea and air working with governments and their authorised agency partners.