Founding AI Engineer

Permanent
Technology
Experienced
AI, LLM, Agent Development
San Francisco, California
California
US$120000 - US$180000 per annum + Equity

Tired of LLMs breaking in the wild? Come build the platform that prevents it.

One of the most exciting early-stage startups in the AI infrastructure space is hiring a Founding Machine Learning Engineer to help shape the future of LLM observability, testing, and evaluation.

They're backed by Y Combinator and founded by IIT Bombay alumni with experience at ETH Zurich and top-tier quant trading firms. The mission? To make sure LLM-powered voice agents actually work - before they go live.

Their platform automatically simulates thousands of real-world conversations - from ordering food to handling job interviews - to stress-test agents with scale and depth. Think load testing meets GPT, with full evaluation, benchmarking, and monitoring.

🛠 What you'll build:

  • AI tools to test, evaluate and benchmark large language models (LLMs)
  • Scalable pipelines for real-time agent monitoring and performance feedback
  • Core infrastructure for LLM agent reliability
  • Customer-facing features, working directly with users and founders

🙋‍♂️ Who they're looking for:

  • Strong Python and ML engineering experience
  • Hands-on background in LLM product development or deployment
  • Interest in agent infrastructure, evaluation frameworks, or LLM testing
  • Bonus if you've worked in early-stage startups or on AI tooling

💡 This is your chance to be the technical co-founder of a product every AI team will need building at the edge of what's possible in AI reliability.

If you (or someone you rate highly) is excited by the intersection of AI agents, LLM infrastructure, and startup ownership - drop me a message. Happy to share more.

Similar Jobs

£55000 - £65000 per annum
Cambridgeshire
My client, a high-growth global software business, operating at the forefront of secure remote access technology, is seeking a Senior Software Engineer to join its Portal team. This is an opportunity to work on a business-critical web platform used daily by thousands of customers worldwide and central to the company's commercial and product ecosystem.
£75000 - £85000 per annum
Remote
Job Title: Senior DevOps Engineer - ArgoCD/GitOps Location: UK - remote Salary: £75-85K Type: Permanent
£70000 - £200000 per annum
Remote
A founding-level Tech Lead / future CTO role at a fully remote, well-funded AI startup building a user-facing "Notion for AI agents." You'll own product and engineering end-to-end, rapidly prototyping and shipping features while also working hands-on with multi-agent systems and AI infrastructure. Small team, long runway, high autonomy, and a clear path to CTO as the company scales.
€70000 - €100000 per annum
Ile de France
A Senior Data Engineer role focused on building the core data infrastructure for a well-funded AI ClimateTech startup. You'll work closely with the CTO to design and scale pipelines in Python, DBT, and PostgreSQL, enabling ML and LLM systems built on complex scientific data. High ownership, early-stage environment, and direct impact on AI products reducing CO₂ emissions.
€80000 - €120000 per annum
Berlin
Join an early-stage team building an applied AI platform for the construction industry, already live with leading enterprises. We're automating the back-office of construction companies by unlocking value from their unstructured data - PDFs, scans, and drawings, saving teams up to 90% of their time.
£40000 - £50000 per annum
London
An established engineering organisation operating in a highly technical environment is seeking a Software Test Engineer to join its growing team. This role offers the opportunity to work on software that supports complex products and services used in demanding, real-world applications.
£27 - £31 per hour
Oxfordshire
End User Device Engineer Location: Kidlington Rate: Up to £250 per day Umbrella Contract until end of March 2026 with possible extension
£70000 - £80000 per annum + + benefits
West Sussex
Tec Partners are working with a well-known utilities provider, who are looking for an experienced Software Developer, with React and Python experience, to join their team, focused on protecting the environment and maintaining water quality in the southern region. As a Software Developer, you will be a key technical contributor to one of their flagship digital products, focused on environmental protection. You will work with Reach and TypeScript on the front-end, Python on the back-end...
Tec Partners are working with a world-leading technology client who are currently looking for an experienced Senior Infrastructure Developer, with a strong background in infrastructure automation, Python and VMware Cloud Foundation (VCF) (or similar virtualization technologies). As a Senior Infrastructure Developer, you will automate configuration management and integrate APIs for virtualization platforms using VMware Cloud Foundation (VCF), working on large-scale deployments.
Negotiable
London
Tec Partners are working with a world-leading technology client who are currently looking for an experienced Senior Infrastructure Engineer, with a strong VMware Cloud Foundation (VCF) expertise. As a Senior Infrastructure Engineer, you will deliver virtualization platforms using VMware Cloud Foundation (VCF), working on large-scale deployments.
€50000 - €60000 per annum
Berlin
We are seeking a Java Developer to join a dynamic team delivering cutting-edge solutions within the defence and security sectors. You will play a key role in designing, developing and maintaining the critical backend infrastructure that supports vital services. This is a unique opportunity to work on diverse and challenging projects, contributing to real-world impact within a hybrid and flexible working environment.
€60000 - €70000 per annum
Berlin
We are seeking a Senior Frontend Developer to join a skilled team working on innovative and complex technology solutions within the defence and security sectors. In this role, you will be responsible for designing, developing and maintaining critical frontend applications that support key services. You will work on a variety of challenging projects and contribute to delivering solutions with real-world impact in a hybrid and flexible working environment.