Applied AI Engineer

Permanent
Technology
Senior
AI, LLM, Agent Development
San Francisco, California
California
US$180000 - US$250000 per annum + Equity

Tired of brittle, unpredictable LLM agents? Come help build the IDE that makes them actually work.

I'm partnering with one of the most exciting early-stage startups in the AI tooling space - a YC-backed company that's on a mission to bring structure (and joy) to human-AI collaboration.

They're building an IDE that lets anyone design, test, and deploy sophisticated AI agents using natural language - not code. Imagine the Notion of AI systems, empowering the next billion knowledge workers to create with AI.

They're now hiring an Applied AI Engineer to help architect and scale the core agent infrastructure - from memory and evaluation to real-world reliability.


🚀 What you'll build:

  • Multi-step, tool-using agents that call real APIs, manage auth, retries, timeouts, and all the tricky edge cases.

  • RAG pipelines that turn messy data into grounded, useful answers.

  • Memory systems that persist context - scratchpads, summary buffers, embedding stores.

  • Deterministic execution and replay tools so users can trace exactly how an agent thinks.

  • A robust eval framework blending automated checks with human-in-the-loop scoring.

  • Plus whatever greenfield ideas you want to bring to life.


🧑‍💻 Who they're looking for:

  • 3+ years of engineering experience shipping production software. You've built agent-like systems: multi-step LLM workflows, tool-using bots, or scripted assistants.

  • Hands-on with:

    • RAG (embeddings, vector DBs, chunking)

    • Agent memory (scratchpads, history compression, summaries)

    • Orchestrating real tools + APIs (auth flows, plugins)

    • Evaluation - defining success metrics, running regression tests, iterating on agent behavior

  • Obsessed with fast response times, predictable outputs, traceability, and uptime. This is production, not research.

  • Thrive in fast-moving, product-first teams that bias for shipping.

Bonus points for:

  • Experience with (or strong opinions about) LangChain, CrewAI, DSPy.

  • Shipped agents used by actual customers - beyond internal demos.

  • Deep familiarity with LLM ops, tracing, observability.

  • Been a founder or early engineer who sweats the details of product quality.


🏢 The details:

  • Full-time, in-person role in San Francisco (Presidio) - 5 days a week.

  • Must have US work authorization (open to O-1 visas for exceptional folks).

  • You'll do the best work of your life alongside genuinely sharp, friendly people.


🎯 Why it matters:
Most LLM agents break in the wild. Here, you'll help build the platform that ensures they don't - enabling the world to create smarter AI systems without ever writing a line of code.

Similar Jobs

£40000 - £50000 per annum
Norfolk
A well-established organisation is looking to recruit a .NET Developer to join its internal development team, working on the design, development and support of a range of business-critical systems. Reporting to the Development Team Leader, this role will involve developing new applications, enhancing existing systems and providing ongoing technical support to internal users. The successful candidate will play an important role in delivering reliable, high-quality software solutions
£40000 - £50000 per annum
Norfolk
A well-established organisation is seeking an experienced IS Business Analyst to join its Information Systems team. This role will focus on analysing business processes, identifying opportunities for improvement and translating business requirements into clear technical specifications for development teams.
US$125000 - US$150000 per annum + + commission
New York
Our client is an early-stage, venture-backed technology company building compliance infrastructure for AI Agents. Their platform provides runtime control, auditing, and regulatory compliance for agentic systems operating at scale, helping enterprises in regulated industries deploy AI responsibly, without sacrificing speed or innovation.
£50000 - £60000 per annum
Norfolk
A well-established organisation is seeking an experienced .NET Development Team Leader to lead a small but highly capable development team responsible for delivering and supporting business-critical systems.
£60000 - £100000 per annum
Cambridgeshire
Key Responsibilities * Design, develop, and maintain high-performance C++ applications * Collaborate with FPGA engineers, DevOps, and other software engineers * Participate in code reviews, debugging, and performance optimisation * Contribute to architectural decisions and system evolution * Support development of new systems, including projects using Rust
US$125000 - US$150000 per annum + + commission
New York
Our client is an early-stage, venture-backed technology company building compliance infrastructure for AI Agents. Their platform provides runtime control, auditing, and regulatory compliance for agentic systems operating at scale, helping enterprises in regulated industries deploy AI responsibly, without sacrificing speed or innovation.
£45000 - £55000 per annum
Suffolk
Strong Python development experience Solid knowledge of SQL / PostgreSQL Experience with JavaScript, HTML, CSS and modern front-end frameworks Familiarity with Git, CI/CD pipelines (Azure DevOps/Jenkins) Experience working in Agile delivery environments
£300 - £600 per day
Remote
We're currently looking for an experienced LANSA Developer to join an exciting ERP upgrade programme for a leading organisation. This is a contract role for a minimum of 6 months, supporting the migration from Oracle JDE World to Oracle JDE EnterpriseOne. Key Requirements Strong experience developing in LANSA RDML Hands-on use of the Visual LANSA Editor Solid experience working on the IBM i (AS/400) platform Ability to support and contribute to large-scale ERP upgrade or transformation projects
£650 - £700 per day
Remote
We are supporting a large organisation seeking a Senior Developer with strong experience in Oracle Fusion ERP, particularly across HR modules, to support and enhance a critical enterprise platform within its Shared Application Services function.
£50000 - £55000 per annum
Norfolk
Key Responsibilities Maintain and support existing SQL Server databases and infrastructure Develop database modules and software components to meet client requirements Design and propose database architecture and infrastructure improvements Produce and maintain technical documentation including standards and procedures Ensure development best practices including code reviews, testing and standards Mentor and support other members of the technical team
£610 - £620 per day
London
Position: Tech Assurance Delivery Lead Location: Reading 2 days p/week; 3 days remote Type: Contract, Inside IR35, 6 Months Rate: £620 p/day (umbrella rate)
£700 - £705 per day
London
Position: AI Security & Governance Workstream Lead Location: Reading 2 days p/week; 3 days remote Type: Contract, Inside IR35, 6 Months Rate: £705 p/day (umbrella rate)