What does this cost, and how is it priced?

Pricing is project-based: scoped and fixed before we write production code, not hourly staff augmentation. As a rough shape, most engagements start in the €15k–€50k range; smaller scoped builds come in below that, and larger multi-feature products run above €50k. The running cost ceiling for inference is a separate number that we design to and enforce in code.

How long until something is actually running?

You see working software in your environment within the first weeks, with a demo every Friday, not a big-bang reveal at the end. A focused single feature is typically a few weeks to production; a multi-surface product is longer and broken into milestones you sign off on. We don't show notebooks and call it progress.

What don't you do?

We don't do staff augmentation, ship demos as deliverables, edit prompts in a console with no version control, or build without evals and a cost ceiling. We don't put juniors on your project. And if a deterministic rule or a plain integration solves the problem cheaper than AI, we tell you and point you there.

How is this different from your other services?

AI Strategy decides what to build and sets the roadmap, with no code written. Integration wires AI into systems and data you already run. Automation strings workflows and agents together to remove manual steps. AI Development is where the production software gets written: the custom feature or product, with evals, guardrails, cost ceilings and an owned metric.

01 / Services / AI Development

We build the AI feature and ship it to production

AI Development is the build service. We take a scoped feature or product and turn it into software running in your environment, with evals, guardrails and a cost ceiling. Working code every week, not demos as deliverables.

02/What you get

Everything you need. Shipped to production.

No notebooks. No demos as deliverables. Just software running in your environment, with evals, guardrails and a cost ceiling you agreed up front.

A documented model choice

Frontier vs open-weight per task, with the cost, latency and quality trade-off written down against your data. Not a vendor default you can't argue with.

Evals running in CI from day one

A labelled test set scored on every change. A prompt or model swap that regresses quality fails the build, before it reaches a user.
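
To make the idea concrete, here is a minimal sketch of an eval gate, assuming exact-match scoring and a stored baseline score. The names `EvalCase`, `score` and `passes_gate` are hypothetical, chosen for illustration; a real eval set would usually need a task-specific scoring function.

```python
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class EvalCase:
    """One labelled example: an input prompt and the expected answer."""
    prompt: str
    expected: str


def score(cases: Sequence[EvalCase], predict: Callable[[str], str]) -> float:
    """Return the fraction of cases whose prediction matches the label.

    Matching is exact after trimming whitespace and lowercasing
    (an assumption made to keep the sketch small).
    """
    if not cases:
        raise ValueError("eval set is empty")
    hits = sum(
        1
        for case in cases
        if predict(case.prompt).strip().lower() == case.expected.strip().lower()
    )
    return hits / len(cases)


def passes_gate(current: float, baseline: float, tolerance: float = 0.01) -> bool:
    """True if the current score has not regressed beyond the tolerance."""
    return current >= baseline - tolerance
```

In a CI job, the script would call `score` with the candidate prompt or model, compare against the baseline with `passes_gate`, and exit non-zero when the gate fails so the build stops.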

Prompts and config in version control

Prompts live in the repo, are diffable and reviewed in PRs, with the model version pinned. No silent changes in a console nobody can trace.

A cost ceiling enforced in code

Per-request and per-tenant budgets, model-tier routing, caching and alerts before you hit the number. Agreed up front, not discovered on the invoice.
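
As an illustration, a per-tenant budget with an alert threshold and a simple tier router could look like the sketch below. It assumes costs are tracked in integer cents and that there are two model tiers; `TenantBudget`, `BudgetExceeded` and `pick_tier` are hypothetical names, not part of any library.

```python
from dataclasses import dataclass


class BudgetExceeded(Exception):
    """Raised when a charge would push a tenant past its ceiling."""


@dataclass
class TenantBudget:
    ceiling_cents: int        # hard limit agreed up front
    alert_percent: int = 80   # alert once this share of the ceiling is spent
    spent_cents: int = 0

    def charge(self, cost_cents: int) -> bool:
        """Record a cost. Returns True when the alert threshold is reached.

        Raises BudgetExceeded, without recording anything, if the charge
        would exceed the ceiling.
        """
        if self.spent_cents + cost_cents > self.ceiling_cents:
            raise BudgetExceeded(
                f"charge of {cost_cents} would exceed ceiling {self.ceiling_cents}"
            )
        self.spent_cents += cost_cents
        return self.spent_cents * 100 >= self.alert_percent * self.ceiling_cents


def pick_tier(estimated_cents: int, per_request_cap_cents: int) -> str:
    """Route to the cheaper tier when the larger model would break the cap."""
    return "frontier" if estimated_cents <= per_request_cap_cents else "small"
```

The caller would check the budget before sending the request, so an over-budget request is refused or downgraded rather than billed.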

Guardrails on the user-facing path

Schema-constrained outputs, PII handling, injection and jailbreak mitigation, sane refusals on out-of-scope asks. It survives real users, not just the happy path.
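
A minimal sketch of the schema check, assuming the model is asked to return JSON with an `intent` and a `reply` field. The field names, the allowed intents and the refusal text are hypothetical; the point is that anything off-schema falls back to a safe refusal instead of reaching the user.

```python
import json

# Hypothetical set of intents the feature is scoped to handle.
ALLOWED_INTENTS = {"refund", "order_status", "out_of_scope"}

REFUSAL_TEXT = "Sorry, I can't help with that here."


def refusal() -> dict:
    """Return a fresh refusal payload so callers cannot mutate a shared one."""
    return {"intent": "out_of_scope", "reply": REFUSAL_TEXT}


def parse_model_output(raw: str) -> dict:
    """Validate raw model output against the expected shape.

    Returns the validated payload, or a refusal when the output is not
    valid JSON, not an object, or has a missing or unexpected field value.
    """
    try:
        data = json.loads(raw)
    except json.JSONDecodeError:
        return refusal()
    if not isinstance(data, dict):
        return refusal()
    intent = data.get("intent")
    reply = data.get("reply")
    if not isinstance(intent, str) or intent not in ALLOWED_INTENTS:
        return refusal()
    if not isinstance(reply, str):
        return refusal()
    # Drop any extra keys the model added; only the schema fields pass through.
    return {"intent": intent, "reply": reply}
```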

Software you own, with the metric

Code in your repo, a runbook, and a dashboard tracking the one number we agreed to move. A clean handover your team can run without us.

Weekly

Working code shipped, with a demo every Friday

€15k–€50k

Where most engagements start

Day one

Evals running in CI, prompts in version control

Senior engineers

No juniors on your project. Principal engineers only.

1 metric

One owned number, agreed up front and tracked to the end

03/How we work

Five steps. To production, fast.

From a thirty-minute call to a deployed feature with an owned metric, broken into milestones you sign off on.

01 · ASSESS

Assessment call

Thirty minutes with an engineer on the call. You describe the feature, the data you have, latency tolerance and budget shape. We tell you honestly whether this is a build, an integration, or something you don't need us for.

02 · SCOPE

Scope and eval design

Architecture, model choice with the trade-off written down, the eval set and target metric defined together with you, cost ceiling, milestones and price in writing. The metric is agreed here, not retrofitted at the end.

03 · BUILD

Build, week by week

Senior engineers ship to a real environment on a weekly cadence, with a demo every Friday against the eval set. Prompts and config in version control from the first commit; evals running in CI.

04 · HARDEN

Hardening and guardrails

Once the core path works: injection and abuse handling, cost-ceiling enforcement, observability, load behaviour and edge cases. The thing holds up under real traffic.

05 · SHIP

Ship and own the metric

Production deploy, dashboard live, tuned until the agreed number holds. Then a clean handover with runbook and docs, or we stay on for Support & Scale if you want it.

04/Who it's for