handoff

End of page. Start of the working loop.

If this site did its job, the next step is simple: ask for the private CV, read the field notes, or check the public proof. I am looking for AI product work where the model is only one part of the system.

Request CVRead the notes
available for
Staff / Principal AI product roles
best work
harnesses, retrieval, evals, workflow control
base
Madrid / remote-first
paths
writingprojectsabout
contact
emailLinkedInGitHubX
context before autonomyevals before confidenceproof beats claimscode must stay legiblesystems over demosprivate CV on requestcontext before autonomyevals before confidenceproof beats claimscode must stay legiblesystems over demosprivate CV on request
(c) 2026 Petru Arakissbuilt as a public surface; private documents stay off the web
petruarakiss
aboutprojectswriting
open to Staff / Principal AI product roles

Petru Arakiss

AI products that survive production.

Madrid-based product engineer working across UX, backend, retrieval, evals, and agentic workflows.

I use Codex and Claude inside the engineering loop, but the standard is still clear product judgment, reliable software, and systems people can operate when the model is unsure.

Request CVLinkedInGitHub
role signal
best fit

Staff / Principal IC roles where AI has to become product infrastructure.

current edge

agent harnesses, retrieval systems, eval loops, and workflow control.

working range

full-stack product ownership across UX, backend, data, and operations.

operating principles
  • agent harnesses before agent hype
  • context is a product surface
  • evals before confidence
  • code must stay legible
current proof

Proof outside the demo.

The public version is intentionally compact. The pattern is the same across the private work: build the product surface, define the harness, keep evidence visible, and make failure recoverable.

01

BIFROST

document evidence

Financial document pipeline with OCR, semantic chunking, cited retrieval, and reviewable answers for messy uploads.

02

ORVIAN

workflow control

B2B collections workflow with intent parsing, deterministic states, drafting support, and escalation rules.

03

Polaris

internal search

Permission-aware retrieval, cited answers, and streaming UX for support, sales, and product teams.

positioning

The model is one component.

The useful work is usually the system around it: how it gets context, chooses tools, exposes uncertainty, hands work back to people, and stays understandable after months of changes.

AI product engineering

I build products where language models are useful components inside a larger system: permissions, state, queues, interfaces, persistence, observability, and failure handling.

Agentic workflow design

I design routing, tool use, handoffs, evaluator loops, human checkpoints, and stopping conditions for workflows that need judgment without becoming uncontrolled automation.

Harness engineering

I shape the environment around Codex, Claude, and human engineers: repository knowledge, executable plans, review loops, browser checks, traces, evals, and CI guardrails.

Context and retrieval systems

I work on the less glamorous parts of RAG: document ingestion, chunking strategy, metadata, grounding, permission boundaries, citations, and workflows for when retrieval is uncertain.

Eval-driven reliability

I turn vague quality into examples, traces, graders, regression checks, and operating thresholds so teams can improve systems instead of arguing from screenshots.

Full-stack delivery

I can own the product surface and the backend path: Next.js, TypeScript, Python, FastAPI, Postgres, Redis, background jobs, deployment, monitoring, and cost control.

working model

Fast with AI, strict with the work.

I use AI tools aggressively, but not as a substitute for judgment. The leverage comes from designing a loop where humans, agents, and the running system can all give useful feedback.

With people

I clarify the goal, the risk, and what must stay under human control. The point is not to automate everything; it is to make the right work easier to trust.

With agents

I use Codex and Claude as engineering collaborators, but I design the harness around them: context, tools, tests, reviews, and observable feedback from the running system.

With systems

I care about boundaries, data contracts, failure modes, latency, cost, and the operational screen someone will use when the model is wrong or unsure.

best fit

For teams where AI has to become a product, not a slide.

  • Staff or Principal Engineer for AI-native product teams
  • AI Engineering Lead for agentic workflows and internal tools
  • AI Platform / Developer Productivity roles building agent harnesses
  • Full-stack product ownership where AI, UX, and backend reliability meet
Request CVLinkedIn
public surface
agentic workflows

routing, tool use, handoffs, evaluator loops, and stopping conditions

harness engineering

repo-local knowledge, executable plans, browser checks, traces, and review loops

full-stack systems

Python, FastAPI, Next.js, TypeScript, Postgres, Redis, observability, CI

writingprojectspast work: BBVA, Santander, Bankinter, El Corte Ingles