Risk Labs

Senior LLM Systems Engineer

9.0/10

Risk Labs

$100,000 โ€“ $200,000 USD
Remote
senior
2 days ago
aicryptodefiweb3PythonTypeScriptPostgresGCPCloud RunGitHub ActionsTerraformReact

AI Summary

The vacancy is well-structured and informative, offering clear expectations and compensation details.

Check Match โ€” Just drop your CV

See your fit for Senior LLM Systems Engineer in seconds.

Description

What You'll Own

  • โ€ข**LLM Accuracy:** improve prompts, model selection, tool usage, structured outputs, retrieval, and evaluation coverage so the system gets more decisions right over time.
  • โ€ข**System Performance:** reduce latency, token usage, and cost while preserving decision quality and operational reliability.
  • โ€ข**Resilience:** design validation, retries, fallbacks, uncertainty handling, and human review paths for ambiguous, adversarial, incomplete, or conflicting inputs.
  • โ€ข**Evaluation and Monitoring:** build datasets, regression tests, dashboards, traces, and review loops that make model quality visible and prevent repeated failures.
  • โ€ข**Agent and Tooling Architecture:** Improve agent orchestration and tool use across internal services, APIs, search workflows, databases, and external data sources.
  • โ€ข**Production Operations:** help debug live issues, investigate regressions, improve runbooks, and reduce repeated operator friction.

Compensation and Benefits

  • โ€ขPay packages include competitive salaries & meaningful long term equity participation.
  • โ€ขSalaries for this role range from $100-200k (USD).
  • โ€ขWill pay in stablecoins or fiat.
  • โ€ขPhilosophies for a culture that show we care: Take vacation when you need it, family care, training and development (just to name a few).
  • โ€ข100% remote, which means we encourage you to create the work environment that you thrive in.
  • โ€ขAt least two team wide offsites a year.

Requirements

Skills & Experience

#### Required

  • โ€ข3+ years of professional software engineering experience in Python, TypeScript, or similar production languages.
  • โ€ขHands-on experience building production systems that use LLMs, agents, retrieval, structured outputs, or model-powered workflows.
  • โ€ขExperience designing evaluations, test datasets, regression checks, quality metrics, or manual review loops for AI systems.
  • โ€ขStrong debugging ability across APIs, databases, queues, logs, model outputs, and external data sources.
  • โ€ขPractical understanding of prompt engineering, tool calling, structured output validation, retrieval, and common LLM failure modes.
  • โ€ขAbility to reason carefully about correctness in uncertain or adversarial environments.
  • โ€ขHigh agency, strong ownership, and clear written communication.

#### Nice to Have

  • โ€ขExperience with oracle systems, prediction markets, DeFi protocols, or other crypto infrastructure.
  • โ€ขExperience with UMA, optimistic oracle mechanisms, Polymarket, or similar systems.
  • โ€ขExperience building agentic systems that use tools, search, browser automation, APIs, or database queries.
  • โ€ขExperience with LLM tracing, model monitoring, evaluation frameworks, or AI observability tools.
  • โ€ขExperience optimizing model cost and latency at scale.
  • โ€ขExperience with Postgres, data pipelines, queue-based systems, background jobs, or event-driven architectures.
  • โ€ขFamiliarity with blockchain operational constraints, especially RPC limits, indexing, event logs, finality, and chain-specific behavior.
  • โ€ขExperience with GCP, Cloud Run, GitHub Actions, Terraform, or similar infrastructure.
Loading similar jobs...