VK

Research Programmer

6.0/10

VK

Not specified
Hybrid
mid
about 5 hours ago
aitechPythonpandasPyTorchTensorFlowAPI LLMMLDSNLPRAGSQL
AI SummaryVerified by Aipplify AI

The vacancy is well-defined but lacks compensation details, affecting overall quality.

AI quality score6.3 / 10

Check Match — Just drop your CV

See your fit for Research Programmer in seconds.

Overview

VK is looking for a Research Programmer to participate in designing RAG architecture for B2B cases and work with LLMs. The role involves data ingestion, evaluation pipelines, and integration with infrastructure. VK is a leading technology company that develops a wide range of products and services, focusing on social media, communication, and entertainment.

Responsibilities

  • •Participate in designing RAG architecture for B2B cases: help choose data sources, set up ingestion, chunking, basic retrieval strategies (dense/hybrid) under the guidance of more experienced colleagues;
  • •Configure LLM/agents for the subject area: implement prompts, connect tools (tool calling), work with context and basic memory;
  • •Implement POC/POV on client data, conduct iterative improvements based on metrics and feedback;
  • •Build Data Ingestion pipelines with OCR;
  • •Implement evaluation pipelines: generate test sets, run auto-eval, participate in human-eval;
  • •Calculate and analyze key metrics (accuracy, coverage, latency, user success rate), prepare reports for the team;
  • •Participate in setting up monitoring: logging, basic alerts, analyze unsuccessful cases and their fixes;
  • •Participate in integration with infrastructure (Kubernetes, cloud), work with metrics and logs;
  • •Participate in load testing to assess inference speed and necessary resources.

Requirements

  • •Experience with Python (pandas, PyTorch/TF, API LLM);
  • •Experience in ML/DS/NLP for at least two years;
  • •Experience with LLM/RAG;
  • •Understanding of the full ML development cycle;
  • •Practical experience with RAG: setting up retrieval, working with embedding models, vector databases (FAISS, Qdrant, Pinecone, etc.);
  • •Understanding of basic quality improvement techniques (reranking, filtering, prompt tuning);
  • •Experience with agent frameworks (LangChain / LangGraph / LlamaIndex / ADK, etc.);
  • •Basic understanding of agentic approaches and tool calling;
  • •Experience in prompt engineering and configuring LLM for tasks;
  • •Basic experience in fine-tuning (or understanding approaches: LoRA, QLora, instruction tuning);
  • •Understanding of inference optimization (at the level of using tools: vLLM, quantization, etc.);
  • •Experience with popular ML frameworks: transformers, vllm, sglang, pytorch;
  • •SQL, basic understanding of working with data and pipelines;
  • •Experience deploying services (Docker, FastAPI), basic understanding of Kubernetes/cloud;
  • •Skills in logging, monitoring, and basic A/B experiments;
  • •Experience with evaluation for ML/LLM (auto-metrics + participation in manual evaluation, LLM-as-Judge).

Skills

PythonpandasPyTorchTensorFlowAPI LLMMLDSNLPRAGSQLDockerFastAPIKubernetes
Loading similar jobs...