
Gen AI Developer/AI ML Engineer

ZNP

6L – 7L / yr · Full time/Permanent · 2 Years · Day Shift · Work From Office · Bengaluru

Posted today


Key skills

MCP · Jira · Advanced Python · DevOps · Schema · Data Governance Management · SharePoint · Auditing · Testing


Job description

AI ML


We are looking for a GenAI/AI-ML engineer to build and own core LLM/RAG services, agentic workflows, and the integration layer that connects AI capabilities to our platform. You will work Python-first, ship streaming APIs, manage prompt lifecycles, and ensure safety, observability, and performance at scale.

Role Requirements: What We're Looking For

Everything you need to know about this role: responsibilities and the skills we value.

Strong Python skills (FastAPI, asyncio, Pydantic); clean, tested, production-ready code.

Hands-on experience building LLM-powered applications (OpenAI, Gemini, Llama, or similar).

Experience with RAG architectures and vector databases (pgvector, FAISS, Pinecone, or similar).

Familiarity with agentic frameworks (LangChain, LlamaIndex, CrewAI, or custom implementations).

Knowledge of streaming APIs: SSE and WebSockets.

Understanding of prompt engineering, prompt versioning, and evaluation techniques.

Experience integrating REST/gRPC APIs and enterprise connectors.

Awareness of AI safety, guardrails, PII handling, and responsible AI practices.

Familiarity with observability tooling (OpenTelemetry, Grafana, Prometheus, or similar).

Good understanding of CI/CD pipelines and DevOps practices in an AI/ML context.

Responsibilities

Build LLM/RAG services in Python (FastAPI/asyncio, Pydantic) with clean APIs and tests.

Implement agentic AI workflows: tool-using agents with planning, memory, multi-step execution, and recovery/fallback paths.

Stream responses server-side from model to UI (SSE/WebSockets with retries, backoff, partial responses, and cancellation).

Build RAG pipelines: ingestion, chunking, embeddings, indexing, reranking, and grounded answers with citations (pgvector/FAISS/Pinecone).

Manage prompt versioning, templates/params, safe fallbacks, and rollbacks.

Run evaluations: hallucination/groundedness checks, regression suites for prompts and retrievers.

Implement guardrails: PII detection/redaction, content safety, and domain constraints.

Optimise for performance and cost: context reduction, caching/batching, request pacing, and rate-limit handling.

Design and ship REST/gRPC endpoints that orchestrate tools, retrieval, and post-processing (citations/formatting).

Implement and consume MCP (Model Context Protocol) tool adapters (files, web, DB connectors) with capability negotiation, auth/permissions, and resource limits.

Integrate vector stores (pgvector/FAISS/Pinecone) plus DB/file stores and enterprise connectors (SharePoint, Confluence, Slack, Jira).

Own AuthN/Z and tenancy: JWT/OAuth, role/tenant isolation, secrets management, and audit logging for AI actions.

Set up observability: logs, metrics, traces, and dashboards/alerts for latency, error rate, and token spend.

Manage queues/workflows for background ingestion/summarisation with idempotency and retries.

Enforce data governance: input/output validation, schema contracts, PII handling, and retention policies.

Maintain CI/CD for prompts, retrieval configs, and API changes; support blue/green or canary releases.


About the company

ZNP

Exclusive immediate-hire listing on ZeroNoticePeriod

A client of ZeroNoticePeriod

Mode of interview

CV Screening · Technical Assessment · Client Interview
