terminal@nivith:~
ONLINE
$ whoami
Nivith Avula
Available for interviews
Nivith Avula | Generative AI Engineer

I'm a Generative AI Engineer with 5+ years of experience building and shipping production systems. Most of my work sits at the intersection of LLM apps and backend engineering: search and retrieval over real business data, agent workflows for multi-step requests, and APIs that stay stable under real usage. I've worked across supply chain, banking, healthcare, energy, and manufacturing, and I'm comfortable taking a feature from data prep to deployment.

$ cat ./about.md

About

I build GenAI features that people can rely on day-to-day. That usually means turning messy documents and operational data into something searchable, then wiring it into a clean workflow that returns accurate, grounded answers. When the use case needs more than one step, I add agent-style orchestration to handle routing, checks, and follow-ups.

I care a lot about the "boring but important" parts: retrieval quality, stable prompts, output validation, access control, and good logging/monitoring. I also keep deployments simple with CI/CD and containerized releases, so shipping changes doesn't turn into a fire drill.

What I build
RAG over docs • Agent workflows • Document intelligence • Model/API serving • Cloud-native platforms
How I work
Security-first • clean APIs • CI/CD + IaC • monitoring • iterate with evaluation + UAT
$ ls -la ./experience | sort -r

Experience

Blue Yonder logo
Blue Yonder
Generative AI Engineer
Coppell, TX | Jul 2025 – Present
Production work
Azure OpenAILangChainLangGraphRAGAzure Cognitive SearchAKSFastAPIGo

Built and deployed GenAI solutions for supply-chain planning and forecasting using Azure OpenAI, LangChain, LangGraph, and FastAPI. Implemented RAG over SOPs, planning documents, and operational data using Azure Cognitive Search and embedding models, and used agent workflows for multi-step scenarios that needed routing and validation. Developed backend services in Python and Go, built ingestion pipelines with Azure Data Factory and Databricks, and improved quality through prompt tuning, response checks, and regression testing. Owned production readiness with security controls, CI/CD, and monitoring using AKS, Docker, GitHub Actions, Prometheus, and Grafana.

UBS logo
UBS
AI Engineer
New York, NY | Jul 2024 – Jun 2025
Production work
Amazon BedrockSageMakerOpenSearchFAISSTextractNeo4jFastAPITerraformDocker

Worked on production AI and GenAI systems in a regulated banking environment. Built retrieval-based solutions over policies and historical case data using OpenSearch and embedding models, and designed agent workflows for compliance reasoning and validation. Developed document intelligence pipelines with Textract, built scalable inference services using FastAPI and Docker, and supported deployments on AWS using SageMaker, EKS, and Terraform. Focused heavily on governance, auditability, monitoring, and reliability to meet regulatory requirements.

Landis+Gyr logo
Landis+Gyr
Fullstack AI Engineer
Atlanta, GA | Jan 2024 – May 2024
Production work
Amazon BedrockLangChainCrewAIOpenSearchFAISSSageMakerIoT CoreKinesisGlueEMR/SparkFastAPI

Delivered AI and GenAI features for smart-energy analytics, combining LLM-based insights with predictive modeling and large-scale IoT data pipelines. Built agent workflows using LangChain and CrewAI, implemented retrieval-based search over grid and maintenance data using OpenSearch and FAISS, and developed LLM-enabled applications using Amazon Bedrock and Hugging Face models. Built forecasting and anomaly detection models with SageMaker and TensorFlow, and supported high-volume ingestion using IoT Core, Kinesis, Glue, and Spark on EMR. Deployed services through cloud-native patterns with FastAPI, containerization, and automated delivery.

UHG logo
UHG
Python Full Stack Engineer
Dallas, TX | Sep 2023 – Dec 2023
Production work
PythonFlaskDjangoGKEVertex AIKubeflowMLflowBigQueryDataflow

Built healthcare applications and backend services using Python with Flask and Django, focused on reliable APIs for clinical workflows and analytics. Developed and deployed ML and deep learning models for NLP, computer vision, and time-series use cases, and set up reproducible training and deployment using Kubeflow on GKE. Used Vertex AI for experimentation and deployment, and built data pipelines with BigQuery and Dataflow to support reporting and segmentation. Focused on clean service design, stable integrations, and production support in an Agile environment.

Celanese logo
Celanese
Python Developer
Irving, TX | May 2023 – Aug 2023
Production work
PythonFlaskDjangoFastAPIAzure FunctionsAKSAPI ManagementKafkaTerraformAnsible

Built backend and platform services supporting enterprise manufacturing workflows. Developed Python microservices using Flask, Django, and FastAPI, and deployed them using Azure Functions and AKS. Implemented API gateways with authentication, routing, and rate limiting, and built event-driven pipelines using Kafka and Azure messaging services. Worked extensively on CI/CD, infrastructure automation with Terraform and Ansible, and improving performance and reliability of production systems.

Altimetrik logo
Altimetrik
Python Developer
Pune, India | Aug 2020 – Jul 2022
Production work
PythonDjangoHTML/CSS/JSjQueryJenkinsLinuxAWSSpark

Worked on full-stack and backend systems for data-driven enterprise applications. Built Django-based web applications, internal tools, and admin workflows, and supported data processing using Python and Spark. Contributed to CI pipelines using Jenkins and Git, handled Linux-based deployments, and worked with AWS services for application hosting and data workflows. Gained strong experience in building maintainable systems and working in Agile delivery environments.

$ ls -la ./projects

Projects

🎬
IMDB Conversational Voice Agent
Personal Project
View on GitHubOpen Source
LangGraphOpenAI GPT-4oChromaDBSQLiteStreamlitWhisperTTSPython

A production-grade GenAI-powered conversational agent for exploring IMDB's Top 1000 movies. The system features intelligent query routing that automatically classifies queries as structured (SQL), semantic (vector search), or hybrid, then executes the best strategy and synthesizes a grounded response.

📚
RAG PDF Assistant
Personal Project
View on GitHubOpen Source
FastAPIFAISSOllamaPyPDFPythonGitHub ActionsPytestRuff

A local Retrieval-Augmented Generation (RAG) system that lets users upload PDF documents and ask natural-language questions using a locally running LLM via Ollama — no paid APIs required. Built with a production-style FastAPI backend, FAISS vector store, and a clean CI pipeline.

$ cat ./education

Education

University of North Texas logo
University of North Texas
Master's — Information Systems & Technology
Aug 2022 – May 2024
Certifications
AWS Certified Solutions Architect • HashiCorp Terraform Associate • Microsoft Azure Fundamentals (AZ-900)
$ cat ./skills.txt

Skills

Total: 132 tools
Programming & Backend
15 items
PythonGoJavaScalaFastAPIFlaskDjangoSpring BootREST APIsgRPCGraphQLWebSocketsSQLNoSQLSQLAlchemy
$ echo "hello"

Get In Touch

Fast ways to reach me
Let's talk
Email
Niv*******@gmail.com
Phone
+1 (469) 605-****
Send a quick note
Reach out from here
Your name *
Your email *
Your phone
Reason
Message *
This form submits in-page (no redirect). Click on Submit once and you're done.
exit session