Skip to content
View krishnakoushik225's full-sized avatar

Block or report krishnakoushik225

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
krishnakoushik225/README.md

Typing SVG


coding



Portfolio Resume


LinkedIn Email GitHub


GPA XP Roles Profile Views


About

Full-stack engineer with 3.7 years across healthcare (Optum), fintech (BNP Paribas), and academic platforms (USF) — and a parallel track in applied AI research. I build production systems end-to-end: API design, cloud deployment, observability, and on-call ownership.

On the AI side, I'm more interested in correctness and explainability than benchmark scores. Every system I build includes citation grounding, confidence thresholds, or structured failure modes — because a system that fails loudly is more useful in production than one that fails silently.

Currently exploring: multi-agent coordination patterns, vLLM inference optimisation, Rust for systems programming.


Tech Stack

Skills

Full stack breakdown
Layer Technologies
Languages C#, Java, Python, TypeScript, JavaScript (ES6+), C++, PowerShell
Backend ASP.NET Core, Spring Boot, FastAPI, Django, EF Core, Hibernate
Frontend React, Angular, Next.js, Redux Toolkit, TanStack Query, Tailwind CSS
AI & ML LangGraph, PyTorch, PEFT, LoRA, OpenAI GPT-4, Pinecone, HuggingFace, Ollama
Cloud Azure AKS, Azure Functions, Cosmos DB, AWS (EC2/S3/Lambda/SQS), Docker, Kubernetes
Data Apache Kafka, SQL Server, PostgreSQL, Redis, MySQL, Cosmos DB
Security JWT, OAuth 2.0, RBAC, Spring Security, ASP.NET Identity, Azure Key Vault
Tools Git, Postman, VS Code, IntelliJ IDEA, Jira, Linux/Bash, GitHub Actions

Impact at a Glance

🏥 Healthcare 🎓 Academic 🏦 Fintech 🤖 AI Research
3,500+ events/min Kafka 5,000+ daily users 30% fewer errors F1: 0.589 ECG benchmark
35% MTTR reduction 820ms→310ms API 20 hrs/week saved 60% fewer parameters
100K+ patient records 99.6% uptime 4 trading desks 2.1× train efficiency

What I've Shipped

🏥  Optum        Kafka pipelines at 3,500+ events/min · MTTR reduced 35% · 100K+ patient records
🎓  USF          5,000+ daily users · 820ms→310ms API latency · 99.6% uptime · 60% faster page load
🏦  BNP Paribas  4 trading desks · 30% fewer reconciliation errors · 20 hrs/week saved
🧠  ECG-PEFT     Wav2Vec2+LoRA: F1 0.589 / AUC 0.620 · 60% fewer trainable parameters
🤖  ResearchFlow  Cited answers in <30s · Stateful LangGraph retry on confidence threshold
🔒  ContextFlow  100% on-device inference · Zero cloud API calls · Sub-second on consumer hardware

Featured Projects

AI pipeline

ResearchFlow AI — Agentic LLM Research Platform

Language Last Commit

Self-correcting LangGraph agent: Planner → Decomposer → Search → Verifier → Synthesizer. Delivers citation-grounded answers in under 30 seconds via stateful graphs with conditional retry edges on confidence-scored results — not possible in a linear chain.

Python LangGraph FastAPI Tavily React TypeScript Ollama


ECG-PEFT Bench — Medical AI Benchmarking

Stars Last Commit

Benchmarked 4 PEFT strategies across 3 cardiac foundation models on 10K+ ECG segments. Wav2Vec2 + LoRA achieved best F1/AUC (0.589 / 0.620) with 60% fewer trainable parameters — structured model-selection report for clinical AI pipeline decisions.

Python PyTorch HuggingFace PEFT Wav2Vec2 HuBERT ECG-FM


DocuMind — RAG Document Intelligence

Last Commit Language

Pinecone-backed RAG platform with GPT-4 generation and citation grounding — every answer surfaces the exact source passage for auditable, verifiable multi-document Q&A with sub-second retrieval latency.

Python FastAPI Pinecone OpenAI GPT-4 React TypeScript


ContextFlow AI — Privacy-First Local LLM Chrome Extension

Last Commit

100% on-device inference via Ollama + Llama 3.2 — zero cloud API calls, no data leaves the browser. Manifest V3 service worker as persistent agent runtime with dynamic tool routing and sub-second response times.

TypeScript React Chrome Manifest V3 Ollama Llama 3.2 Vite Tailwind CSS


Workforce management for 500+ employees with optimistic locking for concurrent shift-assignment safety. GenDiff-PEFT — 2.1× training efficiency, 60% fewer parameters, FID within 8% of full fine-tuning. CLAP-AudioLDM — 15%+ CLAP alignment improvement via multi-candidate quality-aware selection.


Professional Experience

Company Role Period
University of South Florida Application Developer — .NET · React · Azure Aug 2024 – Present
Optum (UnitedHealth Group) Associate Software Developer — .NET · Angular · Azure Aug 2023 – Aug 2024
BNP Paribas Software Engineer — Spring Boot · React · AWS Feb 2023 – Jul 2023
Rinex Private Limited Full Stack Intern — Spring Boot · React Jun 2021 – Dec 2021

Education

Degree Institution GPA
M.S. Computer Science & Engineering University of South Florida 3.95 / 4.0
B.Tech. Computer Science & Engineering Amrita Vishwa Vidyapeetham 8.31 / 10

GitHub Activity

contribution snake

Open to SWE · AI/ML Engineer · Full-Stack Engineer roles — US-wide, available June 2026

View Portfolio Download Resume

Profile Views

Pinned Loading

  1. ecg-peft-benchmark ecg-peft-benchmark Public

    Benchmarking PEFT (LoRA vs adapters) for ECG segment classification using Wav2Vec2, HuBERT, and ECG-FM with full metric reporting and ablations.

    Jupyter Notebook 1

  2. APSRTC-Duty-Management-Portal APSRTC-Duty-Management-Portal Public

    Secure full-stack workforce management system built with Spring Boot, React, and PostgreSQL featuring JWT authentication, role-based access control, duty assignment, and leave approval workflows.

    JavaScript

  3. contextflow-ai contextflow-ai Public

    AI-powered Chrome extension that summarizes webpages, answers questions about page content, and explains highlighted text using a local LLM (Ollama + Llama 3.2).

    JavaScript

  4. DocuMind DocuMind Public

    Docu-Mind is an AI-powered document assistant that answers questions from PDFs using natural language. It combines RAG, GPT-4, and Pinecone to deliver fast, accurate, and verifiable responses throu…

    TypeScript

  5. langgraph-research-agent langgraph-research-agent Public

    Self-correcting AI research agent built with LangGraph, Tavily Search, Ollama, FastAPI, and React. Decomposes complex questions, retrieves web evidence, verifies relevance, and generates citation-g…

    Python

  6. GenDiff-PEFT-Efficient-Conditional-Diffusion-Optimization GenDiff-PEFT-Efficient-Conditional-Diffusion-Optimization Public

    Parameter-efficient optimization of conditional diffusion models using multi-resolution attention, classifier-free guidance ablation, and DDIM sampling — achieving 17% FID improvement with 85% redu…

    Jupyter Notebook