Madhav Suri

Hi, I'm

Madhav Suri

Computer Science & Mathematics Student | Software Engineer

The University of Texas at Dallas

I like finding practical problems and turning them into usable software. I build full-stack web applications with thoughtful product flows, backend APIs, data-driven systems, and AI integrations where they make the experience more useful. My focus is on building products that are clear to use, technically grounded, and designed around real workflows.

Building practical web applications across developer tools, fintech risk systems, and AI-assisted workflows.

Madhav Suri smiling at a restaurant table with string-light window decorations behind him
Madhav Suri, a portrait taken in a car on a sunny day
Madhav Suri presenting with his team at a competition, standing in a suit while a teammate speaks into a microphone

About Me

I'm a Computer Science and Mathematics student at UT Dallas interested in backend engineering, AI systems, developer tools, and machine learning. I like projects where the API layer, data model, AI logic, and frontend experience connect into one usable product. Recently, I've been building systems like RepoPulse Agent, RiskOS AI, and D.I.Y.A to strengthen my full-stack engineering, product thinking, and applied AI skills.

Outside of my own projects, I've worked on frontend features for a student platform at ACM UTD, built edge AI and computer vision systems at SoftBank Robotics, analyzed sensor data at Riddell, and led undergraduate research on LLM applications.

Based in Richardson, Texas
Studying B.S. CS & Mathematics
Graduating December 2027
Focus Backend · AI · Dev Tools
Open to SWE Internships

Academics

The University of Texas at Dallas

B.S. in Computer Science and Mathematics

Expected Graduation: December 2027 Richardson, TX

In Progress

Relevant Coursework

  • Data Structures and Algorithms
  • Object-Oriented Programming
  • Discrete Mathematics
  • Database Systems
  • Software Engineering
  • Operating Systems

Certifications

  • AWS Cloud Practitioner
  • Google Project Management Certificate
  • CAPM In Progress

Projects

Each one built end to end — API layer, data model, ML or AI logic, and interface — as a mini product case study.

Project 01

RepoPulse Agent

AI Codebase Intelligence & Safe Change Planning

Connect a GitHub repo, ask grounded questions, plan a change, review the AI's diff, and generate a final engineering report — nothing is applied without human approval.

RepoPulse Agent dashboard — sidebar with the agent workflow (Codebase Map, Ask Repo, Change Planner, Diff Review, Verification, Agent Runs) and the four-step Index → Ask → Plan & Diff → Verify & Approve overview

The Story

GitHub sits on top of millions of repositories — practically the entire open-source world — yet I was only ever using it the boring way: cloning code, scrolling through files, hunting through pull requests. One night, trying to ramp up on an unfamiliar codebase, I caught myself wishing I could just ask the repo what it did instead of reading it line by line.

That frustration turned into a goal: make GitHub something you could point real intelligence at — and not just my projects, but any repository on GitHub — without ever blindly trusting what an AI handed back.

So I built RepoPulse Agent. You connect a repo and it indexes the codebase, answers grounded questions with cited source context, plans changes, generates reviewable diffs, and routes everything through a human approval gate — all backed by 170+ unit tests over a deterministic core.

The result is a platform where you can interrogate any GitHub repo and trust the output: it never auto-merges code, every answer traces back to the files it came from, and each run turns into a shareable engineering report.

At its simplest, RepoPulse Agent lets you point an AI at a GitHub repository and trust what comes back. It is an AI codebase intelligence and safe change-planning platform: it indexes a repository, answers grounded questions with source context, plans code changes, generates reviewable diffs, requires human approval, and produces stakeholder-facing engineering reports — without ever automatically merging code.

Problem

AI coding tools either hallucinate answers about code they haven't read or silently apply changes nobody reviewed. RepoPulse Agent does neither: every answer cites the indexed source files it came from, and every proposed change passes a safety gate and an approve/reject workflow before anything happens.

Technical Depth

  • Deterministic repository indexing first, AI second — file classification, framework detection, and architecture mapping are rule-based and unit-tested, so answers ground in facts
  • Grounded Q&A with cited source context: multi-stage retrieval over the indexed tree, with confidence scores per answer
  • Safety gate that classifies and refuses destructive or review-bypassing requests before any AI call
  • Multi-provider AI layer (OpenAI gpt-4o-mini, Groq, Anthropic, deterministic mock) with budget guard, prompt caching, and per-call cost tracking

Key Features

  • End-to-end agent workflow: Index → Ask → Plan → Diff Review → Verify → Approve → Report
  • Change planner producing risk/test/rollback plans and per-file proposed diffs with approve/reject decisions
  • Sandbox verification that trial-runs only human-approved patches in an isolated clone, comparing baseline vs patched results — the original repository is never modified
  • Final Engineering Report summarizing the whole run (plan, diffs, approvals, verification, conclusion) with exportable Markdown
  • Agent run history and audit logging of every plan, diff, refusal, and approval decision
  • 170+ unit tests (Vitest) over the deterministic core: classification, retrieval, safety, sandbox, and cost controls

Why It Matters

Teams want AI leverage on unfamiliar codebases without losing review control. RepoPulse keeps humans responsible for every decision, makes weak AI output visible through verification gates, and turns each run into a shareable engineering record — the safe-workflow pattern real engineering orgs need before trusting AI with code.

Tech Stack

  • Next.js
  • TypeScript
  • React
  • GitHub API
  • OpenAI
  • Tailwind CSS
  • Vitest
Project 02

RiskOS AI

AI-Assisted Fraud Operations Console for Fintech Risk Teams

A fraud operations console that simulates transaction risk review: deterministic risk rules and an ML scoring pipeline flag suspicious activity, and human analysts review, approve, or override every decision.

RiskOS AI fraud operations dashboard — KPI cards for transactions processed, flagged alerts, critical cases and confirmed fraud, a 7-day risk trend chart, recent critical investigations, and the analyst review queue
111Reliability tests
0.901Model ROC-AUC
82.6%Fraud detection recall
94.3%Automation coverage

The Story

RiskOS AI was built at PaySim, a fintech hackathon where every team was handed the same starting point: a raw dataset of roughly 6.3 million simulated financial transactions and one open-ended prompt — build an application from it.

Most teams reached straight for charts and dashboards. I kept circling back to a different question: what does this data actually represent? It's the bloodstream of a bank — and somewhere in those millions of rows, fraud is hiding in plain sight. I wanted to build something that mirrors how bank fraud detection really works, not just visualize it.

So instead of another analytics view, I built an end-to-end fraud operations console: a deterministic rules engine and an ML risk-scoring pipeline flag suspicious activity, and a human analyst reviews, approves, or overrides every decision.

The result is an explainable, auditable system — every score traces back to the rules and features behind it, every action is logged, and it reflects how a real bank actually defends against fraud rather than treating it as a black box.

At its simplest, RiskOS AI watches a stream of transactions and helps a fraud team decide which ones to trust. Under the hood it is a full risk orchestration system: every transaction is scored by a deterministic fraud rules engine and a Logistic Regression model, combined into a hybrid score that routes flagged cases into an analyst investigation workflow.

Problem

Financial institutions need to detect suspicious transactions without relying on fully automated black-box decisions. Rule-only systems create false positives; AI-only systems are hard to audit. RiskOS AI is designed around explainability, human-in-the-loop review, and operational safety — the system recommends, people decide.

Technical Depth

  • Hybrid scoring: rule-based signals and ML predictions are combined rather than replaced — every score is traceable to the rules and features that produced it
  • RBAC-protected FastAPI endpoints with distinct Analyst and Manager roles, so overrides and approvals are permission-gated
  • Audit logging on every decision and workflow action — investigations are reconstructable after the fact
  • Model evaluation built into the product: confusion matrix, ROC-AUC, precision/recall, and false-positive analysis views (precision 55.4%, F1 0.663 on the seeded dataset — reported honestly, tuned for recall)
  • Data modeling with SQLAlchemy ORM over SQLite; 94.3% seeded automation rate for repeatable demos
  • CI/CD test workflow on GitHub Actions running the 111-test Pytest suite

Key Features

  • Deterministic fraud rules engine and ML risk-scoring pipeline for transaction monitoring
  • Investigation workflow for flagged transactions with approve/override decisions
  • Fraud analytics dashboard with KPI, trend, and review-queue views
  • Notification workflow with Twilio SMS integration
  • Developer console for scenario testing against the rules engine

Why It Matters

Fraud operations live or die on auditability and trust. RiskOS AI shows how a real fintech risk console balances rule-based and ML signals while keeping analysts in control, logging every decision, and gating actions behind role-based access — the kind of explainable, reviewable system financial teams can actually defend.

Tech Stack

  • Python
  • FastAPI
  • React
  • SQLAlchemy
  • SQLite
  • Machine Learning
  • Pytest
  • GitHub Actions
  • Twilio
  • REST APIs
Project 03

D.I.Y.A

AI Academic Workflow Platform with Human-in-the-Loop Review

An AI platform that triages student questions through a human-in-the-loop pipeline — classifying, deduplicating, RAG-grounding and confidence-scoring each one, then escalating only the uncertain ones to the professor.

D.I.Y.A landing page — 'The bridge between students and professors,' an AI-powered academic support platform that connects a class with AI-assisted forums, instant assignment feedback, and smarter office hours
42Reliability tests
7AI workflow stages
13Secure data models
$2/moCost-controlled AI

The Story

I didn't start D.I.Y.A — I inherited it. I was brought onto the project after an intern stepped away mid-build, and what I picked up was a half-finished codebase I hadn't written: a strong idea wrapped around real problems. The task was simple to say and hard to do — make it production-ready, and earn the team's trust while doing it.

So I started where trust is earned fastest: the truth. Before changing a line, I ran a full audit, and it surfaced things that mattered — a critical access-control hole where any logged-in user could read or write any class's data, a fresh clone that crashed on the very first setup step, raw database errors leaking to the browser, and zero tests anywhere.

Then I delivered, in priority order. I locked authorization down so professors reach only their own classes and students only the ones they joined, replaced the leaking errors with a clean handler, and got the project running from a fresh clone in one command. From there I hardened the AI: every model call now runs through a guarded pipeline with deterministic fallbacks and real spend caps, so the public demo can't burn API credits.

The result is a platform that went from inherited-and-unrunnable to deployed, tested, and cost-capped — with a one-click demo a recruiter can open and use. I earned trust the only way that lasts: by shipping something that works.

Problem

Professors drown in repetitive questions while the ones that actually need a human get buried. Fully automating answers is unsafe and unauditable; doing it all by hand doesn't scale. D.I.Y.A is built around human-in-the-loop review — the AI drafts an answer, scores its own confidence, and only escalates what it's unsure about; the professor approves, rejects, or overrides, and every AI answer stays marked "pending review" until they do.

Technical Depth

  • Role-based access control enforced on every group-scoped route — professors are scoped to classes they own, students to classes they joined (this closed a critical cross-tenant data hole)
  • AI workflow orchestration: each question is classified → checked for duplicates → grounded with RAG → drafted → confidence-scored → routed or escalated → recorded as a workflow item
  • Retrieval-augmented generation over uploaded course documents using local 384-dim embeddings (all-MiniLM-L6-v2) — no external vector database
  • Defense-in-depth AI cost controls: a master enable switch, daily/monthly USD budget caps computed from a usage ledger, and per-user/per-IP rate caps — the deployed demo is bounded to ~$2/month
  • Observability: every model call logs latency, tokens, and estimated cost to an admin dashboard with live spend-vs-cap
  • 42-test node:test suite + GitHub Actions CI; a central error handler that never leaks stack traces; graceful keyless fallback so the app still works with no API key

Key Features

  • Student forum with AI-drafted answers held for professor review
  • Professor workflow queue surfacing only escalated, low-confidence questions with their scores
  • Confusion-cluster detection with interventions and tracked effectiveness
  • Approved-answer reuse library that auto-serves future duplicates
  • RAG knowledge base: upload PDFs/DOCX/notes; answers cite the source file
  • AI self-check grading and a live observability / cost dashboard

Why It Matters

D.I.Y.A is the pattern real teams need before they trust AI in production: human-in-the-loop review, role-based access, full auditability, and hard cost governance. But the part I'm proudest of isn't a feature — it's that I took over someone else's unfinished project, found the uncomfortable problems first, and turned it into something live and dependable. That's how you earn trust on a team.

Tech Stack

  • React
  • Vite
  • TypeScript
  • Node.js
  • Express
  • SQLite
  • JWT
  • Anthropic Claude
  • RAG
  • Transformers.js
  • GitHub Actions
  • REST APIs

Additional Project

Project 04Currently working on

Time Machine GitHub Analyzer

Developer Tool

An AI-powered GitHub analyzer that studies commit histories, file diffs, and repository structure to summarize how a codebase changed over time.

  • Analyzes commit history and file diffs
  • Identifies architectural decisions and recurring code patterns
  • Helps reduce manual codebase ramp-up time
  • Built with Python, LLM APIs, and REST workflows

Tech Stack

  • Python
  • LLM APIs
  • REST APIs
  • GitHub APIs

Links

Research

Research Team Lead — URAP

January 2026 — July 2026

Undergraduate Research Assistant Program, UT Dallas

Richardson, TX

  • Led research work involving LLM applications, sensor data, and real-world analysis.
  • Supported applied machine learning research in efficient AI systems and cross-domain human activity recognition.
  • Worked with Python, experimentation, implementation, and machine learning workflows.
  • Presented preliminary research findings at CTURC 2026 at UT Austin.

Experience

Frontend Developer

January 2026 — Present

ACM UTD

Richardson, TX

  • Engineered frontend features for D.I.Y.A, a full-stack academic support platform.
  • Built interfaces for office-hour scheduling, forum-based Q&A, self-check grading, professor analytics, authentication, dashboards, and AI-assisted support flows.
  • Integrated dynamic frontend views with REST APIs using React, Next.js, and TypeScript, working in a Git-based Agile environment.

Software Engineering Intern

August 2025 — December 2025

SoftBank Robotics America

Southlake, TX

  • Developed an edge AI system on NVIDIA Jetson hardware for real-time computer vision and on-device AI inference, building software pipelines that process live camera streams, analyze events, and generate insights directly on the device without relying on continuous cloud processing.
  • Implemented GPU-accelerated inference workflows, computer vision pipelines, and embedded Linux services to support low-latency processing and real-time decision making.
  • Integrated sensor and camera data, optimized system performance, and developed scalable software components for edge computing applications.
  • Technologies: NVIDIA Jetson, Python, C++, Computer Vision, Edge AI, Embedded Linux, CUDA, Real-Time Systems, API Development, System Monitoring.

Data Analyst — Part Time

June 2024 — August 2024

Riddell

Irving, TX

  • Built Python and Excel VBA workflows to process 5,000+ helmet impact sensor readings per session.
  • Reduced manual data preparation time by 35% and improved dataset accuracy by 28%.
  • Created dashboards and visualizations using Matplotlib, Excel, and Tableau.

Skills

Strongest first — the badges are what I reach for daily.

Backend

  • Node.js
  • PostgreSQL
  • Java
  • C++
  • Go
  • Bash
  • API Design
  • Microservices

Frontend

  • JavaScript
  • HTML
  • Firebase
  • Dart

AI & Data

  • Data Analysis
  • Tableau
  • Power BI
  • Matplotlib
  • Excel VBA

Tools & Systems

  • Kubernetes
  • Jenkins
  • Linux/Unix
  • Cloud Computing
  • Agile/Scrum
  • Data Structures & Algorithms

Resume

My full resume, embedded below. You can scroll through it here, or open it full-screen / download a copy.

Your browser can't display the embedded PDF. Open the resume in a new tab.

Contact

Open to software engineering internships and interesting problems. Email is the fastest way to reach me.