Projects & Experiments
My personal lab — side projects, proofs of concept, and experiments at the intersection of AI Agents and Data Governance.
Featured Builds

Sidekick — AI Co-Worker
A proof-of-concept AI co-worker for exploring LLM agents, RAG, memory, and project-scoped document workflows using LangChain and LangGraph.
Whether a local agent loop (worker + evaluator) with ChromaDB-backed RAG and per-project memory could serve as a genuine productivity tool, not just a demo.
Built a working multi-mode agent with full-file RAG retrieval, SQLite checkpoints, and LangSmith tracing. Validated the worker+evaluator pattern at scale.
Personal experiment / POC

SQL Agent Loop
Demonstrates that with a data dictionary and an agentic AI, you can answer any business question in natural language — even when column names are cryptic.
That metadata and governance are the true foundation for effective AI. Without good data definitions, the agent guesses wrong. With them, it reliably translates natural language to accurate SQL.
Validated the pattern end-to-end. The agent uses tools to inspect schema, read the dictionary, and run SQL in a loop — no hardcoded queries.
Personal experiment / POC

Agentic AI POC
Experimental proof of concept for agentic AI patterns — tool-augmented reasoning, agent orchestration loops, memory integration, and plan-and-execute workflows.
How an LLM chooses and invokes tools, receives results, and continues reasoning — without relying on framework abstractions.
Built a custom agent loop with tool registry, long-term memory, planner, and worker from first principles. Useful reference for understanding what frameworks abstract away.
Personal experiment / POC

Career Chatbot
AI agent that acts as a personal career assistant. Uses function calling, a retry loop, and an LLM-as-judge evaluator to stay grounded and in character.
The LLM-as-judge pattern for output quality control — a second model evaluates every reply and triggers a retry with feedback if it fails.
Deployed to Hugging Face Spaces. The evaluator pattern significantly improves response consistency and prevents hallucinations.
Personal experiment / POC

Data CoPilot
Local-first, chat-centric data copilot that routes messages to internal agents for profiling, harmonization, and governance document generation.
A scalable full-stack foundation: React SPA + FastAPI + Postgres (pgvector) + Redis + MinIO — learning each layer step by step.
Core infrastructure operational via Docker Compose. API, workers, and web UI scaffolded. Validates the architecture and data model before full implementation.
Personal experiment / POC

Meeting Mind
A local-first agentic platform that converts meeting transcripts into a time-aware knowledge base. Ingest any transcript format, extract structured ideas, decisions, and actions, then ask natural-language questions across your entire meeting history.
Whether meeting knowledge could be made persistent, searchable, and temporally aware — tracking how decisions and ideas evolve across meetings — without cloud dependency or an LLM API.
Built a full ingestion-to-retrieval pipeline: topic segmentation, structured extraction, temporal supersession linking, and a three-layer store (SQLite + JSON knowledge graph + lexical vector index). Works entirely offline with an optional LLM upgrade path.
Personal experiment / POC

IdeaForge
A multi-agent AI system that takes a raw business idea through a full pipeline — research, debate, business case, and product plan — and delivers production-ready documents. 17 specialized agents across 3 stages with durable human-in-the-loop checkpoints.
Whether 17 coordinated LLM agents working in parallel — each owning a distinct role — could reliably turn a single raw idea into a complete, production-ready business case and product plan.
Built a full multi-tenant platform: LangGraph orchestration, FastAPI backend, Stripe billing, BYOK key vault, Jinja2 web UI, and an MCP server for IDE integration. Pipeline runs end-to-end with durable interrupt/resume at every human checkpoint.
Personal experiment / POC

Asparagus Operations POC
Full data lake architecture built on free Google services: raw storage (Drive), AI-powered ETL (GPT-4o-mini), structured serving (Sheets), and browser-side analytics via DuckDB WASM — plus a full operations workflow for invoice entry and master data management.
A complete data lake — raw zone, AI ETL, structured serving, data catalog, and interactive analytics — can be built entirely on free Google services with no dedicated database, no data warehouse, and no cost per query.
Proved the concept: DuckDB WASM runs SQL in the browser querying data from Google Sheets, GPT-4o-mini extracts structured JSON from PDF/XML invoices, and the entire stack costs under $5/month vs $50–$1,000+ for AWS or Snowflake equivalents.
Personal experiment / POC

Agricultural Settlement Platform
Fully operational agricultural intake, quality, contract, and settlement platform built on Google AppSheet + Apps Script. Replaced manual spreadsheet-driven settlement with a structured, rule-based, auditable engine.
Enterprise architecture principles — governed data, controlled calculation engines, workflow automation — can be applied in low-code environments.
Production-deployed system handling multiple contract types, quality-based payout thresholds, advance payments, PDF generation, and automated email distribution.
Personal project

Local Image Generation POC
Ran Stable Diffusion XL locally to explore text-to-image, image-to-image, and inpainting pipelines — including device support (CUDA, MPS, CPU).
How local diffusion models work in practice: download, memory requirements, precision issues on Apple Silicon, and pipeline differences.
Working t2i, i2i, and inpainting pipelines. Key lesson: MPS requires float32; attention_slicing helps on limited VRAM.
Personal experiment / POC
Experiments & OSS
Interested in collaborating on a technical project?
I'm always open to discussing research prototypes, data infrastructure, or open-source tooling.
Get in Touchmail