My AI Portfolio — Evidence of Daily AI Work | Original

Home 2026.06

I don’t just talk about AI — I use it every day, at scale. This post is a visual portfolio of my AI effort: the tools I’ve built, the tokens I’ve consumed, and the certifications I’ve earned.


🖥️ LLM Training & Inference — My Hardware Setup

Built my machine learning workstation in 2023 and have been training and learning ever since.

Hardware experience:

GPU VRAM Experience Where
NVIDIA RTX 4070 12 GB 3 years Home workstation
NVIDIA H200 141 GB 3 months RunPod / DigitalOcean
AMD MI300X 192 GB HBM3 3 months AMD Developer Cloud

What I’ve trained:

The workstation:

My ML learning station — built in 2023, RTX 4070 12GB, used for daily training and experiments

AMD Developer Cloud — MI300X 192GB HBM3:

AMD Dev Cloud — MI300X instance for large-scale model training


🧠 Enhanced nanoGPT — My Fork

Forked karpathy/nanoGPT and extended it with additional dataset pipelines, scaled training configs, and inline shape annotations for learning. 45 commits, Nov 2025 – Apr 2026.

New dataset pipelines:

Dataset Path Description
FineWeb-Edu data/fineweb/ HuggingFace FineWeb-Edu (10B+ tokens). Shard-based loading, chunked processing, incremental train/val splits.
OpenWebText 10k data/openwebtext_10k/ Quick 10k-subset for fast iteration.
Wikipedia Local data/wikipedia_local/ Tokenize local plain-text dump directly (no HuggingFace download).

Training configs added:

Config Target Notes
train_fineweb.py 125M on FineWeb Tuned for RTX 4070 12 GB (n_embd=384, dropout=0.1).
train_fineweb1_5b.py 1.5B on FineWeb For H200 80 GB.
train_fineweb_gpt3.py GPT-3 style 10B tokens Shard-based loader, wider schedule.
train_fineweb_760m.py 760M on FineWeb For MI300X 192 GB HBM3.
train_gpt2_200m.py GPT-2 200M General-purpose mid-size config.
train_gpt2_200m_smoke.py Smoke test Quick 200M sanity check (~few min).

Model changes:

Enhanced nanoGPT — 45 commits, dataset pipelines, scaled training configs, inline shape annotations

GitHub: lzwjava/nanoGPT


📊 LLM API Usage — The Numbers

OpenRouter — Past Year

1.15B tokens consumed, $239 spend, 155K API requests across multiple models.

OpenRouter Activity Dashboard — 1.15B tokens, $239 spend, 155K requests over 1 year

OpenRouter Model Spend Breakdown — Claude 4 Sonnet $44.40, Claude 3.5 Sonnet $9.67, Grok 3, Mistral, Kimi

OpenRouter Token Usage by Model — MiniMax 240M, Gemini 203M, DeepSeek 110M

Claude API via SSSAICode — April 2026

$171.53 in one month. 2,555 requests. 115M+ tokens. 90.9% cache hit rate.

SSSAICode Claude Usage — Opus 4.6, Opus 4.7, Sonnet 4.6, Haiku 4.5

Xiaomi MIMO Subscription — 500 Million Tokens Used

Pro Monthly Plan with 38B main quota + 8.75B compensation quota (~4.6B free credit). 500M tokens consumed so far.

Xiaomi MIMO Pro Plan — 500M tokens consumed, ~4.6B free credit remaining

Summary

Platform Tokens Period Cost
OpenRouter 1.15B Past year $239
SSSAICode (Claude) 115M+ April 2026 $171.53
Xiaomi MIMO 500M Current plan Free 4.6B credit
Others (GitHub Copilot, etc.) 500M Past year
Total ~2.3B+ Past year

🏢 Enterprise AI Usage — HSBC Bank

At HSBC Bank (via TEKsystems), I built an autonomous AI agent layer on top of GitHub Copilot to automate scripting, logging, documentation, and testing.

What I built:

Results:

Image Source: GitHub Copilot — Visual Studio Code Marketplace

HSBC AIPlayer Contribution Award


🎤 AI Talk at HSBC — From Neural Networks to Agents

Gave a technical talk to 80 participants at HSBC Bank — senior consultants, specialists, associate directors, software engineers, and contractors.

Talk: “From Neural Networks to Agents” — a journey from the simplest neural network (y = wx) through MNIST, Transformers, GPT, nanoGPT, to building personal AI agents.

What I covered:

Feedback:

Slides: Built with Claude Code & Marp, from my public AI response notes.


🛠️ ww — Cross-Platform CLI Toolkit

ww is my flagship CLI toolkit — 255+ commits, 10+ command groups, cross-platform (macOS + Linux). It covers git workflows with AI commit messages, note management, image/PDF processing, web search, GitHub Copilot chat, system utilities, and LLM-powered helpers.

lzwjava@lzw-mac ww % uv run ww --help
Usage: ww <group> [command] [options]

Action:
  ww action [workflow.yml]  Trigger a GitHub Actions workflow

AMD Dev Cloud:
  ww amd-dev-cloud snapshots    List snapshots
  ww amd-dev-cloud start-train  Create GPU droplet for training
  ww amd-dev-cloud end-train    Snapshot and destroy a GPU droplet

Copilot:
  ww copilot auth           Authenticate via GitHub OAuth
  ww copilot chat           Chat with a Copilot model

Git:
  ww git gpa            Git pull --all for all repos
  ww git squash <n>     Squash last n commits
  ww git amend-push     Amend last commit and force push

LLM:
  ww llm compare <prompt>   Compare multiple LLM responses
  ww llm query <question>   Query local RAG documents

Note:
  ww note              Clipboard to note (fast capture)
  ww note process      Drain the note queue
  ww note watch        Auto-process daemon

Screenshot:
  ww screenshot              Capture and create a note
  ww screenshot interact-note  Interactive screenshot note

255+ commits. 10+ command groups. Cross-platform (macOS + Linux).

ww — Cross-platform CLI toolkit on GitHub

GitHub: lzwjava/ww


📝 jekyll-ai-blog — AI-Powered Blog Platform

jekyll-ai-blog is the source for lzwjava.github.io — a Jekyll blog enhanced with AI-powered automation. 10,000+ English posts, 10,000+ Chinese posts, 9,700+ AI answer notes. ~70,000 page views in the past month (Cloudflare Analytics).

lzwjava@lzw-mac jekyll-ai-blog % ls README.md
README.md

What makes it different from a standard Jekyll blog:

Scale:

Metric Count
English posts 10,264
Chinese posts 10,259
AI answer notes 9,794
Python scripts 323
ML scripts 191
Page views (past month) ~70,000

jekyll-ai-blog — AI-powered blog with 10K+ posts, translation, TTS, and PDF pipelines

GitHub: lzwjava/jekyll-ai-blog


🌳 Tree_Of_Thought — Worked with a High School Student on Tree-of-Thought Reasoning

Tree_Of_Thought is a friend’s project — an external Tree-of-Thought reasoning system for physics-heavy problem solving. Instead of relying on a model’s hidden chain-of-thought in one opaque completion, it turns reasoning into an explicit, inspectable, controllable tree with live state, scoring, pruning, and deterministic tool support.

The system combines a FastAPI service for long-lived reasoning sessions, a browser UI for inspecting and pruning branches, a node-level FSM and tree scheduler, a SymPy-backed skill layer for exact symbolic computation, and multi-model routing for planning, modeling, review, and evaluation.

My contribution (1 PR): Added an OpenAI-compatible requester and python-dotenv config so the system can connect to any OpenAI-compatible endpoint (local or cloud).

Context: I mentor a high school student who built this system. During a meeting, he walked me through the full architecture — the reasoning tree, the FSM-based review, the route-local incremental refinement. I introduced him to AI PhD researchers and helped him think about research direction. He’s now exploring physics problem-solving with LLMs, using tools like Codex (GPT-5.4) and building multi-agent collaborative coding systems.

Tree of Thought — terminal tree explorer with node inspection, frontier management, and branch pruning

GitHub: Cerynitius/Tree_Of_Thought


🤖 iclaw — Terminal AI Agent (REPL)

iclaw is a terminal AI agent that codes, searches, and runs commands autonomously — works on personal machines and locked-down enterprise ones. A minimal openclaw implementation, built as a plain Python CLI with no browser extensions or IDE plugins, powered by GitHub Copilot.

lzwjava@lzw-mac iclaw % iclaw

  ██  █████  ██       █████  ██   ██
  ██ ██      ██      ██   ██ ██   ██
  ██ ██      ██      ███████ ██ █ ██
  ██ ██      ██      ██   ██ ██████
  ██  █████  ███████ ██   ██  ███ ██

Available commands:
  /provider_model      Select and authenticate with the model provider
  /model               Select specific model from your provider
  /search              Web search (usage: /search <query>)
  /provider_search     Select the web search provider
  /proxy               Set HTTP/HTTPS proxy (usage: /proxy [url|off])
  /ca_bundle           Set CA bundle for HTTPS (usage: /ca_bundle [path|off])
  /log                 Set log verbosity (usage: /log [verbose|info])
  /copy                Copy last Copilot response to clipboard
  /read                Print file contents to terminal (usage: /read <path>)
  /clear               Clear conversation history
  /compact             Compact conversation history using LLM
  /export              Export full conversation history to JSON file
  /status              Show current settings
  /help                Show available commands
  /exit                Quit the REPL.

Key features:

iclaw — Terminal AI agent REPL with native tool calling

iclaw — Execution log showing autonomous coding and shell commands

GitHub: lzwjava/iclaw


⚙️ zz — Dataset Processing & Training Utilities

zz is a toolkit for ML training pipelines — dataset download, tokenization, extraction, and inference utilities. Used during GPT-2 124M training runs on RunPod H200, DigitalOcean H100, and home RTX 4070. Also hosted on Hugging Face.

lzwjava@lzw-mac zz % tree -L 1
scripts/
  download/     # Dataset download scripts (FineWeb, Wikimedia, HF mirrors)
  extract/      # Data extraction, tokenization, and renaming
  analysis/     # Training duration and metric evaluation
  deepseek/     # LLM inference scripts (DeepSeek-V2-Lite)
logs/           # Training logs and outputs
datasets/       # Downloaded dataset storage

Key capabilities:

zz on Hugging Face — dataset processing & training utilities, 22 commits, 3 contributors

GitHub: lzwjava/zz · Hugging Face: lzwjava/zz


🎓 Certificates

Machine Learning Specialization — DeepLearning.AI & Stanford University

Completed Nov 2023. Three courses: Supervised Machine Learning, Advanced Learning Algorithms, Unsupervised Learning, Recommenders, Reinforcement Learning.

Coursera Machine Learning Specialization Certificate — Zhiwei Li, Nov 2023

Deep Learning Specialization — DeepLearning.AI

Completed Dec 2023. Five courses: Neural Networks, Hyperparameter Tuning, Structuring ML Projects, CNNs, Sequence Models.

Coursera Deep Learning Specialization Certificate — Zhiwei Li, Dec 2023



Back Donate