LexiLingo

The AI English Tutor That Actually Understands You

TRACECAG · Real-Time Voice · Knowledge Graph · CEFR Assessment

Most language apps give you the same lesson regardless of who you are. LexiLingo builds a live knowledge graph of your concept gaps, diagnoses errors in real time, and generates personalized explanations — not templates.

Quick Start · Architecture · API Overview · TRACECAG

1. Project Overview

LexiLingo is a full-stack AI English tutoring platform built as a monorepo with 5 deployable services. It combines a Flutter mobile app, a FastAPI backend, a Python AI service, a React admin dashboard, and an MCP agent server.

Target users

User type	Description
Learner	English learner (A1–C2) using the mobile app for lessons, vocabulary, AI chat, voice practice
Admin	Content manager / super-admin using the dashboard to manage users, courses, and system config
Developer / Agent	IDE tool users and AI agents accessing the MCP server for localization and graph tooling

Core capabilities

Adaptive vocabulary — SM-2 spaced repetition with per-user ease factor (EF-adjusted intervals)
Structured courses — CEFR-tagged A1→C2 curriculum with units, lessons, and exercises
AI tutor (Lexi Chat) — Contextual conversation powered by the TRACECAG pipeline
Topic-based chat — Conversation practice across 68 topic catalogs (60 TraceCAG-generated, A1–C1, 16 categories)
Voice / pronunciation — Real-time dual-stream STT/TTS + HuBERT phoneme analysis with Vietnamese-specific error feedback
CEFR proficiency assessment — Multi-skill diagnostic (grammar, vocabulary, fluency)
Gamification — XP, leagues, achievements, shop, leaderboard, streaks
Knowledge graph — Live KuzuDB graph of learner concept mastery, updated per interaction
Admin dashboard — Full CMS for courses, vocabulary, users, analytics, and AI model config
Content feed — Books, news articles (BBC Learning English), podcasts, YouTube clips

2. Architecture Overview

The system is organized into 8 architectural layers:

Layer	Role	Primary tech
Flutter Mobile App	Cross-platform learner UI (iOS / Android / Web)	Dart 3.8, Flutter 3.24, Provider, GetIt
Backend API Service	REST API, auth, data persistence, business logic	Python 3.11, FastAPI, SQLAlchemy 2 async, PostgreSQL, Redis
AI Service	LLM orchestration, knowledge graph, voice pipeline	Python 3.11, FastAPI, LangGraph, KuzuDB, Faster-Whisper, Piper, HuBERT
Admin Dashboard	Content management and monitoring SPA	TypeScript, React 18, Vite, Zustand, Recharts
MCP Agent Server	IDE integration tools via Model Context Protocol	Python, MCP SDK, stdio transport
Infrastructure & Deployment	Container orchestration, API gateway, observability	Docker Compose, Kong Gateway, PostgreSQL, Redis, MongoDB
CI/CD Pipelines	Automated test, build, deploy, i18n sync	GitHub Actions
Documentation	Architecture docs, feature plans, i18n guides	Markdown

System diagram

Why TRACECAG over RAG?

	Traditional RAG	LexiLingo TRACECAG
Context source	Static document chunks	Live KuzuDB knowledge graph + Redis learner cache
Personalization	None — same docs for everyone	Per-user mastery scores, error history, CEFR level
Retrieval latency	Vector search on every turn	Pre-cached learner profile + graph BFS
Curriculum awareness	Zero	Prerequisite chains: "Past Simple → Past Perfect → Reported Speech"
Error diagnosis	Not possible	Dedicated Diagnose node maps errors to KG concept IDs

3. Key Features

Feature	Description	Main layer
TRACECAG AI Tutor	Responses grounded in learner's live knowledge graph via TraceCag pipeline	AI Service
CEFR Assessment	Multi-skill proficiency test (grammar 40%, vocabulary 30%, fluency 30%)	AI Service + Backend
Spaced Repetition	SM-2 algorithm with per-user ease factor and overdue priority queue	Backend
Voice Pronunciation	Dual-stream WebSocket: simultaneous STT (Whisper) + TTS (Piper) + HuBERT phoneme scoring	AI Service
Structured Curriculum	CEFR A1–C2 courses, units, lessons with XP rewards	Backend + Mobile
Gamification	XP, level-up, leagues, weekly challenges, achievements, in-app shop	Backend + Mobile
Topic Chat	AI conversations on 68 topic catalogs — 60 TraceCAG-generated scenarios (A1–C1, 16 real-world categories)	AI Service
Content Feed	Books, BBC Learning English articles, podcasts, YouTube clips	Backend + Mobile
Admin CMS	Full CRUD for courses, vocabulary, users + analytics and AI model config	Admin Dashboard
MCP IDE Tools	i18n key manager, local model handlers for developer workflow	MCP Server
Multi-language UI	7 locales: English, Vietnamese, Japanese, Korean, Chinese, French, Spanish	Mobile
Offline Support	sqflite local cache for vocabulary and progress	Mobile

4. Tech Stack

Area	Technology
Mobile	Flutter 3.24+, Dart 3.8, Provider, GetIt, sqflite, Dio/http
Frontend (Admin)	React 18, TypeScript 6, Vite, Zustand 5, Recharts, TanStack Query 5
Backend	Python 3.11, FastAPI 0.136+, SQLAlchemy 2 async, Alembic, Pydantic v2
AI Orchestration	LangGraph 1.2+, LangChain Core
LLM (Local)	Qwen3-1.7B(4-bit), LLaMA3-VI 3B (lazy-loaded), Ollama (qwen3 1.7b)
LLM (Cloud)	Google Gemini 2.5 Flash (primary cloud fallback), Groq (qwen3-32b)
Voice STT	Faster-Whisper (base–small, int8, CUDA)
Voice TTS	Piper TTS (en_US-lessac-medium.onnx)
Pronunciation	HuBERT-large-ls960-ft (Facebook), sentence-transformers
Knowledge Graph	KuzuDB 0.11+ (embedded graph DB)
Vector Embeddings	all-MiniLM-L6-v2 (sentence-transformers)
Primary DB	PostgreSQL 14+ (asyncpg)
Cache / Sessions	Redis 7
AI Logs / History	MongoDB (Motor async driver)
Auth	JWT (RS256, python-jose), Firebase Admin SDK, Google OAuth 2.0
API Gateway	Kong (rate limiting, auth, routing)
Container	Docker Compose (production multi-service stack)
CI/CD	GitHub Actions (ci.yml, cd.yml, crowdin-sync.yml)
i18n	Crowdin (synced via GitHub Action)
MCP	Model Context Protocol Python SDK (stdio transport)

5. Getting Started

Prerequisites

Tool	Version
Python	3.11+
Flutter	3.24+
Node.js	18+ (for admin dashboard)
PostgreSQL	14+
Redis	7+
Docker & Docker Compose	Latest (optional but recommended)

Option A — Docker (recommended)

git clone https://github.com/InfinityZero3000/LexiLingo.git
cd LexiLingo

# Copy and fill environment variables
cp .env.example .env # root-level env for compose
cp backend-service/.env.example backend-service/.env
cp ai-service/.env.example ai-service/.env

# Start all services (postgres, redis, mongodb, backend, ai-service)
docker-compose up -d

Option B — All services locally (no Docker)

bash scripts/start-all.sh

This script starts the backend, AI service, and Flutter web in the background, writing logs to logs/.

Manual setup per service

Backend API

cd backend-service
python -m venv venv && source venv/bin/activate
pip install -r requirements.txt
cp .env.example .env # fill DATABASE_URL, SECRET_KEY, etc.

createdb lexilingo
alembic upgrade head # run DB migrations

uvicorn app.main:app --reload --port 8000

AI Service

cd ai-service
bash setup.sh # creates venv, installs deps, copies .env
source venv/bin/activate
# fill .env: GEMINI_API_KEY, KUZU_DB_PATH, REDIS_URL, etc.
uvicorn api.main:app --reload --port 8001

Flutter Mobile App

cd flutter-app
flutter pub get
cp .env.example assets/env/.env # set API_BASE_URL

flutter run # default device
make run-web # Chrome
make run-ios # iOS simulator
make run-android # Android emulator/device

Admin Dashboard

cd admin-service
pnpm install
cp .env.example .env # set VITE_BACKEND_URL, VITE_AI_URL
pnpm dev # dev server at http://localhost:5173

6. Environment Variables

Copy .env.example in each service directory and fill in the required values.

Service	Key variables
`backend-service`	`DATABASE_URL`, `SECRET_KEY`, `REDIS_URL`, `ALLOWED_ORIGINS`
`ai-service`	`GEMINI_API_KEY`, `KUZU_DB_PATH`, `REDIS_URL`, `MONGODB_URI`, `BACKEND_SERVICE_URL`
`admin-service`	`VITE_BACKEND_URL`, `VITE_AI_URL`

7. Scripts

# Start / stop all services locally
bash scripts/start-all.sh
bash scripts/stop-all.sh

# Docker
docker-compose up -d
docker-compose down

# Flutter (via Makefile)
make run-web       # Chrome
make run-ios       # iOS simulator
make build-apk     # Android release

# Backend
alembic upgrade head   # run migrations
pytest tests/          # run tests

8. API Overview

All requests route through Kong Gateway (:80/:443 in production). Interactive OpenAPI docs at http://localhost:8000/docs (backend) and http://localhost:8001/docs (AI service).

Backend /api/v1/ — auth, users, courses, vocabulary, learning, progress, gamification, challenges, games, proficiency, admin, analytics, monitoring, content feeds (books/news/podcasts/youtube)

AI Service /api/v1/ — lexi chat (TraceCag), topic chat, STT/pronunciation, TTS, WebSocket dual-stream (/ws/stream), AI analytics, admin config

9. TRACECAG / Knowledge Graph

The AI tutor uses a LangGraph StateGraph (ai-service/api/services/trace_cag/) with 4 nodes: Diagnose → Retrieve → Ground → Generate. Each response is grounded in the learner's live KuzuDB concept graph rather than static documents. See docs/ARCHITECTURE.md for a detailed pipeline diagram.

TRACE-CAG Architecture

TRACE-CAG runs as a hierarchical, memory-first pipeline with three reuse tiers:

Tier	Mechanism	When triggered
L0 Exact Reuse	Normalized query + level key lookup	Identical query seen before
L1 Concept-State Reuse	Bucket lookup → PCC filter → candidate ranking	Near-hit on concept fingerprint
L2 Grounded Reconstruction	KG expand → retrieve evidence → diagnose → generate	Cache miss or unsafe reuse

Online routing extracts 5 features per query — Intent, Level, Seed Concepts, Neighborhood, Profile State — and passes them through the TRACE Gate / PCC Gate for admissibility checks (intent match, concept scope, level compatibility, freshness, profile epoch) before selecting the reuse tier.

Key locations:

Runtime graph DB: ai-service/data/kuzu_db/
Seed concepts: ai-service/data/kg/*.json (incl. 06_tracecag_topic_expansion.json — 4,040 nodes)
KG synthesis: ai-service/scripts/build_kg.py --force
Pipeline source: ai-service/api/services/trace_cag/
Codebase architecture graph: .understand-anything/knowledge-graph.json

10. Contributing

Contributions are welcome. See CONTRIBUTING.md for guidelines.

11. License

MIT License — see LICENSE.

Copyright (c) 2026 Nguyen Thang

Architecture Docs · Report Issue · Discussions

Built by InfinityZero3000

Name		Name	Last commit message	Last commit date
Latest commit History 1,030 Commits
.claude		.claude
.github		.github
admin-service		admin-service
ai-service		ai-service
backend-service		backend-service
deploy		deploy
docs		docs
flutter-admin		flutter-admin
flutter-app		flutter-app
gateway		gateway
mcp-server		mcp-server
scripts		scripts
.env.production.example		.env.production.example
.gitignore		.gitignore
.pr_agent.toml		.pr_agent.toml
.pre-commit-config.yaml		.pre-commit-config.yaml
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
docker-compose.local.yml		docker-compose.local.yml
docker-compose.production.yml		docker-compose.production.yml
docker-compose.yml		docker-compose.yml
vercel.json		vercel.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LexiLingo

The AI English Tutor That Actually Understands You

1. Project Overview

Target users

Core capabilities

2. Architecture Overview

System diagram

Why TRACECAG over RAG?

3. Key Features

4. Tech Stack

5. Getting Started

Prerequisites

Option A — Docker (recommended)

Option B — All services locally (no Docker)

Manual setup per service

Backend API

AI Service

Flutter Mobile App

Admin Dashboard

6. Environment Variables

7. Scripts

8. API Overview

9. TRACECAG / Knowledge Graph

TRACE-CAG Architecture

10. Contributing

11. License

About

Uh oh!

Releases 6

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

LexiLingo

The AI English Tutor That Actually Understands You

1. Project Overview

Target users

Core capabilities

2. Architecture Overview

System diagram

Why TRACECAG over RAG?

3. Key Features

4. Tech Stack

5. Getting Started

Prerequisites

Option A — Docker (recommended)

Option B — All services locally (no Docker)

Manual setup per service

Backend API

AI Service

Flutter Mobile App

Admin Dashboard

6. Environment Variables

7. Scripts

8. API Overview

9. TRACECAG / Knowledge Graph

TRACE-CAG Architecture

10. Contributing

11. License

About

Topics

Resources

License

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases 6

Contributors

Uh oh!

Languages