SubtitleAI v1.2.3 - Production Deployment Summary¶

Status: ✅ LIVE IN PRODUCTION URL: http://subtitles.internal (http://10.89.97.213) Current Version: v1.2.3 (worker) / v1.1.5 (web) Last Updated: 2025-11-25 Cluster: k3s (namespace: subtitleai)

What's Live¶

Core Features ✅¶

Video Upload: MP4, MKV, AVI support up to 2GB
Speech-to-Text: OpenAI Whisper (base model)
Multi-Language: 12 languages supported (EN, ES, FR, DE, IT, PT, RU, JA, KO, ZH, AR, HI)
Output Formats: SRT + ASS (Advanced SubStation Alpha)
Dual-Language Subtitles: Source + translation with perfect alignment (EN→ES tested ✅)
AI Translation: Qwen QwQ-32B via OpenRouter (~$0.0002 per video)
Style Profiles: 4 profiles implemented (default, learning, enhanced, accessibility)
Authentication: Supabase Auth with RLS policies
Job Management: Track progress, view history, download multiple formats

Infrastructure ✅¶

Web App: Next.js 16 (1 replica)
Image: 10.89.97.201:30500/subtitleai-web:v1.1.5
Enhanced UI with format badges (dual-language, style profile)
Worker: Python + Whisper + Celery (1 replica)
Image: 10.89.97.201:30500/subtitleai-worker:v1.2.3
Translation batch size: 10 segments
Individual fallback for perfect alignment
Poller: Database job monitoring (5s interval)
Redis: Task queue (StatefulSet)
LoadBalancer: 10.89.97.213
Uptime: Stable (production ready)

Performance ✅¶

Job pickup: <5 seconds
Transcription: ~4x real-time (5s video = 20s processing)
Translation: ~3.5 minutes for typical video (68 segments)
E2E workflow: ~4-5 minutes for dual-language subtitles

Phase 2 Completion Status¶

✅ COMPLETED & TESTED¶

Dual-Language Subtitles
Source + translation with perfect 1:1 alignment
Tested: English → Spanish ✅
Uses Qwen QwQ-32B open source model
Cost: ~$0.0002 per video (5x cheaper than Claude)
Stricter prompts prevent line skipping/merging
Individual fallback on batch mismatch ensures accuracy
ASS Format Support
Generates Advanced SubStation Alpha files
Dual-language ASS with separated Source/Translation tracks
Source: White, 44pt, MarginV=50 (higher positioning)
Translation: Cyan, 40pt, MarginV=10 (lower positioning)
Tested: Default style with dual-language ✅
Multi-Format Output
SRT + ASS generated for every job
Dual-language creates 2 files (_dual suffix)
Single-language creates 2 files (SRT + ASS)
UI displays all formats with badges
Translation Quality
Perfect alignment verified by user
No offset errors (fixed cascading alignment bug)
Handles short phrases correctly ("No", "Ah", "Yeah")
Graceful fallback when translation fails
UI Enhancements
Purple "Dual-Language" badge
Blue "Style Profile" badge (for ASS files)
Individual download buttons for each format
Shows file size and download counts

🔧 IMPLEMENTED BUT NOT TESTED¶

Single-Language ASS - Code exists, not user-tested
Style Profiles - All 4 implemented, only default tested:
default - Clean, readable (White, 48pt, bottom center)
learning - Larger text (52pt, thicker outline)
enhanced - Same as default, ready for emotion/entity colors
accessibility - High contrast yellow (56pt, top center)

Technical Implementation¶

Translation Model: Qwen QwQ-32B¶

Why Qwen? - Open source model via OpenRouter - Trained on 36 trillion tokens across 119 languages - Matches Claude/GPT quality in multilingual benchmarks - 5x cheaper than Claude 3.5 Haiku - No rate limits (paid tier)

Implementation Details: - Batch size: 10 segments per API call - Strict prompt: "You MUST return EXACTLY N translations" - Validation: Detects count mismatches - Fallback: Translates individually if batch fails - Cost tracking: ~$0.0002 per video (~$0.10/month for 500 videos)

Bug Fixes & Iterations¶

v1.1.0 - v1.1.4 (2025-11-24): - Fixed missing module exports - Fixed MIME type rejection (text/plain for ASS) - Fixed retry duplicate errors (upsert + cleanup) - Tried Gemini models (rate limits + 404 errors)

v1.2.0 - v1.2.3 (2025-11-25): - Switched to Qwen QwQ-32B - Fixed model ID (qwq-32b not qwq-32b-preview) - Critical: Fixed translation alignment offset bug - Problem: LLM returned 17/20 translations → 3-position offset - Solution: Stricter prompt + batch size 5 → individual fallback - Optimized: Batch size 5 → 10 (2x faster, still perfect alignment)

QA Test Results¶

Test Category	Status	Details
Authentication	✅ PASS	Routes protected, RLS policies working
Infrastructure	✅ PASS	All pods healthy
Database	✅ PASS	16 RLS policies, schema isolation
Upload	✅ PASS	Video storage pipeline verified
Job Creation	✅ PASS	Database records & foreign keys
Worker Processing	✅ PASS	Whisper transcription working
Dual-Language Translation	✅ USER TESTED	Perfect alignment (EN→ES)
Translation Fallback	✅ TESTED	Individual retry on mismatch
Multi-Format Download	✅ TESTED	SRT + ASS with badges
Job Retry	✅ TESTED	Upsert + cleanup working
Performance	✅ TESTED	~3.5min translation confirmed
Single-Language ASS	⚠️ NOT TESTED	Code complete, needs testing
Style Profiles	⚠️ PARTIAL	Only default tested
Error Handling	✅ PASS	Graceful degradation working

Overall: 11/13 PASS (85% coverage) - Core dual-language feature fully validated

Known Limitations¶

Style Profiles Not Fully Tested
All 4 profiles implemented but only default tested
Need to test: learning, enhanced, accessibility
Need to verify rendering in VLC/mpv
Single-Language Mode Not Tested
Code exists but not user-validated
Should work (same code path as dual-language)
Translation Languages
Only EN→ES tested
Other language pairs untested but should work
No Translation Preview
Cannot review translations before finalizing
No manual correction interface
No Cost Tracking
API costs not logged in database
No per-job cost visibility
Whisper Model Fixed
Using "base" model only
Cannot select small/medium for better quality

Deployment Commands¶

Check Status¶

kubectl get pods -n subtitleai
kubectl logs -n subtitleai -l app=subtitleai-worker -f

Update Worker¶

cd /root/projects/subtitleai
docker build -t 10.89.97.201:30500/subtitleai-worker:vX.X.X -f worker/Dockerfile worker/
docker push 10.89.97.201:30500/subtitleai-worker:vX.X.X
kubectl set image deployment/subtitleai-worker subtitleai-worker=10.89.97.201:30500/subtitleai-worker:vX.X.X -n subtitleai
kubectl set image deployment/subtitleai-poller subtitleai-poller=10.89.97.201:30500/subtitleai-worker:vX.X.X -n subtitleai
kubectl rollout status deployment/subtitleai-worker -n subtitleai

Update Web¶

cd /root/projects/subtitleai
docker build -t 10.89.97.201:30500/subtitleai-web:vX.X.X .
docker push 10.89.97.201:30500/subtitleai-web:vX.X.X
kubectl set image deployment/subtitleai-web subtitleai-web=10.89.97.201:30500/subtitleai-web:vX.X.X -n subtitleai
kubectl rollout status deployment/subtitleai-web -n subtitleai

Environment Configuration¶

Worker Environment Variables¶

SUPABASE_URL=http://10.89.97.214:8000
SUPABASE_SERVICE_KEY=<service_role_key>
OPENROUTER_API_KEY=<sk-or-v1-...>

K8s Secret¶

# OpenRouter API key stored in secret
kubectl get secret subtitleai-secrets -n subtitleai

Monitoring & Observability¶

Key Metrics to Watch¶

Translation Success Rate - Watch for "ERROR: Expected N translations, got M"
Translation Time - Should be ~3.5min for typical video
API Costs - Monitor OpenRouter usage
Worker Memory - Whisper model requires ~200Mi
Job Completion Rate - Track failed jobs

Logs¶

# Worker logs (transcription + translation)
kubectl logs -n subtitleai -l app=subtitleai-worker -f

# Poller logs (job pickup)
kubectl logs -n subtitleai -l app=subtitleai-poller -f

# Web logs (API requests)
kubectl logs -n subtitleai -l app=subtitleai-web -f

Future Enhancements¶

Immediate Testing Needed¶

Test single-language ASS mode
Test all 4 style profiles in video player
Test other language pairs (EN→FR, EN→DE, etc.)

Phase 3 Potential Features¶

VTT Format Support - WebVTT for web players
Whisper Model Selection - Let users choose base/small/medium
Translation Review Interface - Edit before finalizing
Cost Tracking - Log API costs per job in database
Batch Processing - Process multiple videos at once
Bazarr Integration - Auto-process media library
LLM Enhancement - Grammar cleanup, emotion detection
Custom Style Editor - Create/save custom ASS styles

Documentation¶

Deployment Guide: /root/projects/subtitleai/DEPLOYMENT_COMPLETE.md
Implementation Summary: /root/projects/subtitleai/PHASE2_IMPLEMENTATION_SUMMARY.md
Git Repository: https://github.com/jakecelentano/subtitlesv2

Production Status¶

Current State: ✅ PRODUCTION READY

Core dual-language feature fully tested and working
Translation alignment perfect (user confirmed)
Performance optimized (~3.5min)
UI enhanced with format badges
All critical bugs fixed
Code committed and deployed

Confidence Level: HIGH for dual-language mode, MEDIUM for untested features (single-language, style profiles)

Maintained By: Claude Code Last Updated: 2025-11-25