rocm-chatterbox-whisper

scott 9b62fce5c5 [dev-fp16] Convert model weights to fp16 at load time

Converting t3/s3gen/ve to fp16 once at load time means:
- Warmup runs in fp16, covering the right dtypes for all real requests
- No per-call autocast casting overhead
- ~2x faster matrix ops and convolutions on RDNA 2 hardware

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

2026-04-05 20:34:33 -04:00
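
A minimal sketch of the load-time conversion described in the commit above, assuming PyTorch. The `convert_to_fp16` helper and the tiny `nn.Linear`/`nn.Conv1d` stand-ins for t3, s3gen, and ve are hypothetical, used only for illustration. They are not the repository's actual loading code.

```python
import torch
from torch import nn


def convert_to_fp16(modules: dict[str, nn.Module]) -> dict[str, nn.Module]:
    """Cast each module's floating-point weights to fp16 once, at load time."""
    # nn.Module.half() casts floating-point parameters and buffers in place
    # and returns the module; eval() switches it to inference mode.
    return {name: module.half().eval() for name, module in modules.items()}


# Hypothetical stand-ins for the real t3 / s3gen / ve submodules.
models = {
    "t3": nn.Linear(8, 8),
    "s3gen": nn.Conv1d(4, 4, kernel_size=3),
    "ve": nn.Linear(8, 2),
}

models = convert_to_fp16(models)

# Every weight is now stored in fp16, so no per-call autocast is needed.
dtypes = {name: next(m.parameters()).dtype for name, m in models.items()}
print(dtypes)
```

With the weights stored in fp16 and no autocast context, input tensors generally need to be cast to fp16 as well before they reach the model, otherwise PyTorch raises a dtype mismatch error.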

.gitea/workflows

Switch to inline cache to avoid registry blob size limits

2026-04-05 12:14:35 -04:00

.gitignore

Initial implementation: Chatterbox TTS with ROCm and Wyoming

2026-04-05 09:51:09 -04:00

config.py

Multi-pass warmup and smaller chunk_size to fix HA timeout

2026-04-05 15:04:46 -04:00

config.yaml

Multi-pass warmup and smaller chunk_size to fix HA timeout