kokoro

Author	SHA1	Message	Date
fondoger	6d87f4ae7a	Enable MPS GPU Acceleration on MacOS (#164 ) * Enable MPS GPU Acceleration on MacOS * Fix	2025-04-10 11:40:25 -07:00
hexgrad	1c7bdd971d	Bump to 0.9.4 (#169 )	2025-04-05 15:00:22 -07:00
Omar Irfan Khan	4f5106e327	adding instructions for setting up espeak-ng on windows (#143 )	2025-04-01 10:54:49 -07:00
Ash	e44c9b4add	Update README.md (#154 ) Sorted lang_code tickers in alphabetical order above the input. Before they were oddly stretched out over different steps rather than being sorted and directly above the input.	2025-04-01 10:54:14 -07:00
Michael Currin	26039de2dc	docs: Add missing setup step in README.md (#145 ) * docs: Add missing setup step in README.md * docs: fix README.md	2025-03-25 11:00:52 -07:00
hexgrad	e43d62643e	Remove scipy (#139 ) * Remove scipy * No longer need to replace T * Update README.md * Remove numpy version lock * Update README.md * Update uv.lock	2025-03-18 11:16:34 -07:00
hexgrad	3f9dd88d6f	Bump to 0.8.4 (#120 ) * Bump to 0.8.4 * Update README.md	2025-02-28 18:49:08 -08:00
hexgrad	790ecc9c83	Bump to 0.8.3 (#119 )	2025-02-28 18:05:27 -08:00
szsteven008	c87df60d4c	add onnx export.py (#112 ) * Add files via upload onnx export * Add files via upload KModelForONNX * Add files via upload * Delete export.py * Add files via upload * Add files via upload Fix errors with Chinese * Add files via upload Add duration output	2025-02-28 11:01:34 -08:00
hexgrad	b15ef354b2	Bump to 0.8.2 (#117 )	2025-02-27 07:22:30 -08:00
hexgrad	ece280bdcd	Bump to 0.8.1	2025-02-26 17:57:41 -08:00
hexgrad	3a721cce9f	Critical fix to 0.8.0	2025-02-26 17:55:43 -08:00
hexgrad	efa91a8a3f	Match misaki==0.8.0 dev branch (#114 ) * Match misaki==0.8.0 dev branch * en_callable, speed callable	2025-02-26 17:30:50 -08:00
Kirill R.	52f7eb740b	Add Result.text_index to be able to map segments to paragraphs (#111 ) * Add Result.text_index to be able to map segments to paragraphs * Fix speed re: #105	2025-02-23 08:30:25 -08:00
Alessandro Saccoia	2dd9df6779	Fix: add text chunking for non-English language pipeline (#105 ) Co-authored-by: Your Name <your.email@example.com>	2025-02-19 18:15:21 -08:00
Adrian Lyjak	e648c0605a	Add additional onnx compat for https://github.com/pytorch/pytorch/issues/92977 (#104 )	2025-02-18 11:06:48 -08:00
etrotta	cd7afb5c12	Add a CLI interface (#102 ) * Add a CLI interface and update packaging configuration * Support multiple lines in stdin --------- Co-authored-by: Eric Trotta <eric.oliveira@magva.com.br>	2025-02-17 21:07:21 -08:00
Joshua Lochner	5229a254b7	Kokoro.js v1.2.0: Streaming support (#92 ) * Set up JS project * Finalise JS library * Update README * Fix package.json repository url * Rename package -> `kokoro-js` * Fix samples in README * Cleanup README * Bump `phonemizer` version * Create web demo * Run prettier * Link to model used in demo * Enable multithreading in HF space demo (~40% faster) * Add link to demo in README * Bump to v1.0.1 * Update voices * Update versions * Update phonemize JSDoc * Use updated voice pack * Update versions * Update demo (v1.0 & WebGPU support) * Update README * Enforce maximum number of tokens * Update README * [version] Update to 1.1.1 * Create simple sentence splitter * Update `npm run test` * Update API to use sync and async iterators * Add support for streamed generation in kokoro.js * Always split on newlines * Remove debug line * Improvements * Add more matching punctuation marks * Update comments * nits * Export TextSplitterStream too * Update splitter.js * Update README * [version] Update to 1.2.0	2025-02-15 11:06:33 -08:00
Adrian Lyjak	93abff8795	Modify model for ONNX compatibility (#87 )	2025-02-15 11:05:57 -08:00
hexgrad	ce71a10c57	Bump to 0.7.16 (#94 )	2025-02-14 22:49:19 -08:00
Thien Tran	84d64f02d3	replace `np.prod()` with `math.prod()` to make Kokoro `torch.compile`-able (#91 ) * replace np.prod() with math.prod() * another np.prod()	2025-02-14 22:48:51 -08:00
RobViren	330d110c05	Allow pipeline to take a voice style tensor directly. (#93 )	2025-02-14 22:48:08 -08:00
hexgrad	1145c0b7f6	Bump to 0.7.15 (#83 )	2025-02-11 22:54:42 -08:00
hexgrad	bd44d79895	Bump to 0.7.14 (#82 )	2025-02-11 14:08:57 -08:00
CarelessParsley	f77e52fb4c	Port HuggingFace Space to plain Gradio (#81 ) * Port HuggingFace Space to plain Gradio * Update app.py Remove BANNER_TEXT --------- Co-authored-by: Careless Parsley <carelessparsley@gmail.com> Co-authored-by: hexgrad <166769057+hexgrad@users.noreply.github.com>	2025-02-11 13:04:59 -08:00
hexgrad	83c8883d32	Delete .DS_Store	2025-02-11 00:24:22 -08:00
hexgrad	fd62d70ec7	Bump to 0.7.13 (#75 )	2025-02-10 08:54:11 -08:00
hexgrad	15108f11ba	Bump to 0.7.12 (#67 )	2025-02-07 23:09:34 -08:00
hexgrad	00f9cf977c	Bump to 0.7.11 (#66 )	2025-02-07 20:52:27 -08:00
Joshua Lochner	e0bf641def	Update Kokoro.js: WebGPU support, v1.0 integration (#60 ) * Set up JS project * Finalise JS library * Update README * Fix package.json repository url * Rename package -> `kokoro-js` * Fix samples in README * Cleanup README * Bump `phonemizer` version * Create web demo * Run prettier * Link to model used in demo * Enable multithreading in HF space demo (~40% faster) * Add link to demo in README * Bump to v1.0.1 * Update voices * Update versions * Update phonemize JSDoc * Use updated voice pack * Update versions * Update demo (v1.0 & WebGPU support) * Update README * Enforce maximum number of tokens * Update README * [version] Update to 1.1.1	2025-02-07 10:04:41 -08:00
hexgrad	31a2b6337b	Bump to 0.7.9 (#57 )	2025-02-05 16:05:31 -08:00
hexgrad	205ddd9377	Bump to 0.7.8 (#56 )	2025-02-05 11:44:58 -08:00
hexgrad	2e5c856491	Bump to 0.7.6 (#54 )	2025-02-04 23:20:38 -08:00
remsky	8cec8005b3	Add generate_from_tokens method, example (#53 )	2025-02-04 23:18:53 -08:00
hexgrad	b9dbd72b27	Bump to 0.7.4 (#52 )	2025-02-04 12:57:00 -08:00
hexgrad	ed4639ffdf	Bump to 0.7.3 (#49 ) * Bump to 0.7.3 * Update README.md	2025-02-03 22:38:30 -08:00
hexgrad	0922f3cbf9	Bump to 0.7.2 (#48 )	2025-02-03 20:58:12 -08:00
hexgrad	6e1473fa89	Bump to 0.7.1 (#47 )	2025-02-03 20:21:06 -08:00
hexgrad	43bc156514	Word level timestamps (#46 ) * Use MToken * ALIASES #43 * Typo: add missing comma * Change conditions to is not None * Finish WLTs	2025-02-03 18:27:12 -08:00
hexgrad	2f4d94bba2	Backward compatible KModel.Output and KPipeline.Result dataclasses (#40 ) * Backward compatible KModel.Output and KPipeline.Result dataclasses * Typo: Bool => bool * Allow voice=None for quiet pipelines * Specify class names * Fixed and tested * Update README.md	2025-02-02 11:05:25 -08:00
hexgrad	0abd867239	Bump versions for #35 (#37 )	2025-02-01 13:37:16 -08:00
Farseen	87766f8d58	Readme: Add conda env file. (#33 )	2025-02-01 09:30:20 -08:00
hexgrad	213f8a1b15	Fold requirements.txt into setup.py, bump to 0.3.3 (#31 )	2025-01-31 19:03:29 -08:00
remsky	b6cb300d50	feat: integrate loguru for centralized logging, debugs, etc (#30 ) * feat: integrate loguru for controllable logging, debug, etc * Update pipeline.py - Characters swapped out from under me by my ide	2025-01-31 18:59:08 -08:00
hexgrad	396766a5b0	Remove emojis #28 and bump to 0.3.2 (#29 )	2025-01-31 10:14:14 -08:00
remsky	e74290bf5a	feat: add device examples and pipeline updates (#27 )	2025-01-31 10:09:48 -08:00
Temirulan	39ef7993ba	add possibility to load multiple audios with averaging (#21 ) Co-authored-by: Temirulan Mussayev <temirulan@deepinfra.com>	2025-01-30 15:22:32 -08:00
hexgrad	a09db51873	Remove audio embed from README.md	2025-01-29 15:59:50 -08:00
hexgrad	d3f05c498d	Synchronize README.md	2025-01-29 15:58:54 -08:00
hexgrad	3ab76a9817	Japanese and Mandarin Chinese (#20 )	2025-01-29 11:18:39 -08:00

64 Commits