Commit Graph

69 Commits

Author SHA1 Message Date
styagi130
f1d129d835 Feat: batch support for onnx and triton compatibility (#239)
* Added batch support

Signed-off-by: Siddharth Tyagi <siddhartht@nvidia.com>

* file rename

Signed-off-by: Siddharth Tyagi <siddhartht@nvidia.com>

---------

Signed-off-by: Siddharth Tyagi <siddhartht@nvidia.com>
Co-authored-by: Siddharth Tyagi <siddhartht@nvidia.com>
2025-07-25 17:49:52 -07:00
ArEnSc
d7654ba4e8 Feature/get set voice url (#236)
* add voice data url getter and setter

* Ok Voices.js removing types

* Match the function style.
2025-07-09 14:56:34 -07:00
Kartikeya Mishra
2760831139 feat: set custom cache directory (#227) 2025-06-30 13:52:07 -07:00
Eric Izoita
4b647d371e Fix a couple minor warnings (#210)
* use non-deprecated import path of weight_norm

* don't pass dropout to single layer LSTM

per the docs, dropout is not applied to the last LSTM layer, so there is
no effect to passing it when `num_layers=1`.
2025-06-30 13:50:50 -07:00
Joshua Lochner
2668b2e279 [kokoro-js] Expose wasmPaths environment variables to users (#190)
* Bump dependency versions

* Expose wasmPaths from transformers.js

* Update syntax to be bundler friendly

* Bump transformers.js version

* Bump demo versions

* Update vite config

* [version] Update to 1.2.1
2025-05-03 09:31:18 -07:00
fondoger
6d87f4ae7a Enable MPS GPU Acceleration on MacOS (#164)
* Enable MPS GPU Acceleration on MacOS

* Fix
2025-04-10 11:40:25 -07:00
hexgrad
1c7bdd971d Bump to 0.9.4 (#169) 2025-04-05 15:00:22 -07:00
Omar Irfan Khan
4f5106e327 adding instructions for setting up espeak-ng on windows (#143) 2025-04-01 10:54:49 -07:00
Ash
e44c9b4add Update README.md (#154)
Sorted lang_code tickers in alphabetical order above the input. Previously they were oddly spread across different steps instead of being sorted and placed directly above the input.
2025-04-01 10:54:14 -07:00
Michael Currin
26039de2dc docs: Add missing setup step in README.md (#145)
* docs: Add missing setup step in README.md

* docs: fix README.md
2025-03-25 11:00:52 -07:00
hexgrad
e43d62643e Remove scipy (#139)
* Remove scipy

* No longer need to replace T

* Update README.md

* Remove numpy version lock

* Update README.md

* Update uv.lock
2025-03-18 11:16:34 -07:00
hexgrad
3f9dd88d6f Bump to 0.8.4 (#120)
* Bump to 0.8.4

* Update README.md
2025-02-28 18:49:08 -08:00
hexgrad
790ecc9c83 Bump to 0.8.3 (#119) 2025-02-28 18:05:27 -08:00
szsteven008
c87df60d4c add onnx export.py (#112)
* Add files via upload

onnx export

* Add files via upload

KModelForONNX

* Add files via upload

* Delete export.py

* Add files via upload

* Add files via upload

Fix errors in Chinese

* Add files via upload

Add duration output
2025-02-28 11:01:34 -08:00
hexgrad
b15ef354b2 Bump to 0.8.2 (#117) 2025-02-27 07:22:30 -08:00
hexgrad
ece280bdcd Bump to 0.8.1 2025-02-26 17:57:41 -08:00
hexgrad
3a721cce9f Critical fix to 0.8.0 2025-02-26 17:55:43 -08:00
hexgrad
efa91a8a3f Match misaki==0.8.0 dev branch (#114)
* Match misaki==0.8.0 dev branch

* en_callable, speed callable
2025-02-26 17:30:50 -08:00
Kirill R.
52f7eb740b Add Result.text_index to be able to map segments to paragraphs (#111)
* Add Result.text_index to be able to map segments to paragraphs

* Fix speed re: #105
2025-02-23 08:30:25 -08:00
Alessandro Saccoia
2dd9df6779 Fix: add text chunking for non-English language pipeline (#105)
Co-authored-by: Your Name <your.email@example.com>
2025-02-19 18:15:21 -08:00
Adrian Lyjak
e648c0605a Add additional onnx compat for https://github.com/pytorch/pytorch/issues/92977 (#104) 2025-02-18 11:06:48 -08:00
etrotta
cd7afb5c12 Add a CLI interface (#102)
* Add a CLI interface and update packaging configuration

* Support multiple lines in stdin

---------

Co-authored-by: Eric Trotta <eric.oliveira@magva.com.br>
2025-02-17 21:07:21 -08:00
Joshua Lochner
5229a254b7 Kokoro.js v1.2.0: Streaming support (#92)
* Set up JS project

* Finalise JS library

* Update README

* Fix package.json repository url

* Rename package -> `kokoro-js`

* Fix samples in README

* Cleanup README

* Bump `phonemizer` version

* Create web demo

* Run prettier

* Link to model used in demo

* Enable multithreading in HF space demo (~40% faster)

* Add link to demo in README

* Bump to v1.0.1

* Update voices

* Update versions

* Update phonemize JSDoc

* Use updated voice pack

* Update versions

* Update demo (v1.0 & WebGPU support)

* Update README

* Enforce maximum number of tokens

* Update README

* [version] Update to 1.1.1

* Create simple sentence splitter

* Update `npm run test`

* Update API to use sync and async iterators

* Add support for streamed generation in kokoro.js

* Always split on newlines

* Remove debug line

* Improvements

* Add more matching punctuation marks

* Update comments

* nits

* Export TextSplitterStream too

* Update splitter.js

* Update README

* [version] Update to 1.2.0
2025-02-15 11:06:33 -08:00
Adrian Lyjak
93abff8795 Modify model for ONNX compatibility (#87) 2025-02-15 11:05:57 -08:00
hexgrad
ce71a10c57 Bump to 0.7.16 (#94) 2025-02-14 22:49:19 -08:00
Thien Tran
84d64f02d3 replace np.prod() with math.prod() to make Kokoro torch.compile-able (#91)
* replace np.prod() with math.prod()

* another np.prod()
2025-02-14 22:48:51 -08:00
RobViren
330d110c05 Allow pipeline to take a voice style tensor directly. (#93) 2025-02-14 22:48:08 -08:00
hexgrad
1145c0b7f6 Bump to 0.7.15 (#83) 2025-02-11 22:54:42 -08:00
hexgrad
bd44d79895 Bump to 0.7.14 (#82) 2025-02-11 14:08:57 -08:00
CarelessParsley
f77e52fb4c Port HuggingFace Space to plain Gradio (#81)
* Port HuggingFace Space to plain Gradio

* Update app.py

Remove BANNER_TEXT

---------

Co-authored-by: Careless Parsley <carelessparsley@gmail.com>
Co-authored-by: hexgrad <166769057+hexgrad@users.noreply.github.com>
2025-02-11 13:04:59 -08:00
hexgrad
83c8883d32 Delete .DS_Store 2025-02-11 00:24:22 -08:00
hexgrad
fd62d70ec7 Bump to 0.7.13 (#75) 2025-02-10 08:54:11 -08:00
hexgrad
15108f11ba Bump to 0.7.12 (#67) 2025-02-07 23:09:34 -08:00
hexgrad
00f9cf977c Bump to 0.7.11 (#66) 2025-02-07 20:52:27 -08:00
Joshua Lochner
e0bf641def Update Kokoro.js: WebGPU support, v1.0 integration (#60)
* Set up JS project

* Finalise JS library

* Update README

* Fix package.json repository url

* Rename package -> `kokoro-js`

* Fix samples in README

* Cleanup README

* Bump `phonemizer` version

* Create web demo

* Run prettier

* Link to model used in demo

* Enable multithreading in HF space demo (~40% faster)

* Add link to demo in README

* Bump to v1.0.1

* Update voices

* Update versions

* Update phonemize JSDoc

* Use updated voice pack

* Update versions

* Update demo (v1.0 & WebGPU support)

* Update README

* Enforce maximum number of tokens

* Update README

* [version] Update to 1.1.1
2025-02-07 10:04:41 -08:00
hexgrad
31a2b6337b Bump to 0.7.9 (#57) 2025-02-05 16:05:31 -08:00
hexgrad
205ddd9377 Bump to 0.7.8 (#56) 2025-02-05 11:44:58 -08:00
hexgrad
2e5c856491 Bump to 0.7.6 (#54) 2025-02-04 23:20:38 -08:00
remsky
8cec8005b3 Add generate_from_tokens method, example (#53) 2025-02-04 23:18:53 -08:00
hexgrad
b9dbd72b27 Bump to 0.7.4 (#52) 2025-02-04 12:57:00 -08:00
hexgrad
ed4639ffdf Bump to 0.7.3 (#49)
* Bump to 0.7.3

* Update README.md
2025-02-03 22:38:30 -08:00
hexgrad
0922f3cbf9 Bump to 0.7.2 (#48) 2025-02-03 20:58:12 -08:00
hexgrad
6e1473fa89 Bump to 0.7.1 (#47) 2025-02-03 20:21:06 -08:00
hexgrad
43bc156514 Word level timestamps (#46)
* Use MToken

* ALIASES #43

* Typo: add missing comma

* Change conditions to is not None

* Finish WLTs
2025-02-03 18:27:12 -08:00
hexgrad
2f4d94bba2 Backward compatible KModel.Output and KPipeline.Result dataclasses (#40)
* Backward compatible KModel.Output and KPipeline.Result dataclasses

* Typo: Bool => bool

* Allow voice=None for quiet pipelines

* Specify class names

* Fixed and tested

* Update README.md
2025-02-02 11:05:25 -08:00
hexgrad
0abd867239 Bump versions for #35 (#37) 2025-02-01 13:37:16 -08:00
Farseen
87766f8d58 Readme: Add conda env file. (#33) 2025-02-01 09:30:20 -08:00
hexgrad
213f8a1b15 Fold requirements.txt into setup.py, bump to 0.3.3 (#31) 2025-01-31 19:03:29 -08:00
remsky
b6cb300d50 feat: integrate loguru for centralized logging, debugs, etc (#30)
* feat: integrate loguru for controllable logging, debug, etc

* Update pipeline.py

- Characters swapped out from under me by my IDE
2025-01-31 18:59:08 -08:00
hexgrad
396766a5b0 Remove emojis #28 and bump to 0.3.2 (#29) 2025-01-31 10:14:14 -08:00