fondoger
6d87f4ae7a
Enable MPS GPU Accerlation on MacOS ( #164 )
...
* Enable MPS GPU Accerlation on MacOS
* Fix
2025-04-10 11:40:25 -07:00
hexgrad
1c7bdd971d
Bump to 0.9.4 ( #169 )
2025-04-05 15:00:22 -07:00
Omar Irfan Khan
4f5106e327
adding instructions for setting up espeak-ng on windows ( #143 )
2025-04-01 10:54:49 -07:00
Ash
e44c9b4add
Update README.md ( #154 )
...
Sorted lang_code tickers in alphabetical order above the input. Before they were oddly stretched out over different steps rather than being sorted and directly above the input.
2025-04-01 10:54:14 -07:00
Michael Currin
26039de2dc
docs: Add missing setup step in README.md ( #145 )
...
* docs: Add missing setup step in README.md
* docs: fix README.md
2025-03-25 11:00:52 -07:00
hexgrad
e43d62643e
Remove scipy ( #139 )
...
* Remove scipy
* No longer need to replace T
* Update README.md
* Remove numpy version lock
* Update README.md
* Update uv.lock
2025-03-18 11:16:34 -07:00
hexgrad
3f9dd88d6f
Bump to 0.8.4 ( #120 )
...
* Bump to 0.8.4
* Update README.md
2025-02-28 18:49:08 -08:00
hexgrad
790ecc9c83
Bump to 0.8.3 ( #119 )
2025-02-28 18:05:27 -08:00
szsteven008
c87df60d4c
add onnx export.py ( #112 )
...
* Add files via upload
onnx export
* Add files via upload
KModelForONNX
* Add files via upload
* Delete export.py
* Add files via upload
* Add files via upload
修正中文的错误
* Add files via upload
增加duration的输出
2025-02-28 11:01:34 -08:00
hexgrad
b15ef354b2
Bump to 0.8.2 ( #117 )
2025-02-27 07:22:30 -08:00
hexgrad
ece280bdcd
Bump to 0.8.1
2025-02-26 17:57:41 -08:00
hexgrad
3a721cce9f
Critical fix to 0.8.0
2025-02-26 17:55:43 -08:00
hexgrad
efa91a8a3f
Match misaki==0.8.0 dev branch ( #114 )
...
* Match misaki==0.8.0 dev branch
* en_callable, speed callable
2025-02-26 17:30:50 -08:00
Kirill R.
52f7eb740b
Add Result.text_index to be able to map segments to paragraphs ( #111 )
...
* Add Result.text_index to be able to map segments to paragraphs
* Fix speed re: #105
2025-02-23 08:30:25 -08:00
Alessandro Saccoia
2dd9df6779
Fix: add text chunking for non-English language pipeline ( #105 )
...
Co-authored-by: Your Name <your.email@example.com >
2025-02-19 18:15:21 -08:00
Adrian Lyjak
e648c0605a
Add additional onnx compat for https://github.com/pytorch/pytorch/issues/92977 ( #104 )
2025-02-18 11:06:48 -08:00
etrotta
cd7afb5c12
Add a CLI interface ( #102 )
...
* Add a CLI interface and update packaging configuration
* Support multiple lines in stdin
---------
Co-authored-by: Eric Trotta <eric.oliveira@magva.com.br >
2025-02-17 21:07:21 -08:00
Joshua Lochner
5229a254b7
Kokoro.js v1.2.0: Streaming support ( #92 )
...
* Set up JS project
* Finalise JS library
* Update README
* Fix package.json repository url
* Rename package -> `kokoro-js`
* Fix samples in README
* Cleanup README
* Bump `phonemizer` version
* Create web demo
* Run prettier
* Link to model used in demo
* Enable multithreading in HF space demo (~40% faster)
* Add link to demo in README
* Bump to v1.0.1
* Update voices
* Update versions
* Update phonemize JSDoc
* Use updated voice pack
* Update versions
* Update demo (v1.0 & WebGPU support)
* Update README
* Enforce maximum number of tokens
* Update README
* [version] Update to 1.1.1
* Create simple sentence splitter
* Update `npm run test`
* Update API to use sync and async iterators
* Add support for streamed generation in kokoro.js
* Always split on newlines
* Remove debug line
* Improvements
* Add more matching puntuation marks
* Update comments
* nits
* Export TextSplitterStream too
* Update splitter.js
* Update README
* [version] Update to 1.2.0
2025-02-15 11:06:33 -08:00
Adrian Lyjak
93abff8795
Modify model for ONNX compatibility ( #87 )
2025-02-15 11:05:57 -08:00
hexgrad
ce71a10c57
Bump to 0.7.16 ( #94 )
2025-02-14 22:49:19 -08:00
Thien Tran
84d64f02d3
replace np.prod() with math.prod() to make Kokoro torch.compile-able ( #91 )
...
* replace np.prod() with math.prod()
* another np.prod()
2025-02-14 22:48:51 -08:00
RobViren
330d110c05
Allow pipeline to take a voice style tensor directly. ( #93 )
2025-02-14 22:48:08 -08:00
hexgrad
1145c0b7f6
Bump to 0.7.15 ( #83 )
2025-02-11 22:54:42 -08:00
hexgrad
bd44d79895
Bump to 0.7.14 ( #82 )
2025-02-11 14:08:57 -08:00
CarelessParsley
f77e52fb4c
Port HuggingFace Space to plain Gradio ( #81 )
...
* Port HuggingFace Space to plain Gradio
* Update app.py
Remove BANNER_TEXT
---------
Co-authored-by: Careless Parsley <carelessparsley@gmail.com >
Co-authored-by: hexgrad <166769057+hexgrad@users.noreply.github.com >
2025-02-11 13:04:59 -08:00
hexgrad
83c8883d32
Delete .DS_Store
2025-02-11 00:24:22 -08:00
hexgrad
fd62d70ec7
Bump to 0.7.13 ( #75 )
2025-02-10 08:54:11 -08:00
hexgrad
15108f11ba
Bump to 0.7.12 ( #67 )
2025-02-07 23:09:34 -08:00
hexgrad
00f9cf977c
Bump to 0.7.11 ( #66 )
2025-02-07 20:52:27 -08:00
Joshua Lochner
e0bf641def
Update Kokoro.js: WebGPU support, v1.0 integration ( #60 )
...
* Set up JS project
* Finalise JS library
* Update README
* Fix package.json repository url
* Rename package -> `kokoro-js`
* Fix samples in README
* Cleanup README
* Bump `phonemizer` version
* Create web demo
* Run prettier
* Link to model used in demo
* Enable multithreading in HF space demo (~40% faster)
* Add link to demo in README
* Bump to v1.0.1
* Update voices
* Update versions
* Update phonemize JSDoc
* Use updated voice pack
* Update versions
* Update demo (v1.0 & WebGPU support)
* Update README
* Enforce maximum number of tokens
* Update README
* [version] Update to 1.1.1
2025-02-07 10:04:41 -08:00
hexgrad
31a2b6337b
Bump to 0.7.9 ( #57 )
2025-02-05 16:05:31 -08:00
hexgrad
205ddd9377
Bump to 0.7.8 ( #56 )
2025-02-05 11:44:58 -08:00
hexgrad
2e5c856491
Bump to 0.7.6 ( #54 )
2025-02-04 23:20:38 -08:00
remsky
8cec8005b3
Add generate_from_tokens method, example ( #53 )
2025-02-04 23:18:53 -08:00
hexgrad
b9dbd72b27
Bump to 0.7.4 ( #52 )
2025-02-04 12:57:00 -08:00
hexgrad
ed4639ffdf
Bump to 0.7.3 ( #49 )
...
* Bump to 0.7.3
* Update README.md
2025-02-03 22:38:30 -08:00
hexgrad
0922f3cbf9
Bump to 0.7.2 ( #48 )
2025-02-03 20:58:12 -08:00
hexgrad
6e1473fa89
Bump to 0.7.1 ( #47 )
2025-02-03 20:21:06 -08:00
hexgrad
43bc156514
Word level timestamps ( #46 )
...
* Use MToken
* ALIASES #43
* Typo: add missing comma
* Change conditions to is not None
* Finish WLTs
2025-02-03 18:27:12 -08:00
hexgrad
2f4d94bba2
Backward compatible KModel.Output and KPipeline.Result dataclasses ( #40 )
...
* Backward compatible KModel.Output and KPipeline.Result dataclasses
* Typo: Bool => bool
* Allow voice=None for quiet pipelines
* Specify class names
* Fixed and tested
* Update README.md
2025-02-02 11:05:25 -08:00
hexgrad
0abd867239
Bump versions for #35 ( #37 )
2025-02-01 13:37:16 -08:00
Farseen
87766f8d58
Readme: Add conda env file. ( #33 )
2025-02-01 09:30:20 -08:00
hexgrad
213f8a1b15
Fold requirements.txt into setup.py, bump to 0.3.3 ( #31 )
2025-01-31 19:03:29 -08:00
remsky
b6cb300d50
feat: integrate loguru for centralized logging, debugs, etc ( #30 )
...
* feat: integrate loguru for controllable logging, debug, etc
* Update pipeline.py
- Characters swapped out from under me by my ide
2025-01-31 18:59:08 -08:00
hexgrad
396766a5b0
Remove emojis #28 and bump to 0.3.2 ( #29 )
2025-01-31 10:14:14 -08:00
remsky
e74290bf5a
feat: add device examples and pipeline updates ( #27 )
2025-01-31 10:09:48 -08:00
Temirulan
39ef7993ba
add possibility to load multiple audios with averaging ( #21 )
...
Co-authored-by: Temirulan Mussayev <temirulan@deepinfra.com >
2025-01-30 15:22:32 -08:00
hexgrad
a09db51873
Remove audio embed from README.md
2025-01-29 15:59:50 -08:00
hexgrad
d3f05c498d
Synchronize README.md
2025-01-29 15:58:54 -08:00
hexgrad
3ab76a9817
Japanese and Mandarin Chinese ( #20 )
2025-01-29 11:18:39 -08:00