HiFiPLN v1

Warning

This is an older version of the HiFiPLN vocoder. It has several issues including not working with GPU acceleration enabled in OpenUtau.
Please use HiFiPLN v2 instead.

Info

Multispeaker Community Vocoder model for DiffSinger

Trained with ~95 hours of varied singing data.

The goal of our vocoder model is to provide more quality possibilities that can be brought out of DiffSinger acoustics. The vocoder can be used with any voice.
We would like to send huge thanks to the Western Diffsinger community for providing datasets for training! Without them, this model wouldn’t be possible.

Code used to train the vocoder is avaible on HiFiPLN Github.
Pretrained checkpoint for finetuning is available at the bottom of this page.

  • Model Developed & Trained by Scarfmonster
  • Data coordination by PixPrucer

License

Copyright © 2024, Ryszard Goń, Oliwier Ziembla.
https://utau.pl/vocoders/hifipln-v1/

All model weights in the HiFiPLN Community Vocoder, including model weights available to download on this page, are provided under the Attribution-NonCommercial-ShareAlike 4.0 International license.

TERMS OF REDISTRIBUTION:

  1. Do not sell this vocoder, or charge any fees from redistributing it, as prohibited by the license.
  2. Include a copy of the CC BY-NC-SA 4.0 license, or a link referring to it.
  3. Include a copy of this notice, or any other notices informing that this vocoder is provided by Ryszard Goń and Oliwier Ziembla, that this vocoder is licensed under CC BY-NC-SA 4.0, and with a complete acknowledgement list as shown above.
  4. If you fine-tuned or modified the weights, leave a notice about what has been changed.
  5. Leave a link to the official release page of the vocoder, and tell users that other versions and future updates of this vocoder can be obtained from the website.

How To Use

  1. Download your vocoder of choice
  2. Drag and drop the downloaded .oudep file onto the OpenUtau window.
  3. Change the dsconfig.yaml configuration file of your chosen voice and set vocoder: to a proper value. For your convenienve the setting is listed in the table. Save the file, restart OpenUtau if you had it opened.

Download

By downloading the Vocoders you agree to the Terms of Service.

NameDownloadVersiondsconfig.yamlNotes
hifipln
89.2 MB
1.1vocoder: hifipln_1.1sample_rate: 44100,
n_mels: 128,
hop_length: 512
hifipln
89.2 MB
1.0vocoder: hifipln_1.0sample_rate: 44100,
n_mels: 128,
hop_length: 512

Singing databases used for training the model

NameLengthLanguagesContributor
AdoVoc Pro00:05:28Caló, SpanishAdoVoc Pro
A.I.chi01:47:52English, JapanesePeeslubn
Aida04:00:48EN, JA, DE, FRViolin
Albert01:17:14PolishSzTJ
Aleks00:06:10PolishSzTJ
Ameko Kero02:54:31English, JapaneseHoodyPisDed
Ariel01:09:01Japaneseariika
Brent00:05:15SpanishBeatrix
Cantoria Dataset02:25:08SpanishCantoria Dataset
Codie01:00:37Japanesecode41den
Deshi01:47:45Japanese, TagalogUtaUtaUtau
Esmuc Choir Dataset00:21:31GermanEsmuc Choir Dataset
Evelyn01:05:32EnglishViolin
Filip01:17:49PolishRainygardens
Geppei00:30:15Japanese, Polish, Ukrainianvahntanabe
Hania00:05:32PolishSzTJ
Hisaki02:42:57Japaneseryutsu
Inka00:39:18English, JapanesepostTEENIDOL
Jalo00:54:53PolishSzTJ
Karasu00:49:55Japaneserev
Kazuo00:33:40JapaneseFelipe Souza
Kiiro01:44:54English, JapaneseRyouichi
Konryuu01:10:55JapanesePixPrucer
Kurenai00:55:26Japaneseliure
Leif01:28:40English, JapaneseTigermeat
Lem00:14:22PolishWik
Liee00:25:01JA, EN, Latinjulieraptor
Makam Acapella00:38:53TurkishMakam Acapella
Makku02:06:58JA, EN, ES, ITGianloop
Mat00:35:44Polishhq_png
Matsuki Max01:25:32JapaneseHaraoo
Mava01:46:33English, JapaneseEnzo
Mora01:49:03English, Japanesefunhouse
Namine Criss00:31:02Spanish, JapaneseCrissZ3R0VZ
Nanabot00:29:23EnglishpostTEENIDOL
Naoky03:31:55EN, JA, KO, ZHxuu
——03:55:02——Anonymous Contributor
Paulina00:29:34PolishSzTJ
Peiton02:31:09EnglishNebulaMeadow
PIX04:10:54Polish, JapanesePixPrucer
Otozora Rinly02:49:43JapaneseUniverStars
Ron02:28:10EN, JA, PL, KO, ZHGalanist
Rose00:42:39Polish, JapaneseKisa
Ryszard02:24:16Polish, JapaneseScarfmonster
——01:50:00——Anonymous Contributor
Singing Database02:46:46Chinese, ItalianSinging Database
Ace02:50:26English, JapaneseSpoopyAce
Stefan02:49:07Polish, LatinSzTJ
Suzu01:42:03Japaneseariika
Taylor01:09:24EnglishpostTEENIDOL
Teo Vampa01:56:33JapaneseDelphic
Tetsu01:13:46Japaneseariika
Tiger03:31:27EN, ES, JP, KO, ZH, PT, FRTigermeat
Tomo00:57:00Spanish, JapaneseTomo
Vocadito00:13:37EN, FR, HAW, ES, TL, ValencianVocadito
VocalSet08:46:18VocaliseVocalSet
Wanda01:11:03PolishVieri
Wioletta00:32:56PolishSzTJ
Zethiel Yu02:19:19Englishxiel exalt
Zethiel Zero00:32:07English, Japanesexiel exalt
Total length:98:28:51
Used length:82:06:15

Pitch distribution

Dataset

Dataset pitch Distrbution

After augmentation

Augmented Dataset pitch Distrbution