Librilight Preprocess Scripts Revision #125

HarryHe11 · 2024-01-29T07:55:31Z

✨ Description

This PR aims to enhance the compatibility and efficiency of the librilight preprocessing scripts with the new NS2 framework to be updated by @HeCheng0625. By updating the librilight preprocessing scripts, we ensure they align with the new NS2 dataloader's expectations, facilitating smoother NS2 training processes. Besides, this PR also improves the usability of the original librilight preprocess scripts.

🚧 Related Issues

#115

👨‍💻 Changes Proposed

Updated librilight preprocess scripts for better alignment with the NS2 dataloader, ensuring compatibility and efficiency.
Enhanced the original librilight preprocess scripts, improving usability.

🧑‍🤝‍🧑 Who Can Review?

@HeCheng0625 @lmxue

🛠 TODO

Conduct a comprehensive test of the entire NS2 training pipeline with the updated preprocessing scripts, as coordinated with @HeCheng0625, to validate the enhancements and ensure seamless integration and functionality within the new NS2 framework to be updated.

✅ Checklist

Code has been reviewed
Code complies with the project's code standards and best practices
Code has passed all tests
Code does not affect the normal use of existing features
Code has been commented properly
Documentation has been updated (if applicable)
Demo/checkpoint has been attached (if applicable)

into librilight-update

lmxue

Thanks for your efforts. Please check the comments.

lmxue · 2024-01-30T04:48:54Z

preprocessors/librilight.py

- or not os.path.exists(mfa_model_path)
- or not os.path.exists(mfa_config_path)
- ):
+ if not os.path.exists(mfa_dict_path) or not os.path.exists(mfa_model_path):


I recommend modifying this to separately check if different files exist. Otherwise, it's unclear which specific file is missing.

config/tts.json

lmxue · 2024-01-30T05:25:56Z

bins/tts/preprocess.py

+ if "librilight" in cfg.dataset:
+ return


These two lines of code imply that the LibriLight dataset does not undergo subsequent feature extraction; does it use online feature extraction? Can we add a condition here to determine whether the feature extraction method is online extraction or pre-extraction? This means our system will support two types of feature extraction methods.
P.S., we need to integrate the online feature extraction process later.

HarryHe11 and others added 9 commits January 12, 2024 13:52

Delete utils/whisper.py

33b3b9e

Merge branch 'open-mmlab:main' into main

01d74e1

Merge branch 'open-mmlab:main' into main

dc51643

update preprocessing

43ddfd9

Update librilight.py

11b9314

Update mel.py

00dce0e

use the latest black to re-format codes

53ff2fb

Merge branch 'librilight-update' of https://github.com/HarryHe11/Amphion

5f031cc

into librilight-update

Update librilight, skip silent phones

82f52fa

HarryHe11 requested review from lmxue and HeCheng0625 January 29, 2024 09:08

lmxue requested changes Jan 30, 2024

View reviewed changes

HarryHe11 added 2 commits January 30, 2024 13:58

Update README.md

b89db60

Merge branch 'open-mmlab:main' into librilight-update

3b5f8ae

HarryHe11 added the Status: in progress label Feb 16, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Librilight Preprocess Scripts Revision #125

Librilight Preprocess Scripts Revision #125

HarryHe11 commented Jan 29, 2024 •

edited

lmxue left a comment

lmxue Jan 30, 2024

lmxue Jan 30, 2024

Librilight Preprocess Scripts Revision #125

Are you sure you want to change the base?

Librilight Preprocess Scripts Revision #125

Conversation

HarryHe11 commented Jan 29, 2024 • edited

✨ Description

🚧 Related Issues

👨‍💻 Changes Proposed

🧑‍🤝‍🧑 Who Can Review?

🛠 TODO

✅ Checklist

lmxue left a comment

Choose a reason for hiding this comment

lmxue Jan 30, 2024

Choose a reason for hiding this comment

lmxue Jan 30, 2024

Choose a reason for hiding this comment

HarryHe11 commented Jan 29, 2024 •

edited