This program handles the labeling work for raw Korean speech data, including splitting, aligning, cutting, and storing. Place the raw audio recorded by the actor(s) alongside the corresponding script, then execute the steps of the project's core algorithm sequentially.
Stars: 0
Forks: 0
Watchers: 0
Open Issues: 0
19 commits
d5f2eb7 Finalize Selective Composer outputs: s/char AST metric + composition artifacts
63ace80 Implement Selective Data Composer (Stage 4.5) - SAM-inspired quality gate
42e1a81 Update .gitignore: exclude docs/*.docx, docs/*.pdf from tracking
fdacad8 Pre-implementation backup: voice offset fix validated, selective composer planned
599ae06 Fix voice offset truncation: sustained silence verification + architecture improvements
ae092f8 Remove logs/skipped_lines.log from tracking (contains script text)
0de51c5 Add LICENSE (All Rights Reserved) and remove sensitive files from tracking
7bae718 Merge branch 'main' of https://github.com/wkdrns202/TTSDataSetCleanser
29d9d1c Update README with full pipeline documentation based on progress report
3dcfb0b Update pipeline logs and parameters from Script_5 re-processing (Iteration 7)