Commit Graph

12 Commits

Author SHA1 Message Date
Joel Mathew Thomas 489a37c007 combine separation tests
combined separation tests, because classify_audio would fail when other models ran on GPU
2025-01-29 00:28:54 +05:30
Joel Mathew Thomas 7f973db1d0 add test case to test mpariente/ConvTasNet_WHAM_sepclean_wrapper 2025-01-28 23:51:21 +05:30
Joel Mathew Thomas fae60a8120 Fix: Corrected shape handling for separated sources, extracting each source as a separate 1D array 2025-01-28 23:34:08 +05:30
Joel Mathew Thomas 9e7c01c19e speech separation works, but the sources end up on either channel of the same audio 2025-01-28 23:19:52 +05:30
Joel Mathew Thomas 5164e2cebd rename samples/ to test_audio, and move it to tests/ 2025-01-11 03:24:05 +05:30
Joel Mathew Thomas 8078d4ebd3 code cleanup
removed unused parameter from demucs_wrapper
2024-12-27 19:01:24 +00:00
Joel Mathew Thomas 7f4883514d change sampling rate for demucs to 44.1 kHz 2024-12-26 22:17:40 +05:30
Joel Mathew Thomas a7176a4476 added test for resample module 2024-12-26 20:58:51 +05:30
Joel Mathew Thomas 826af263b3 added tests for demucs-wrapper.py, update deps 2024-12-26 20:46:37 +05:30
Joel Mathew Thomas dc81f36d74 Add test case for classify_audio function
- Implemented test case to verify audio classification with YAMNet
- Used 'cafe_crowd_talk.wav' as test file, expecting 'Speech' as the output class
2024-12-26 01:02:10 +05:30
Joel Mathew Thomas 0ff6a12829 Implement audio normalization and trimming functions
- Normalize audio to the range [-1, 1]
- Trim silence from audio
- Add test cases for both functions
2024-12-24 20:59:41 +05:30
Joel Mathew Thomas 919a798437 Initial setup: added src structure, input functionality, tests, and updated .gitignore 2024-12-24 17:52:27 +05:30