Joel Mathew Thomas
cbf2b022a5
Replace YAMNet model with panns-inference
The YAMNet model was causing issues: loading a PyTorch model in an environment where the TensorFlow-based YAMNet was already loaded caused a segmentation fault
2025-02-26 17:36:27 +05:30
Joel Mathew Thomas
3074084ac1
refactor code, use package libraries for freqsplit/preprocessing/classify
- add params sr=None and mono=False to freqsplit/input/file_reader/read_audio()
- remove loading audio using librosa in freqsplit/preprocessing/classify/classify_audio()
- add error handling for when the sr is not 16 kHz
- raise RuntimeError if the YAMNet model fails
- update tests/test_preprocessing
2025-02-25 20:13:02 +05:30
Joel Mathew Thomas
0e0e765b6e
install freqsplit as a package
2025-02-25 03:10:44 +05:30
Joel Mathew Thomas
90d3076e65
create test for noisereduce
2025-02-24 20:17:00 +05:30
Joel Mathew Thomas
489a37c007
combine separation tests
combined separation tests, because classify_audio would fail when other models ran on GPU
2025-01-29 00:28:54 +05:30
Joel Mathew Thomas
7f973db1d0
add test case to test mpariente/ConvTasNet_WHAMsepcleanwrapper
2025-01-28 23:51:21 +05:30
Joel Mathew Thomas
fae60a8120
Fix: Corrected shape handling for separated sources, extracting each source as a separate 1D array
2025-01-28 23:34:08 +05:30
Joel Mathew Thomas
9e7c01c19e
speech separation works, but the sources end up on the two channels of the same audio
2025-01-28 23:19:52 +05:30
Joel Mathew Thomas
5164e2cebd
rename samples/ to test_audio, and move it to tests/
2025-01-11 03:24:05 +05:30
Joel Mathew Thomas
8078d4ebd3
code cleanup
removed unused parameter from demucs_wrapper
2024-12-27 19:01:24 +00:00
Joel Mathew Thomas
7f4883514d
change sampling rate for demucs to 44.1 kHz
2024-12-26 22:17:40 +05:30
Joel Mathew Thomas
a7176a4476
added test for resample module
2024-12-26 20:58:51 +05:30
Joel Mathew Thomas
826af263b3
added tests for demucs-wrapper.py, update deps
2024-12-26 20:46:37 +05:30
Joel Mathew Thomas
dc81f36d74
Add test case for classify_audio function
- Implemented test case to verify audio classification with YAMNet
- Used 'cafe_crowd_talk.wav' as test file, expecting 'Speech' as the output class
2024-12-26 01:02:10 +05:30
Joel Mathew Thomas
0ff6a12829
Implement audio normalization and trimming functions
- Normalize audio to the range [-1, 1]
- Trim silence from audio
- Add test cases for both functions
2024-12-24 20:59:41 +05:30
Joel Mathew Thomas
919a798437
Initial setup: added src structure, input functionality, tests, and updated .gitignore
2024-12-24 17:52:27 +05:30