Commit Graph

9 Commits

Author SHA1 Message Date
Joel Mathew Thomas 10921cdad3 disabled cuda for yamnet model, and removed multi-env setup scripts, revert back to single env 2025-02-24 20:49:16 +05:30
Joel Mathew Thomas 7295d4ff3d enable cuda for classify_audio and demucs_wrapper 2025-01-25 17:38:27 +05:30
Joel Mathew Thomas cfd30185bd add resample module in preprocessing package, refactor demucs_wrapper.py 2024-12-26 19:47:24 +05:30
Joel Mathew Thomas d0f6c8c77a disable cuda for classify.py 2024-12-26 16:30:19 +05:30
Joel Mathew Thomas e4abb070db Implement audio classification function using YAMNet
- Create a function to load audio, resample, and classify using YAMNet
- Ensure compatibility with different audio formats and sample rates
- Normalize audio and classify it into one of 600 categories
2024-12-26 00:58:53 +05:30
Joel Mathew Thomas cbebf7bd93 Remove unnecessary libraries and replace with librosa
- Removed unused imports for  and
- Replaced  resampling with  for waveform handling
2024-12-26 00:49:10 +05:30
Joel Mathew Thomas 1ddbbadfc8 Integrate YAMNet for audio classification
- Added sample audio files: cafe_crowd_talk.wav, miaow_16k.wav, and speech_whistling2.wav
- Implemented YAMNet-based audio classification in classify.py
- Updated requirements.txt to include TensorFlow and dependencies for YAMNet
2024-12-26 00:41:23 +05:30
Joel Mathew Thomas 0ff6a12829 Implement audio normalization and trimming functions
- Normalize audio to the range [-1, 1]
- Trim silence from audio
- Add test cases for both functions
2024-12-24 20:59:41 +05:30
Joel Mathew Thomas 341d7fa11c add preprocessing dir, add logging for module imports 2024-12-24 20:41:38 +05:30