235 Commits

Author SHA1 Message Date
Joel Mathew Thomas 3b7bac94f7 create new package spectogram 2025-01-06 21:43:54 +05:30
Joel Mathew Thomas 6795878bab Merge pull request #7 from joelmathewthomas/refactor/code-cleanup
Code refactoring
2024-12-28 00:37:30 +05:30
Joel Mathew Thomas 8078d4ebd3 code cleanup
removed unused parameter from demucs_wrapper
2024-12-27 19:01:24 +00:00
SUFIYANJT fba8fb9b80 Merge pull request #6 from joelmathewthomas/bugfix/demucs
Bugfix/demucs
2024-12-26 22:20:39 +05:30
Joel Mathew Thomas 6529572f4c fix package initializations 2024-12-26 22:18:34 +05:30
Joel Mathew Thomas 7f4883514d change sampling rate for demucs to 44.1Khz 2024-12-26 22:17:40 +05:30
Joel Mathew Thomas 9d80012cc9 Merge pull request #5 from joelmathewthomas/feature/separation-demucs
Add Demucs Wrapper and Preprocessing Enhancements with Tests
2024-12-26 20:59:52 +05:30
Joel Mathew Thomas a7176a4476 added test for resample module 2024-12-26 20:58:51 +05:30
Joel Mathew Thomas 826af263b3 added tests for demucs-wrapper.py, update deps 2024-12-26 20:46:37 +05:30
Joel Mathew Thomas a4b0955074 update dependencies 2024-12-26 19:49:21 +05:30
Joel Mathew Thomas cfd30185bd add resample module in preprocessing package, refactor demucs_wrapper.py 2024-12-26 19:47:24 +05:30
Joel Mathew Thomas 96b4468138 using librosa to load the audio file 2024-12-26 16:50:29 +05:30
Joel Mathew Thomas d0f6c8c77a disable cuda for classify.py 2024-12-26 16:30:19 +05:30
Joel Mathew Thomas 1c71419537 use demucs with subprocess to separate music into bass, drums, vocals and other 2024-12-26 16:03:36 +05:30
Joel Mathew Thomas e7c750483e create separation package 2024-12-26 13:19:08 +05:30
Joel Mathew Thomas 53538230c4 Merge pull request #4 from joelmathewthomas/feature/preprocessing-classify
Implement audio classification using YAMNet in preprocessing pipeline
2024-12-26 01:11:11 +05:30
Joel Mathew Thomas 24ff8195fc Remove unnecessary file from samples/ 2024-12-26 01:08:52 +05:30
Joel Mathew Thomas dc81f36d74 Add test case for classify_audio function
- Implemented test case to verify audio classification with YAMNet
- Used 'cafe_crowd_talk.wav' as test file, expecting 'Speech' as the output class
2024-12-26 01:02:10 +05:30
Joel Mathew Thomas e4abb070db Implement audio classification function using YAMNet
- Create a function to load audio, resample, and classify using YAMNet
- Ensure compatibility with different audio formats and sample rates
- Normalize audio and classify it into one of 600 categories
2024-12-26 00:58:53 +05:30
Joel Mathew Thomas cbebf7bd93 Remove unnecessary libraries and replace with librosa
- Removed unused imports for  and
- Replaced  resampling with  for waveform handling
2024-12-26 00:49:10 +05:30
Joel Mathew Thomas 1ddbbadfc8 Integrate YAMNet for audio classification
- Added sample audio files: cafe_crowd_talk.wav, miaow_16k.wav, and speech_whistling2.wav
- Implemented YAMNet-based audio classification in classify.py
- Updated requirements.txt to include TensorFlow and dependencies for YAMNet
2024-12-26 00:41:23 +05:30
Joel Mathew Thomas 7fbc8ab651 update dependencies
add tensorflow and tensorflow-hub via pip
2024-12-25 23:43:54 +05:30
Joel Mathew Thomas 52e60e8011 set local python version 2024-12-25 23:35:49 +05:30
Joel Mathew Thomas 9de680e2b0 Merge pull request #3 from joelmathewthomas/update-readme
Update README.md
2024-12-25 23:29:12 +05:30
Joel Mathew Thomas a68362721b Update README.md 2024-12-25 23:28:25 +05:30
Joel Mathew Thomas 05a17bf64b Merge pull request #2 from joelmathewthomas/feature/input-and-preprocessing
Input & Preprocessing Stage: Implement Audio File Reading and Format Validation
2024-12-24 21:38:31 +05:30
Joel Mathew Thomas 0ff6a12829 Implement audio normalization and trimming functions
- Normalize audio to the range [-1, 1]
- Trim silence from audio
- Add test cases for both functions
2024-12-24 20:59:41 +05:30
Joel Mathew Thomas 341d7fa11c add preprocessing dir, add logging for module imports 2024-12-24 20:41:38 +05:30
Joel Mathew Thomas c1e724f6c5 ignore vscode config 2024-12-24 18:58:10 +05:30
Joel Mathew Thomas 919a798437 Initial setup: added src structure, input functionality, tests, and updated .gitignore 2024-12-24 17:52:27 +05:30
Joel Mathew Thomas 17f929e504 Setup input package with initial files 2024-12-24 17:07:43 +05:30
Joel Mathew Thomas 41a50ffcbe add requirements.txt 2024-12-24 16:40:11 +05:30
Joel Mathew Thomas f9356ebb4e add src dir 2024-12-24 16:33:27 +05:30
Joel Mathew Thomas fbffd706aa initial commit 2024-12-24 16:32:13 +05:30
Joel Mathew Thomas bec5d5d994 Initial commit 2024-12-21 19:33:34 +05:30