e61ae413af061aba9d6237530a29c26cd7385849
freq-split-enhance
An evolving audio processing pipeline designed to separate and enhance audio components using open-source tools. The project aims to provide a modular framework for working with raw audio files, enabling separation, refinement, and post-processing.
🚀 Current Features
- Audio Input Handling: Uses librosa for reading and handling audio files.
- Preprocessing: Includes resampling, normalization, and trimming using librosa.
- Audio Classification: Utilizes Google's YAMNet model to classify audio content.
- Source Separation: Implements Demucs for music source separation.
- Noise Reduction: Enhances audio by removing background noise using DeepFilterNet.
- Post-Processing: Uses librosa to save processed audio files.
- Modular Architecture: Designed for easy extension and customization.
📁 Project Structure
freq-split-enhance/
├── api/ # API implementation (future work)
├── client/ # Client-side interactions (future work)
├── src/ # Core processing modules
│ ├── input/ # Audio input handling
│ ├── preprocessing/ # Normalization, resampling, trimming
│ ├── separation/ # Source separation with Demucs
│ ├── postprocessing/ # Post-processing and saving results
│ ├── refinement/ # Noise reduction and enhancement
│ ├── spectogram/ # Spectrogram generation and analysis
├── tests/ # Unit tests
├── requirements.txt # Dependencies
├── README.md # Project documentation
├── LICENSE # License information
└── pytest.ini # Pytest configuration
📝 Wiki
For detailed instructions on installing dependencies, setting up Python environments, and configuring the project, visit the Wiki.
🛡️ License
This project is licensed under the Apache License 2.0.
Description
Languages
TypeScript
50.3%
Python
45.5%
Dockerfile
2.3%
JavaScript
0.7%
CSS
0.6%
Other
0.6%