An evolving audio processing pipeline designed to separate and enhance audio components using open-source tools. The project aims to provide a modular framework for working with raw audio files, enabling separation, refinement, and post-processing.

🚀 Current Features

Audio Input Handling: Uses librosa for reading and handling audio files.
Preprocessing: Includes resampling, normalization, and trimming using librosa.
Audio Classification: Utilizes Google's YAMNet model to classify audio content.
Source Separation: Implements Demucs for music source separation.
Noise Reduction: Enhances audio by removing background noise using DeepFilterNet.
Post-Processing: Uses librosa to save processed audio files.
Modular Architecture: Designed for easy extension and customization.

📁 Project Structure

freq-split-enhance/
├── api/                # API implementation (future work)
├── client/             # Client-side interactions (future work)
├── src/                # Core processing modules
│   ├── input/          # Audio input handling
│   ├── preprocessing/  # Normalization, resampling, trimming
│   ├── separation/     # Source separation with Demucs
│   ├── postprocessing/ # Post-processing and saving results
│   ├── refinement/     # Noise reduction and enhancement
│   ├── spectogram/     # Spectrogram generation and analysis
├── tests/              # Unit tests
├── requirements.txt    # Dependencies
├── README.md           # Project documentation
├── LICENSE             # License information
└── pytest.ini          # Pytest configuration

📝 Wiki

For detailed instructions on installing dependencies, setting up Python environments, and configuring the project, visit the Wiki.

🛡️ License

This project is licensed under the Apache License 2.0.

Languages

TypeScript 50.3%

Python 45.5%

Dockerfile 2.3%

JavaScript 0.7%

CSS 0.6%

Other 0.6%