Joel Mathew Thomas a8edb05d28 update pyproject.toml
2025-02-25 03:15:47 +05:30
2025-01-11 03:14:35 +05:30
2025-02-25 03:10:44 +05:30
2025-02-25 03:10:44 +05:30
2024-12-25 23:35:49 +05:30
2024-12-21 19:33:34 +05:30
2025-02-25 03:15:47 +05:30
2025-02-25 01:42:39 +05:30
2025-02-25 01:45:33 +05:30

freq-split-enhance

An evolving audio processing pipeline designed to separate and enhance audio components using open-source tools. The project aims to provide a modular framework for working with raw audio files, enabling separation, refinement, and post-processing.


🚀 Current Features

  • Audio Input Handling: Uses librosa for reading and handling audio files.
  • Preprocessing: Includes resampling, normalization, and trimming using librosa.
  • Audio Classification: Utilizes Google's YAMNet model to classify audio content.
  • Source Separation: Implements Demucs for music source separation.
  • Noise Reduction: Enhances audio by removing background noise using DeepFilterNet.
  • Post-Processing: Uses librosa to save processed audio files.
  • Modular Architecture: Designed for easy extension and customization.

📁 Project Structure

freq-split-enhance/
├── api/                # API implementation (future work)
├── client/             # Client-side interactions (future work)
├── src/                # Core processing modules
│   ├── input/          # Audio input handling
│   ├── preprocessing/  # Normalization, resampling, trimming
│   ├── separation/     # Source separation with Demucs
│   ├── postprocessing/ # Post-processing and saving results
│   ├── refinement/     # Noise reduction and enhancement
│   ├── spectogram/     # Spectrogram generation and analysis
├── tests/              # Unit tests
├── requirements.txt    # Dependencies
├── README.md           # Project documentation
├── LICENSE             # License information
└── pytest.ini          # Pytest configuration

📝 Wiki

For detailed instructions on installing dependencies, setting up Python environments, and configuring the project, visit the Wiki.

🛡️ License

This project is licensed under the Apache License 2.0.

S
Description
No description provided
Readme 70 MiB
Languages
TypeScript 50.3%
Python 45.5%
Dockerfile 2.3%
JavaScript 0.7%
CSS 0.6%
Other 0.6%