6KSFX Synth Dataset: Code for trimming, down-mixing, and labeling sound samples in the dataset, designed for streamlined audio processing and analysis

nellyngz95/6KSFX

6KSFx - Procedural Audio Dataset

Procedural audio (generating sound from scratch using computational processes) offers a powerful approach to creating sound effects, often referred to as digital Foley. However, research and optimization in this field have been hindered by the limited availability of publicly accessible samples and models.

This dataset addresses that gap by providing 6,000 synthetic audio samples to advance research in sound synthesis. With 30 categories containing 100 samples each, this 2.56 GB dataset supports the development of new synthesis techniques and evaluation methods. By making these samples publicly available, we aim to empower researchers, audio developers, and sound designers to optimize and expand procedural audio applications.

Additionally, 50 noise-augmented samples have been included for machine learning applications.

Each sample follows the naming format: [Sound Category] - [Sample Number] - [Label]

| Category | Label | Category | Label | Category | Label | Category | Label |
|---|---|---|---|---|---|---|---|
| Applause | 1 | Gunshots | 19 | Whoosh | 38 | Twang | 56 |
| Church Bells | 3 | Helicopter | 21 | Fireworks | 40 | Bounce Rubber | 58 |
| Bubbles | 5 | Pouring Hot Water | 23 | Concrete Footsteps | 42 | Electric Buzz | 60 |
| Droplets | 7 | Rain | 25 | Wood Footsteps | 44 | | |
| Crackling | 9 | Thunder | 27 | Space Cannon | 46 | | |
| Debris Glass | 11 | Waves | 29 | Spray | 48 | | |
| Engine | 13 | Wind | 31 | Metal Debris | 50 | | |
| Explosions | 15 | Boat | 33 | Rocket | 52 | | |
| Fire | 17 | Jet | 35 | Concrete Debris | 54 | | |
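
As a minimal sketch of how the naming format and the label table above might be used together, the Python snippet below parses a sample name and checks its label against the table. The exact separator spacing, the `.wav` extension, the example file name, and the helper names (`parse_sample_name`, `label_matches_table`) are assumptions made for illustration; they are not taken from the repository's scripts.

```python
import re
from typing import NamedTuple

# Label numbers copied from the category table above.
CATEGORY_LABELS = {
    "Applause": 1, "Church Bells": 3, "Bubbles": 5, "Droplets": 7,
    "Crackling": 9, "Debris Glass": 11, "Engine": 13, "Explosions": 15,
    "Fire": 17, "Gunshots": 19, "Helicopter": 21, "Pouring Hot Water": 23,
    "Rain": 25, "Thunder": 27, "Waves": 29, "Wind": 31, "Boat": 33,
    "Jet": 35, "Whoosh": 38, "Fireworks": 40, "Concrete Footsteps": 42,
    "Wood Footsteps": 44, "Space Cannon": 46, "Spray": 48,
    "Metal Debris": 50, "Rocket": 52, "Concrete Debris": 54,
    "Twang": 56, "Bounce Rubber": 58, "Electric Buzz": 60,
}


class SampleName(NamedTuple):
    category: str
    sample_number: int
    label: int


# Assumed layout: "<category> - <sample number> - <label>", optional ".wav".
_NAME_RE = re.compile(
    r"^\s*(?P<category>.+?)\s*-\s*(?P<number>\d+)\s*-\s*(?P<label>\d+)\s*$"
)


def parse_sample_name(filename: str) -> SampleName:
    """Split a sample name into category, sample number, and label."""
    stem = filename[:-4] if filename.lower().endswith(".wav") else filename
    match = _NAME_RE.match(stem)
    if match is None:
        raise ValueError(f"unexpected sample name: {filename!r}")
    return SampleName(
        match["category"], int(match["number"]), int(match["label"])
    )


def label_matches_table(sample: SampleName) -> bool:
    """Return True when the label agrees with the category table."""
    return CATEGORY_LABELS.get(sample.category) == sample.label


# Hypothetical file name, used only to show the call.
sample = parse_sample_name("Rain - 12 - 25.wav")
print(sample, label_matches_table(sample))
```

If the actual files use a different separator or extension, only the regular expression and the suffix check would need to change.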

The dataset is available for download on Zenodo: https://zenodo.org/records/14517916

Paper describing the dataset: https://arxiv.org/abs/2501.17198

This GitHub repository contains the scripts for trimming, down-mixing, and labeling the sound samples in the dataset, designed for streamlined audio processing and analysis.
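
The repository's scripts are not reproduced here. As a rough illustration of what down-mixing and trimming mean for raw sample data, the sketch below averages the channels of each frame into a mono signal and cuts it to a maximum duration. The function names, the frame representation (one tuple of channel values per frame), and the equal-weight averaging are assumptions for this example and may differ from what the actual scripts do.

```python
from typing import List, Sequence


def downmix_to_mono(frames: Sequence[Sequence[float]]) -> List[float]:
    """Average the channel values of each frame into one mono sample."""
    mono = []
    for frame in frames:
        if len(frame) == 0:
            raise ValueError("each frame needs at least one channel")
        mono.append(sum(frame) / len(frame))
    return mono


def trim(samples: Sequence[float], sample_rate: int, max_seconds: float) -> List[float]:
    """Keep at most max_seconds of audio from the start of the signal."""
    max_len = int(round(sample_rate * max_seconds))
    return list(samples[:max_len])


# Tiny stereo signal: four frames of (left, right) values.
stereo = [(1.0, 0.0), (0.5, 0.5), (0.0, -1.0), (0.25, 0.75)]
mono = downmix_to_mono(stereo)
# With a toy sample rate of 2 Hz, one second is two samples.
print(trim(mono, sample_rate=2, max_seconds=1.0))
```

In practice the frames would come from an audio reader rather than a hand-written list, but the two operations stay the same.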
