{"id":74,"date":"2019-12-09T16:57:00","date_gmt":"2019-12-09T15:57:00","guid":{"rendered":"https:\/\/project.inria.fr\/desed\/?page_id=74"},"modified":"2020-04-20T15:20:09","modified_gmt":"2020-04-20T13:20:09","slug":"synthetic-data","status":"publish","type":"page","link":"https:\/\/project.inria.fr\/desed\/download\/synthetic-data\/","title":{"rendered":"Synthetic soundscapes"},"content":{"rendered":"<h4>Downloading the data<\/h4>\n<p>This page explains how to download the audio files and the scripts used to generate synthetic soundscapes. Three different scenario can be considered.<\/p>\n<h5>User who just wants to download dcase2019 dataset<\/h5>\n<ul>\n<li>Download <code>DESED_synth_dcase2019.tar.gz<\/code> from <strong><a href=\"https:\/\/doi.org\/10.5281\/zenodo.3550598\" rel=\"nofollow\">DESED_synthetic<\/a><\/strong>.<\/li>\n<li><code>tar -xzvf DESED_synth_dcase2019.tar.gz<\/code> to extract it.<\/li>\n<\/ul>\n<h5>User who wants to reproduce dcase2019 dataset<\/h5>\n<ul>\n<li>Clone the repo github <a href=\"https:\/\/github.com\/turpaultn\/DESED\">(<\/a><a href=\"https:\/\/github.com\/turpaultn\/DESED\">https:\/\/github.com\/turpaultn\/DESED<\/a><a href=\"https:\/\/github.com\/turpaultn\/DESED\">)<\/a><\/li>\n<li>Follow the instructions in <code>synthetic\/create_dcase2019_dataset.sh<\/code><\/li>\n<li><em>Be careful, the distortions done on Matlab are up to you to create, it will be updated later to do it in python. For now, if you do not want to create them, uncomment corresponding lines in `create_dcase2019_dataset.sh` to download the eval set to get the distortions data. <\/em><\/li>\n<\/ul>\n<h5>User who wants to create new synthetic data<\/h5>\n<ul>\n<li>Download <code>DESED_synth_soundbank.tar.gz <\/code> from <strong><a href=\"https:\/\/doi.org\/10.5281\/zenodo.3550598\" rel=\"nofollow\">DESED_synthetic<\/a><\/strong>.<\/li>\n<li><code>tar -xzvf DESED_synth_soundbank.tar.gz <\/code> to extract it.<\/li>\n<li><code>cd synthetic\/src<\/code><\/li>\n<li><code>python get_background_training.py<\/code> to download SINS background files.<\/li>\n<li>See examples of code to create files in the repo github in <code>synthetic\/src<\/code>. Described in <a href=\"#gendata\">Generating new synthetic data<\/a>.<\/li>\n<\/ul>\n<h4 id=\"gendata\">Generating new synthetic data<\/h4>\n<p><a href=\"https:\/\/project.inria.fr\/desed\/files\/2020\/02\/soundbank_diagram.png\"><img loading=\"lazy\" decoding=\"async\" class=\"size-large wp-image-175 aligncenter\" src=\"https:\/\/project.inria.fr\/desed\/files\/2020\/02\/soundbank_diagram-1024x425.png\" alt=\"\" width=\"900\" height=\"374\" srcset=\"https:\/\/project.inria.fr\/desed\/files\/2020\/02\/soundbank_diagram-1024x425.png 1024w, https:\/\/project.inria.fr\/desed\/files\/2020\/02\/soundbank_diagram-300x125.png 300w, https:\/\/project.inria.fr\/desed\/files\/2020\/02\/soundbank_diagram-768x319.png 768w, https:\/\/project.inria.fr\/desed\/files\/2020\/02\/soundbank_diagram-150x62.png 150w\" sizes=\"auto, (max-width: 900px) 100vw, 900px\" \/><\/a>Data are generated using <a href=\"https:\/\/github.com\/justinsalamon\/scaper\">Scaper<\/a>. In the following you have examples of how to use it.<br \/>\nFor more information, do not hesitate to check their <a href=\"https:\/\/scaper.readthedocs.io\/en\/latest\/\">docs<\/a>.<\/p>\n<p>Examples of how to generate new sounds in the same way as the Desed_synthetic dataset:<\/p>\n<ul>\n<li><code>generate_training.py<\/code>, uses <code>event_occurences_train.json<\/code> for co-occurrence of events.<\/li>\n<li><code>generate_eval_FBSNR.py<\/code> generates similar subsets with different foreground-background sound to noise ratio (fbsnr): 30dB, 24dB, 15dB, 0dB. Uses <code>event_occurences_eval.json<\/code> for occurence and co-occurrence of events.<\/li>\n<li><code>generate_eval_var_onset.py<\/code> generates subsets with a single event per file, the difference between subsets is the onset position:\n<ol>\n<li>Onset between 0.25s and 0.75s.<\/li>\n<li>Onset between 5.25s and 5.75s.<\/li>\n<li>Onset between 9.25s and 9.75s.<\/li>\n<\/ol>\n<\/li>\n<li><code>generate_eval_long_short.py<\/code> generates subsets with a long event in the background and short events in the foreground, the difference beteen subsets is the FBSNR: 30dB, 15dB, 0dB.<\/li>\n<li><code>generate_eval_distortion.py<\/code> generates distortion subsets, not yet in python, see <code>generate_eval_distortion.m<\/code> for matlab code (will be updated later).<\/li>\n<\/ul>\n<p>When a script is generating multiple subfolder but only one csv file, it means it is the same csv for the different cases. Example: when modifying the FBSNR, we do not change the labels (onset, offsets).<\/p>\n<p><em>Note: The training soundbank can be divided in a training\/validation soundbank if you want to create validation data<\/em><\/p>\n<h4>Class-wise statistics for in terms of isolated events.<\/h4>\n<table>\n<tbody>\n<tr class=\"odd\">\n<td style=\"text-align: left;\"><\/td>\n<td style=\"text-align: center;\">Development set<\/td>\n<td style=\"text-align: center;\">Evaluation set<\/td>\n<\/tr>\n<tr class=\"odd\">\n<td style=\"text-align: left;\">Alarm\/bell\/ringing<\/td>\n<td style=\"text-align: center;\">190<\/td>\n<td style=\"text-align: center;\">63<\/td>\n<\/tr>\n<tr class=\"even\">\n<td style=\"text-align: left;\">Blender<\/td>\n<td style=\"text-align: center;\">98<\/td>\n<td style=\"text-align: center;\">27<\/td>\n<\/tr>\n<tr class=\"odd\">\n<td style=\"text-align: left;\">Cat<\/td>\n<td style=\"text-align: center;\">88<\/td>\n<td style=\"text-align: center;\">26<\/td>\n<\/tr>\n<tr class=\"even\">\n<td style=\"text-align: left;\">Dishes<\/td>\n<td style=\"text-align: center;\">109<\/td>\n<td style=\"text-align: center;\">34<\/td>\n<\/tr>\n<tr class=\"odd\">\n<td style=\"text-align: left;\">Dog<\/td>\n<td style=\"text-align: center;\">136<\/td>\n<td style=\"text-align: center;\">43<\/td>\n<\/tr>\n<tr class=\"even\">\n<td style=\"text-align: left;\">Electric shaver\/toothbrush<\/td>\n<td style=\"text-align: center;\">56<\/td>\n<td style=\"text-align: center;\">17<\/td>\n<\/tr>\n<tr class=\"odd\">\n<td style=\"text-align: left;\">Frying<\/td>\n<td style=\"text-align: center;\">64<\/td>\n<td style=\"text-align: center;\">17<\/td>\n<\/tr>\n<tr class=\"even\">\n<td style=\"text-align: left;\">Running water<\/td>\n<td style=\"text-align: center;\">68<\/td>\n<td style=\"text-align: center;\">20<\/td>\n<\/tr>\n<tr class=\"odd\">\n<td style=\"text-align: left;\">Speech<\/td>\n<td style=\"text-align: center;\">128<\/td>\n<td style=\"text-align: center;\">47<\/td>\n<\/tr>\n<tr class=\"even\">\n<td style=\"text-align: left;\">Vacuum cleaner<\/td>\n<td style=\"text-align: center;\">74<\/td>\n<td style=\"text-align: center;\">20<\/td>\n<\/tr>\n<tr class=\"odd\">\n<td style=\"text-align: left;\">Total<\/td>\n<td style=\"text-align: center;\">1011<\/td>\n<td style=\"text-align: center;\">314<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n","protected":false},"excerpt":{"rendered":"<p>Downloading the data This page explains how to download the audio files and the scripts used to generate synthetic soundscapes. Three different scenario can be considered. User who just wants to download dcase2019 dataset Download DESED_synth_dcase2019.tar.gz from DESED_synthetic. tar -xzvf DESED_synth_dcase2019.tar.gz to extract it. User who wants to reproduce dcase2019\u2026<\/p>\n<p> <a class=\"continue-reading-link\" href=\"https:\/\/project.inria.fr\/desed\/download\/synthetic-data\/\"><span>Continue reading<\/span><i class=\"crycon-right-dir\"><\/i><\/a> <\/p>\n","protected":false},"author":1380,"featured_media":0,"parent":72,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-74","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/project.inria.fr\/desed\/wp-json\/wp\/v2\/pages\/74","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/project.inria.fr\/desed\/wp-json\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/project.inria.fr\/desed\/wp-json\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/project.inria.fr\/desed\/wp-json\/wp\/v2\/users\/1380"}],"replies":[{"embeddable":true,"href":"https:\/\/project.inria.fr\/desed\/wp-json\/wp\/v2\/comments?post=74"}],"version-history":[{"count":16,"href":"https:\/\/project.inria.fr\/desed\/wp-json\/wp\/v2\/pages\/74\/revisions"}],"predecessor-version":[{"id":217,"href":"https:\/\/project.inria.fr\/desed\/wp-json\/wp\/v2\/pages\/74\/revisions\/217"}],"up":[{"embeddable":true,"href":"https:\/\/project.inria.fr\/desed\/wp-json\/wp\/v2\/pages\/72"}],"wp:attachment":[{"href":"https:\/\/project.inria.fr\/desed\/wp-json\/wp\/v2\/media?parent=74"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}