README Title: Nature 4.0: A networked sensor system for integrated biodiversity monitoring Authors: Dirk Zeuss [1], Lisa Bald [1], Nicolas Friess [1], Stephan Wöllauer [1], Kim Lindner [2], Viviane Kohlbrecher [2] and Nina Farwig [2] Affiliations: [1] Department of Geography, Environmental Informatics, Philipps-University Marburg, Deutschhausstrasse 12, 35032 Marburg, Germany [2] Department of Biology, Conservation Ecology, Philipps-University Marburg, Karl-von-Frisch-Strasse 8, 35032 Marburg, Germany Date: 08.05.2023 Language: English Document type: TAR files (.tar), MP3 files (.mp3), CSV file (.csv), BIT file (.bit) and text file (.txt) Keywords: Marburg Open Forest, Temperate european forest, Birds, Audio Funding: LOEWE – Landes-Offensive zur Entwicklung Wissenschaftlich-ökonomischer Exzellenz Description: The dataset includes audio recordings from the University forest of the Philipps University of Marburg (Marburg Open Forest). The Marburg Open Forest is a temperate european forest located about 10km from the city center of Marburg (Lahn), Germany. 48 AudioMoths were distributed throughout the forest for the recordings. Recordings were 48 kHz 16 bit mono. A recording is always one minute long. The recording rhythm was 1 minute recording then one minute pause in which no recording took place. Recordings were made 24 hours a day for around two weeks every month. The files are named according to the recording date using one of the following two naming conventions: Either Year_Month_Day__Hour_Minute (e.g. 2021_10_01__00_00) or Year_Month_Day__Hour_Minute_Second (e.g. 2021_10_31__23_58_03). The audiodata was recorded from April 2020 to October 2022. To comply with privacy regulations, human vocalisations have been detected and removed from the dataset using voice activity detection (VAD). The highly sensitive VAD detector Silero VAD v4.0 (release Oct 28, 2022) has been applied (https://github.com/snakers4/silero-vad). The entire dataset consists of 14093659 files in WAV format and is around 77 TB large. Due to the size of the complete data set, provision via download is not practical. The complete dataset can be requested by clicking on the file complete_dataset.bit. The full dataset is available upon request from 01. January 2024 onwards. An example dataset is available here, that includes recordings of one AudioMoth (ID: 248D9B025FDF0BD1) from mid-March (2021-03-16) to December 2021 (2021-12-31) in MP3 format. In total, this dataset consists of 130873 MP3 files, therefore the dataset is about 30 GB large. The individual MP3 files are grouped by month of recording in TAR archives. The dataset that is available here contains the following files: [1] README file: containing all metadata information [2] 2021_03.tar file: 8163 MP3 files recorded on the following days: 2021-03-16 to 2021-03-27 and 2021-03-31 [3] 2021_04.tar file: 12060 MP3 files recorded on the following days: 2021-04-01 to 2021-04-10 and 2021-04-23 to 2021-04-30 [4] 2021_05.tar file: 9833 MP3 files recorded on the following days: 2021-05-01 to 2021-05-14 [5] 2021_06.tar file: 19338 MP3 files recorded on the following days: 2021-06-02 to 2021-06-30 [6] 2021_07.tar file: 9818 MP3 files recorded on the following days: 2021-07-03 to 2021-07-14 and 2021-07-29 to 2021-07-31 [7] 2021_08.tar file: 18788 MP3 files recorded on the following days: 2021-08-01 to 2021-08-22 and 2021-08-27 to 2021-08-31 [8] 2021_09.tar file: 13915 MP3 files recorded on the following days: 2021-09-01 to 2021-09-04, 2021-09-07 to 2021-09-17 and 2021-09-25 to 2021-09-30 [9] 2021_10.tar file: 16345 MP3 files recorded on the following days: 2021-10-01 to 2021-10-05, 2021-10-10 to 2021_10_20 and 2021-10-23 to 2021-10-31 [10] 2021_11.tar file: 16817 MP3 files recorded on the following days: 2021-11-01, 2021-11-06 to 2021-11-17, 2021-11-19 to 2021-11-30 [11] 2021_12.tar file: 5794 MP3 files recorded on the following days: 2021-12-01 and 2021-12-24 to 2021-12-31 [12] complete_dataset.bit file: Click on this file to contact us if you want to get access to the complete dataset [13] available_recording_dates.csv file: This file contains the recording dates of all files that are available via request. The column “time” describes the recording date and the column “device” the ID of the AudioMoth that recorded the file. In total 14093659 files are available. Version: Version 1.0