maad.util.xc_selection

maad.util.xc_selection(df_dataset, max_nb_files=100, max_length='01:00', min_length='00:10', min_quality='B', verbose=False)[source]

Select a maximum number of recordings depending on their quality and duration in order to create an homogeneous dataset.

Parameters:
df_datasetpandas DataFrame

Dataframe containing all the recordings metadata

max_nb_filesint, optional

Max number of audio files per species. The default is 100.

max_lengthstring, optional

Max duration of the audio files. The default is ‘01:00’.

min_lengthstring, optional

Min duration of the audio files. The default is ‘00:10’.

min_qualitystring, optional

Min quality of the audio files. The default is ‘B’.

verboseboolean, optional

Print messages during the execution of the function. The default is False.

Returns:
df_dataset_outpandas DataFrame

Dataframe containing the selected recordings metadata