.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "_auto_examples/2_advanced/plot_compare_auto_and_manual_rois_selection.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download__auto_examples_2_advanced_plot_compare_auto_and_manual_rois_selection.py>`
        to download the full example code.

.. rst-class:: sphx-glr-example-title

.. _sphx_glr__auto_examples_2_advanced_plot_compare_auto_and_manual_rois_selection.py:


Find Regions of interest (ROIs) in a spectrogram
================================================

A spectrogram is a time-frequency (2d) representation of a audio recording. 
Each acoustic event nested in the audio recording is represented by an acoustic
signature. When sounds does not overlap in time and frequency, it is possible
to extract automatically the acoustic signature as a region of interest (ROI) 
by different image processing tools such as binarization, double thresholding,
mathematical morphology tools...

**Dependencies**: To execute this example you will need to have installed the 
scikit-image, scikit-learn and pandas Python packages.

.. GENERATED FROM PYTHON SOURCE LINES 19-21

.. code-block:: Python

    # sphinx_gallery_thumbnail_path = './_images/sphx_glr_plot_compare_auto_and_manual_rois_selection_005.png'


.. GENERATED FROM PYTHON SOURCE LINES 22-24

Load required modules
---------------------

.. GENERATED FROM PYTHON SOURCE LINES 24-34

.. code-block:: Python

    import numpy as np
    import pandas as pd
    from sklearn.cluster import KMeans
    from sklearn.preprocessing import StandardScaler
    from maad import sound, rois, features
    from maad.util import (
       power2dB, plot2d, format_features, read_audacity_annot, 
       overlay_rois, overlay_centroid
       )


.. GENERATED FROM PYTHON SOURCE LINES 35-36

First, load and audio file and compute the power spectrogram.

.. GENERATED FROM PYTHON SOURCE LINES 36-60

.. code-block:: Python

    s, fs = sound.load('../../data/cold_forest_daylight.wav')

    dB_max = 96

    Sxx_power, tn, fn, ext = sound.spectrogram(
       x=s, 
       fs=fs, 
       nperseg=1024, 
       noverlap=1024//2
       )

    # Convert the power spectrogram into dB, add dB_max which is the maximum decibel
    # range when quantification bit is 16bits and display the result
    Sxx_db = power2dB(Sxx_power) + dB_max

    plot2d(
       im=Sxx_db, 
       **{
          'vmin':0, 
          'vmax':dB_max, 
          'extent':ext
          }
       )


.. image-sg:: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_001.png
   :alt: plot compare auto and manual rois selection
   :srcset: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_001.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 61-71

Then, relevant acoustic events are extracted directly from the power 
spectrogram based on a double thresholding technique. The result is binary
image called a mask. Double thresholding technique is more sophisticated than
basic thresholding based on a single value. First, a threshold selects pixels
with high value (i.e. high acoustic energy). They should belong to an acoustic
event. They are called seeds. From these seeds, we aggregate pixels connected
to the seed with value higher than the second threslhold. These new pixels 
become seed and the aggregating process continue until no more new pixels are
aggregated, meaning that there is no more connected pixels with value upper 
than the second threshold value.

.. GENERATED FROM PYTHON SOURCE LINES 71-166

.. code-block:: Python


    # First we remove the stationary background in order to increase the contrast [1]
    # Then we convert the spectrogram into dB
    Sxx_power_noNoise= sound.median_equalizer(
       Sxx=Sxx_power, 
       display=True, 
       **{'extent':ext}
       )
    Sxx_db_noNoise = power2dB(Sxx_power_noNoise)

    # Then we smooth the spectrogram in order to facilitate the creation of masks as
    # small sparse details are merged if they are close to each other
    Sxx_db_noNoise_smooth = sound.smooth(
       Sxx=Sxx_db_noNoise, 
       std=0.5, 
       display=True, 
       savefig=None, 
       **{
          'vmin':0, 
          'vmax':dB_max, 
          'extent':ext
          }
       )

    # Then we create a mask (i.e. binarization of the spectrogram) by using the 
    # double thresholding technique
    im_mask = rois.create_mask(
       im=Sxx_db_noNoise_smooth, 
       mode_bin='relative', 
       bin_std=8, 
       bin_per=0.5,
       verbose=False, 
       display=False
       )

    # Finaly, we put together pixels that belong to the same acoustic event, and 
    # remove very small events (<=25 pixel²)
    im_rois, df_rois = rois.select_rois(
       im_bin=im_mask, 
       min_roi=25, 
       max_roi=None, 
       display= True,
       **{'extent':ext}
       )

    # format dataframe df_rois in order to convert pixels into time and frequency
    df_rois = format_features(
       df=df_rois, 
       tn=tn, 
       fn=fn
       )

    # overlay bounding box on the original spectrogram
    ax0, fig0 = overlay_rois(
       im_ref=Sxx_db, 
       rois=df_rois, 
       **{
          'vmin':0, 
          'vmax':dB_max, 
          'extent':ext
          }
       )

    # Compute centroids
    df_centroid = features.centroid_features(
       Sxx=Sxx_db, 
       rois=df_rois, 
       im_rois=im_rois
       )

    # format dataframe df_centroid in order to convert pixels into time and frequency
    df_centroid = format_features(
       df=df_centroid, 
       tn=tn, 
       fn=fn
       )

    # overlay centroids on the original spectrogram
    ax0, fig0 = overlay_centroid(
       im_ref=Sxx_db, 
       centroid=df_centroid, 
       savefig=None,
       **{
          'vmin':0,
          'vmax':dB_max,
          'extent':ext,
          'ms':4, 
          'marker':'+', 
          'color':'red',
          'fig':fig0, 
          'ax':ax0
          }
       )


.. rst-class:: sphx-glr-horizontal


    *

      .. image-sg:: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_002.png
         :alt: Spectrogram without stationnary noise
         :srcset: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_002.png
         :class: sphx-glr-multi-img

    *

      .. image-sg:: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_003.png
         :alt: Orignal Spectrogram, Blurred Spectrogram (std=0.5)
         :srcset: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_003.png
         :class: sphx-glr-multi-img

    *

      .. image-sg:: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_004.png
         :alt: Selected ROIs
         :srcset: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_004.png
         :class: sphx-glr-multi-img

    *

      .. image-sg:: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_005.png
         :alt: ROIs Overlay
         :srcset: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_005.png
         :class: sphx-glr-multi-img


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    /Users/jsulloa/miniconda3/envs/maad/lib/python3.11/site-packages/maad/util/miscellaneous.py:977: FutureWarning: Setting an item of incompatible dtype is deprecated and will raise in a future error of pandas. Value '[  6.98920635   6.98920635   4.99229025   8.01088435   3.9938322
       4.99229025   2.99537415   2.99537415   2.99537415   4.99229025
       4.99229025   8.01088435   5.9907483   15.99854875  25.00789116
      17.99546485  15.99854875   5.9907483  118.00380952 141.99002268
     161.00716553   6.98920635  11.0062585   14.00163265 132.00544218
      41.00643991 150.00090703  14.00163265  10.00780045   8.01088435
      15.0000907    9.0093424   18.9939229    6.98920635  21.98929705
      47.99564626  14.00163265  11.0062585   15.0000907  123.99455782
      16.9970068   68.01124717   5.9907483   15.0000907   40.00798186
      29.00172336   5.9907483   17.99546485  17.99546485  11.0062585
      14.00163265  11.0062585   12.00471655  63.99419501  11.0062585
       3.9938322    9.0093424   14.00163265  15.0000907   16.9970068
       9.0093424   11.0062585  135.99927438   8.01088435   8.01088435
      11.0062585    5.9907483    8.01088435   5.9907483   12.00471655
      15.0000907    6.98920635  15.0000907    8.01088435   5.9907483
       5.9907483    5.9907483    9.0093424    4.99229025  14.00163265
      14.00163265   9.0093424   14.00163265  11.0062585    5.9907483
       8.01088435   3.9938322   19.99238095   5.9907483   15.99854875
      58.00344671   5.9907483    4.99229025  27.00480726  17.99546485
       4.99229025  10.00780045  15.99854875  15.0000907   90.99900227
      15.99854875   8.01088435  14.00163265  17.99546485  21.98929705
      13.0031746   13.0031746    9.0093424   15.99854875  53.01115646
       5.9907483   45.00027211  10.00780045  88.00362812   8.01088435
      15.0000907    4.99229025  81.98965986  43.00335601   5.9907483
      74.00199546  15.99854875  80.99120181  47.99564626  44.00181406
      42.00489796  41.00643991  96.98975057  44.00181406  42.00489796
      42.00489796  41.00643991  42.00489796  42.00489796  47.99564626
      43.00335601  35.99092971  41.00643991  42.00489796  39.00952381
      40.00798186  40.00798186  40.00798186  40.00798186  40.00798186
      12.00471655  39.00952381  39.00952381  43.00335601  38.01106576
      35.99092971  47.99564626  57.00498866  33.99401361  12.00471655
      12.00471655  17.99546485  63.99419501  53.01115646  62.99573696
      14.00163265  60.99882086  23.01097506  59.00190476  12.00471655
      21.98929705  10.00780045  76.99736961  12.00471655  11.0062585
      60.99882086  61.99727891  28.00326531  60.00036281  60.99882086
      57.00498866  11.0062585   10.00780045  76.99736961  79.99274376
      48.99410431   8.01088435  15.99854875  50.99102041  15.0000907
      15.0000907   14.00163265  36.98938776  15.0000907   24.00943311
      24.00943311  10.00780045  15.99854875  33.99401361  18.9939229
      12.00471655  20.990839    17.99546485  16.9970068   15.0000907
      15.0000907   28.00326531  30.99863946  25.00789116  26.00634921
      24.00943311  19.99238095  16.9970068   29.00172336  30.99863946
      24.00943311  17.99546485  18.9939229   23.01097506  17.99546485
      29.00172336  29.00172336  16.9970068   21.98929705  17.99546485
      19.99238095  25.00789116  16.9970068   20.990839    27.00480726
      14.00163265  26.00634921  10.00780045  19.99238095  15.0000907
      14.00163265  13.0031746   14.00163265]' has dtype incompatible with int64, please explicitly cast to a compatible dtype first.
      df.update(pd.DataFrame(area,
    /Users/jsulloa/miniconda3/envs/maad/lib/python3.11/site-packages/maad/util/miscellaneous.py:977: FutureWarning: Setting an item of incompatible dtype is deprecated and will raise in a future error of pandas. Value '[ 86.1328125   0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
      86.1328125 172.265625  344.53125     0.          0.          0.
     172.265625    0.        172.265625    0.          0.          0.
       0.          0.          0.          0.          0.         86.1328125
       0.          0.          0.        172.265625   86.1328125  86.1328125
       0.          0.          0.          0.          0.          0.
       0.          0.         86.1328125   0.          0.         86.1328125
       0.          0.          0.          0.          0.          0.
       0.         86.1328125  86.1328125   0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
      86.1328125   0.          0.          0.          0.          0.
       0.          0.          0.         86.1328125   0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.         86.1328125   0.         86.1328125
       0.          0.          0.         86.1328125   0.          0.
      86.1328125   0.         86.1328125   0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.       ]' has dtype incompatible with int64, please explicitly cast to a compatible dtype first.
      df.update(pd.DataFrame(area,


.. GENERATED FROM PYTHON SOURCE LINES 167-171

Let's compare with the manual annotation (Ground Truth GT) obtained with 
Audacity software.
Each acoustic signature is manually selected and labeled. All similar acoustic 
signatures are labeled with the same name

.. GENERATED FROM PYTHON SOURCE LINES 171-227

.. code-block:: Python

    df_rois_GT = read_audacity_annot('../../data/cold_forest_daylight_label.txt')  ## annotations using Audacity

    # drop rows with frequency and time outside of tn and fn
    df_rois_GT = df_rois_GT[(df_rois_GT.min_t >= tn.min()) & 
                            (df_rois_GT.max_t <= tn.max()) & 
                            (df_rois_GT.min_f >= fn.min()) & 
                            (df_rois_GT.max_f <= fn.max())]

    # format dataframe df_rois in order to convert time and frequency into pixels
    df_rois_GT = format_features(df_rois_GT, tn, fn)

    # overlay bounding box on the original spectrogram
    ax1, fig1 = overlay_rois(
       im_ref=Sxx_db, 
       rois=df_rois_GT, 
       **{
          'vmin':0,
          'vmax':dB_max,
          'extent':ext
          }
       )

    # Compute centroids
    df_centroid_GT = features.centroid_features(
       Sxx=Sxx_db, 
       rois=df_rois_GT
       )

    # format dataframe df_centroid_GT in order to convert pixels into time and frequency
    df_centroid_GT = format_features(
       df=df_centroid_GT, 
       tn=tn, 
       fn=fn
       )

    # overlay centroids on the original spectrogram
    ax1, fig1 = overlay_centroid(
       im_ref=Sxx_db, 
       centroid=df_centroid_GT, 
       savefig=None, 
       **{
          'vmin':0,
          'vmax':dB_max,
          'extent':ext,
          'ms':2, 
          'marker':'+',
          'color':'red',
          'fig':fig1, 
          'ax':ax1
          }
       )

    # print informations about the rois
    print ('Total number of ROIs : %2.0f' %len(df_rois_GT))
    print ('Number of different ROIs : %2.0f' %len(np.unique(df_rois_GT['label'])))


.. image-sg:: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_006.png
   :alt: ROIs Overlay
   :srcset: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_006.png
   :class: sphx-glr-single-img


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    /Users/jsulloa/miniconda3/envs/maad/lib/python3.11/site-packages/maad/util/miscellaneous.py:977: FutureWarning: Setting an item of incompatible dtype is deprecated and will raise in a future error of pandas. Value '[ 43.00335601  27.00480726  36.98938776  40.00798186  26.00634921
      33.99401361  12.00471655  13.0031746   14.00163265  45.00027211
       9.0093424   20.990839    43.00335601  12.00471655  39.00952381
      12.00471655  25.00789116  42.00489796  24.00943311  12.00471655
      11.0062585   11.0062585   11.0062585   44.00181406 129.01006803
      38.01106576  13.0031746   12.00471655  30.99863946   9.0093424
      29.00172336  38.01106576  43.00335601  13.0031746   23.01097506
      40.00798186  41.00643991  50.99102041  11.0062585   43.00335601
      15.0000907   15.0000907   31.99709751  46.99718821  34.99247166
      30.00018141  32.99555556  35.99092971  32.99555556  47.99564626
     167.99637188  30.00018141  10.00780045  12.00471655  26.00634921
      70.00816327  30.99863946  38.01106576  60.00036281  15.99854875
      21.98929705  45.99873016  24.00943311  35.99092971  44.00181406
      44.00181406  25.00789116  16.9970068   45.99873016  28.00326531
      48.99410431  53.01115646  31.99709751  43.00335601  15.0000907
      46.99718821  40.00798186 124.99301587  11.0062585   30.99863946
      13.0031746   35.99092971  19.99238095  31.99709751  19.99238095
      45.00027211  13.0031746   16.9970068   19.99238095  38.01106576
      30.00018141  14.00163265  23.01097506  28.00326531  13.0031746
      47.99564626  13.0031746   16.9970068   39.00952381  39.00952381
      30.00018141  48.99410431  34.99247166 136.99773243  33.99401361
      23.01097506  36.98938776  15.99854875  14.00163265  43.00335601
      31.99709751  18.9939229   39.00952381  18.9939229   35.99092971
      30.00018141  15.0000907   50.99102041  15.99854875  74.00199546
      14.00163265 129.01006803  36.98938776  14.00163265  35.99092971
      30.99863946  15.0000907   12.00471655  34.99247166  18.9939229
      40.00798186  15.0000907   15.99854875  15.0000907   45.00027211
      16.9970068   30.00018141  15.0000907   35.99092971  16.9970068
      14.00163265  32.99555556  34.99247166  12.00471655  14.00163265
      28.00326531  10.00780045  15.99854875  35.99092971 164.00253968
      36.98938776  14.00163265  13.0031746   81.98965986  17.99546485
      36.98938776  15.99854875  33.99401361  13.0031746 ]' has dtype incompatible with int64, please explicitly cast to a compatible dtype first.
      df.update(pd.DataFrame(area,
    /Users/jsulloa/miniconda3/envs/maad/lib/python3.11/site-packages/maad/util/miscellaneous.py:977: FutureWarning: Setting an item of incompatible dtype is deprecated and will raise in a future error of pandas. Value '[  0.          0.          0.         86.1328125   0.          0.
       0.          0.          0.         86.1328125   0.          0.
      86.1328125   0.         86.1328125   0.          0.         86.1328125
       0.          0.          0.          0.          0.         86.1328125
     172.265625    0.          0.          0.          0.          0.
       0.          0.         86.1328125   0.          0.         86.1328125
      86.1328125  86.1328125   0.         86.1328125   0.          0.
       0.         86.1328125  86.1328125   0.         86.1328125  86.1328125
       0.         86.1328125 172.265625    0.          0.          0.
       0.        172.265625    0.         86.1328125  86.1328125   0.
       0.         86.1328125   0.          0.          0.         86.1328125
       0.          0.         86.1328125   0.          0.         86.1328125
       0.          0.          0.          0.          0.        172.265625
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.         86.1328125  86.1328125
       0.          0.         86.1328125   0.          0.         86.1328125
       0.         86.1328125   0.         86.1328125   0.         86.1328125
       0.        172.265625    0.         86.1328125  86.1328125   0.
       0.         86.1328125   0.          0.          0.          0.
      86.1328125   0.          0.         86.1328125   0.         86.1328125
       0.        172.265625    0.          0.          0.          0.
       0.          0.         86.1328125   0.          0.          0.
       0.          0.          0.          0.          0.          0.
      86.1328125   0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.        172.265625
       0.          0.          0.          0.          0.         86.1328125
       0.          0.          0.       ]' has dtype incompatible with int64, please explicitly cast to a compatible dtype first.
      df.update(pd.DataFrame(area,
    Total number of ROIs : 159
    Number of different ROIs :  8


.. GENERATED FROM PYTHON SOURCE LINES 228-237

Now we cluster the ROIS depending on 3 ROIS features :
- centroid_f : frequency position of the roi centroid 
- duration_t : duration of the roi
- bandwidth_f : frequency bandwidth of the roi
The clustering is done by the so-called KMeans clustering algorithm.
The number of attended clustering is the number of clusters found with 
manual annotation.
Finally, each rois is labeled with the corresponding cluster number predicted
by KMeans

.. GENERATED FROM PYTHON SOURCE LINES 237-283

.. code-block:: Python


    # select features to perform KMeans clustering
    FEATURES = ['centroid_f','duration_t','bandwidth_f','area_tf']

    # Prepare the features in order to have zero mean and same variance
    X = StandardScaler().fit_transform(df_centroid[FEATURES])

    # perform KMeans with the same number of clusters as with the manual annotation  
    NN_CLUSTERS = len(np.unique(df_rois_GT['label'])) 
    labels = KMeans(
       n_clusters=NN_CLUSTERS, 
       random_state=0, 
       n_init='auto'
       ).fit_predict(X)

    # Replace the unknow label by the cluster number predicted by KMeans
    df_centroid['label'] = [str(i) for i in labels] 

    # overlay color bounding box corresponding to the label, and centroids
    # on the original spectrogram
    ax2, fig2 = overlay_rois(
       im_ref=Sxx_db, 
       rois=df_centroid, 
       **{
          'vmin':0,
          'vmax':dB_max,
          'extent':ext
          }
       )

    ax2, fig2 = overlay_centroid(
       im_ref=Sxx_db, 
       centroid=df_centroid, 
       savefig=None, 
       **{
          'vmin':0,
          'vmax':dB_max,
          'extent':ext,
          'ms':2, 
          'marker':'+', 
          'color':'red',
          'fig':fig2, 
          'ax':ax2
          }
       )


.. image-sg:: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_007.png
   :alt: ROIs Overlay
   :srcset: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_007.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 284-293

It is possible to extract Rois directly from the audio waveform without 
computing the spectrogram. This works well if there is no big overlap between
each acoustic signature and you 
First, we have to define the frequency bandwidth where to find acoustic events
In our example, there are clearly 3 frequency bandwidths (low : l, medium:m
and high : h). 
We know that we have mostly short (ie. s) acoustic events in low, med and high
frequency bandwidths but also a long (ie l) acoustic events in med.
To extract 

.. GENERATED FROM PYTHON SOURCE LINES 293-357

.. code-block:: Python


    df_rois_sh = rois.find_rois_cwt(s, fs, flims=[7000, 8000], tlen=0.2, th=0.000001)
    df_rois_sm = rois.find_rois_cwt(s, fs, flims=[3500, 5500], tlen=0.2, th=0.000001)
    df_rois_lm = rois.find_rois_cwt(s, fs, flims=[2000, 7500], tlen=2,   th=0.0001)
    df_rois_sl = rois.find_rois_cwt(s, fs, flims=[1800, 3000], tlen=0.2, th=0.000001)

    ## concat df
    df_rois_WAV =pd.concat([df_rois_sh, df_rois_sm, df_rois_lm, df_rois_sl], ignore_index=True)

    # drop rows with frequency and time outside of tn and fn
    df_rois_WAV = df_rois_WAV[
       (df_rois_WAV.min_t >= tn.min()) & 
       (df_rois_WAV.max_t <= tn.max()) & 
       (df_rois_WAV.min_f >= fn.min()) & 
       (df_rois_WAV.max_f <= fn.max())
       ]

    # format dataframe df_rois_WAV in order to convert pixels into time and frequency
    df_rois_WAV = format_features(
       df=df_rois_WAV, 
       tn=tn, 
       fn=fn
       )

    # display rois
    ax3, fig3 = overlay_rois(
       im_ref=Sxx_db, 
       rois=df_rois_WAV,
       **{
          'vmin':0,
          'vmax':dB_max,
          'extent':ext
          }
       )

    # get features: centroids
    df_centroid_WAV = features.centroid_features(
       Sxx=Sxx_db,
       rois=df_rois_WAV
       )

    # format dataframe df_centroid_WAV in order to convert pixels into time and frequency
    df_centroid_WAV = format_features(
       df=df_centroid_WAV,
       tn=tn, 
       fn=fn
       )

    # display centroids
    ax3, fig3 = overlay_centroid(
       im_ref=Sxx_db, 
       centroid=df_centroid_WAV,
       savefig=None, 
       **{'vmin':0,
          'vmax':dB_max,
          'extent':ext,
          'ms':2,
          'marker':'+', 
          'color':'red',
          'fig':fig3, 
          'ax':ax3
          }
       )


.. image-sg:: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_008.png
   :alt: ROIs Overlay
   :srcset: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_008.png
   :class: sphx-glr-single-img


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    /Users/jsulloa/miniconda3/envs/maad/lib/python3.11/site-packages/maad/util/miscellaneous.py:977: FutureWarning: Setting an item of incompatible dtype is deprecated and will raise in a future error of pandas. Value '[ 23.01097506  23.01097506  23.01097506  23.01097506  23.01097506
      23.01097506  23.01097506  23.01097506  23.01097506  23.01097506
      23.01097506  23.01097506  23.01097506  23.01097506  23.01097506
      23.01097506  23.01097506  23.01097506  23.01097506  23.01097506
      23.01097506  23.01097506  23.01097506  23.01097506  23.01097506
      23.01097506  23.01097506  23.01097506  23.01097506  23.01097506
      23.01097506  23.01097506  23.01097506  23.01097506  23.01097506
      23.01097506  23.01097506  23.01097506  23.01097506  23.01097506
      23.01097506  23.01097506  23.01097506  23.01097506  23.01097506
      23.01097506  23.01097506  23.01097506  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821  46.99718821  46.99718821
      46.99718821  46.99718821  46.99718821 127.98839002 127.98839002
     127.98839002 127.98839002 127.98839002 127.98839002 127.98839002
     127.98839002 127.98839002  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531  28.00326531  28.00326531  28.00326531
      28.00326531  28.00326531]' has dtype incompatible with int64, please explicitly cast to a compatible dtype first.
      df.update(pd.DataFrame(area,
    /Users/jsulloa/miniconda3/envs/maad/lib/python3.11/site-packages/maad/util/miscellaneous.py:977: FutureWarning: Setting an item of incompatible dtype is deprecated and will raise in a future error of pandas. Value '[  0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.         86.1328125 172.265625
      86.1328125 172.265625   86.1328125   0.         86.1328125  86.1328125
      86.1328125   0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.
       0.          0.          0.          0.          0.          0.       ]' has dtype incompatible with int64, please explicitly cast to a compatible dtype first.
      df.update(pd.DataFrame(area,


.. GENERATED FROM PYTHON SOURCE LINES 358-359

Prepare the features in order to have zero mean and same variance

.. GENERATED FROM PYTHON SOURCE LINES 359-397

.. code-block:: Python

    X = StandardScaler().fit_transform(df_centroid_WAV[FEATURES])

    # perform KMeans with the same number of clusters as with the manual annotation  
    labels = KMeans(
       n_clusters=NN_CLUSTERS, 
       random_state=0,
       n_init='auto'
       ).fit_predict(X)

    # Replace the unknow label by the cluster number predicted by KMeans
    df_centroid_WAV['label'] = [str(i) for i in labels] 

    # overlay color bounding box corresponding to the label, and centroids
    # on the original spectrogram
    ax4, fig4 = overlay_rois(
       im_ref=Sxx_db, 
       rois=df_centroid_WAV, 
       **{
          'vmin':0,
          'vmax':dB_max,
          'extent':ext
          }
       )

    ax4, fig4 = overlay_centroid(
       im_ref=Sxx_db, 
       centroid=df_centroid_WAV, 
       savefig=None, 
       **{
          'vmin':0,
          'vmax':dB_max,
          'extent':ext,
          'ms':2,
          'fig':fig4, 
          'ax':ax4
          }
       )


.. image-sg:: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_009.png
   :alt: ROIs Overlay
   :srcset: /_auto_examples/2_advanced/images/sphx_glr_plot_compare_auto_and_manual_rois_selection_009.png
   :class: sphx-glr-single-img


.. GENERATED FROM PYTHON SOURCE LINES 398-400

**References** 

[1] Towsey, M., 2013b. Noise Removal from Wave-forms and Spectrograms Derived from Natural Recordings of the Environment. Queensland University of Technology, Brisbane


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** (0 minutes 9.403 seconds)


.. _sphx_glr_download__auto_examples_2_advanced_plot_compare_auto_and_manual_rois_selection.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_compare_auto_and_manual_rois_selection.ipynb <plot_compare_auto_and_manual_rois_selection.ipynb>`

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_compare_auto_and_manual_rois_selection.py <plot_compare_auto_and_manual_rois_selection.py>`

    .. container:: sphx-glr-download sphx-glr-download-zip

      :download:`Download zipped: plot_compare_auto_and_manual_rois_selection.zip <plot_compare_auto_and_manual_rois_selection.zip>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_