Data Analysis

During initial data collection, it can be useful to set record_duration: to 2-5 minutes and then rename each recording as it completes, following this pattern:

2024-08-10_13:54:00_TRAIN-clap.mkv
2024-08-10_13:58:17_TRAIN-clap.mkv
2024-08-10_14:00:26_TEST-nomatch.mkv
2024-08-10_14:02:38_TEST-nomatch.mkv
2024-08-10_14:04:57_TEST-nomatch.mkv
2024-08-10_14:07:20_TEST-nomatch.mkv
2024-08-10_14:09:45_TEST-clap.mkv
2024-08-10_14:17:07_TEST-nomatch.mkv
2024-08-10_14:19:35_TEST-nomatch.mkv
2024-08-10_14:22:02_TEST-clap.mkv
  • TRAIN files were created with many variations of the sound being searched for, using different background noise, volumes, etc. These files will be cut into 1-second clips for model training.

  • TEST files include variation, but may have only one instance of the sound being searched for: one needle in the haystack. These will be used to test the quality of each ML iteration.
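The naming pattern above can be read back programmatically when sorting recordings. The following is a minimal sketch, not part of APR: `Recording` and `parse_recording_name` are hypothetical names, and the pattern assumes every file follows the `<date>_<time>_<TRAIN|TEST>-<label>.mkv` layout shown in the example list.

```python
import re
from datetime import datetime
from typing import NamedTuple


class Recording(NamedTuple):
    """Fields recovered from a renamed recording (hypothetical helper type)."""
    started: datetime
    split: str   # "TRAIN" or "TEST"
    label: str   # e.g. "clap" or "nomatch"


# Assumed layout: 2024-08-10_13:54:00_TRAIN-clap.mkv
_PATTERN = re.compile(
    r"^(\d{4}-\d{2}-\d{2}_\d{2}:\d{2}:\d{2})_(TRAIN|TEST)-([A-Za-z0-9]+)\.mkv$"
)


def parse_recording_name(filename: str) -> Recording:
    """Split a recording filename into start time, split, and label."""
    match = _PATTERN.match(filename)
    if match is None:
        raise ValueError(f"unrecognized recording name: {filename!r}")
    stamp, split, label = match.groups()
    started = datetime.strptime(stamp, "%Y-%m-%d_%H:%M:%S")
    return Recording(started, split, label)
```

Rejecting names that do not match makes it easier to spot recordings that were never renamed.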

Once data is collected, it can be retrieved from _workspace/rotating on the recording device and copied to the same _workspace/rotating location on the device used for training.

Testing Data:

Test data is essentially the same as training data, except it is collected with the intent of being used only for testing.

Follow the tagging process, then move the data to _workspace/test/<model>/ or _workspace/test/nomatch/.

Ultimately, these videos will be used to determine the accuracy of each model.
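Moving tagged test recordings into place could be scripted along these lines. This is a sketch under assumptions, not APR functionality: `destination_for` and `move_test_recording` are hypothetical helpers, and the model directory name is assumed to equal the label after `TEST-` in the filename (so `TEST-nomatch` lands in `test/nomatch/`).

```python
import shutil
from pathlib import Path


def destination_for(recording: Path, workspace: Path) -> Path:
    """Return the test directory path for a TEST recording.

    Assumption: the folder name under test/ is the label in the filename.
    """
    stem = recording.stem  # e.g. "2024-08-10_14:09:45_TEST-clap"
    _, sep, label = stem.rpartition("_TEST-")
    if not sep or not label:
        raise ValueError(f"not a TEST recording: {recording.name!r}")
    return workspace / "test" / label / recording.name


def move_test_recording(recording: Path, workspace: Path) -> Path:
    """Move one recording into its test directory, creating it if needed."""
    target = destination_for(recording, workspace)
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.move(str(recording), str(target))
    return target
```

For example, `move_test_recording(Path("_workspace/rotating/2024-08-10_14:09:45_TEST-clap.mkv"), Path("_workspace"))` would place the file under `_workspace/test/clap/`.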

Training Data:

To determine whether a sound is or is not a match, the source audio must be broken up into short, consumable segments, and the segments matching the target model must be reviewed and saved (tagged) manually.

Project Timing

  • APR is designed for generating reports.

  • Report granularity uses 1-minute cycles: 1 clap or 999 claps within 1 minute is logged as one hit.

  • Each recording is broken into 1-second clips.

  • Each clip overlaps the next by 0.1 seconds to prevent dead zones.
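The timing rules above can be illustrated with a little arithmetic. The sketch below is not APR's implementation: `clip_windows` and `minute_hits` are hypothetical names, the 0.1-second overlap is assumed to mean each clip starts 0.9 seconds after the previous one, and a trailing partial clip is simply dropped (how APR handles the tail of a recording is not stated here). Times are kept in integer milliseconds to avoid floating-point drift.

```python
from typing import Iterable, List, Tuple


def clip_windows(
    duration_ms: int, clip_ms: int = 1000, overlap_ms: int = 100
) -> List[Tuple[int, int]]:
    """Return (start, end) times in ms for overlapping fixed-length clips."""
    if not 0 <= overlap_ms < clip_ms:
        raise ValueError("overlap must be non-negative and shorter than a clip")
    step = clip_ms - overlap_ms  # 900 ms between clip starts by default
    windows = []
    start = 0
    # Only full-length clips are emitted; a shorter tail is dropped (assumption).
    while start + clip_ms <= duration_ms:
        windows.append((start, start + clip_ms))
        start += step
    return windows


def minute_hits(detection_seconds: Iterable[float]) -> List[int]:
    """Collapse detection times into the 1-minute cycles that contain them.

    Any number of detections inside the same minute counts as one hit.
    """
    return sorted({int(t // 60) for t in detection_seconds})
```

With these defaults, `clip_windows(3000)` gives `[(0, 1000), (900, 1900), (1800, 2800)]`, and `minute_hits([5, 12.5, 59.9, 61])` gives `[0, 1]`: three detections in the first minute are reported as a single hit.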

Open and review captured recordings (from rotating/) using the inspection tool:

python3 -m apr -a review

The review option provides a GUI to help simplify the process of reviewing and tagging 1-second clips.

Keyboard Shortcuts:

  • Left/Right: Navigate 1 frame left or right

  • PgUp/PgDn: Navigate 60 frames left or right

  • Home/End: Navigate to start or end

  • Up: Replay audio clip