在线时间:8:00-16:00
迪恩网络APP
随时随地掌握行业动态
扫描二维码
关注迪恩网络微信公众号
开源软件名称(OpenSource Name):DTaoo/Discriminative-Sounding-Objects-Localization开源软件地址(OpenSource Url):https://github.com/DTaoo/Discriminative-Sounding-Objects-Localization开源编程语言(OpenSource Language):Python 99.1%开源软件介绍(OpenSource Introduction):Discriminative Sounding Objects LocalizationCode for our NeurIPS 2020 paper Discriminative Sounding Objects Localization via Self-supervised Audiovisual Matching (The previous title is Learning to Discriminatively Localize Sounding Objects in a Cocktail-party Scenario). The code is implemented on PyTorch with python3. The code for TPAMI version can be found here: https://github.com/GeWu-Lab/CSOL_TPAMI2021. Requirements
Running ProcedureFor experiments on Music or AudioSet-instrument, the training and evaluation procedures are similar, respectively under the folder Data Preparation
The sounding object bounding box annotations on solo and duet are stored in TrainingStage one
After training of stage one, we will get the cluster pseudo labels and object dictionary of different classes in the folder Stage two
EvaluationStage oneWe first generate localization results and save then as a pkl file, then calculate metrics, IoU and AUC and also generate visualizations, by running
Stage twoFor evaluation of stage two, i.e., class-aware sounding object localization in multi-source scenes, we first match the cluster pseudo labels generated in stage one with gt labels to accordingly assign one object category to each center representation in the object dictionary by running
It is necessary to manually ensure there is one-to-one matching between object category and each center representation. Then we generate the localization results and calculate metrics, CIoU AUC and NSA, by running
ResultsThe two tables respectively show our model's performance on single-source and multi-source scenarios. The following figures show the category-aware localization results under multi-source scenes. The green boxes mean the sounding objects while the red boxes are silent ones. |
2023-10-27
2022-08-15
2022-08-17
2022-09-23
2022-08-13
请发表评论