Thank you very much for your excellent work.
One thing I am confused about is the definition of the crossmodal loss and coseparation loss functions. In train.py, why are a random number and opt.gt_percentage used to select which audio feature (audio_embedding_A1_pred or audio_embedding_A1_gt) is used? According to the method described in the paper, shouldn't the predicted features be used?
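
To make the question concrete, here is a minimal sketch of the kind of selection I mean. It is a hypothetical reconstruction based only on the names above, not the actual code in train.py; the helper `select_audio_embedding` and its `rng` parameter are made up for illustration, and I am assuming the ground-truth feature is the one picked with probability `gt_percentage`.

```python
import random


def select_audio_embedding(pred, gt, gt_percentage, rng=random):
    """Pick the audio embedding that is fed to the loss.

    Hypothetical helper (not from the repository): returns the ground-truth
    embedding with probability `gt_percentage`, otherwise the predicted one.
    """
    # rng.random() is uniform in [0.0, 1.0), so gt_percentage = 0.0 never
    # selects the ground truth and gt_percentage = 1.0 always selects it.
    if rng.random() < gt_percentage:
        return gt
    return pred


# Stand-ins for audio_embedding_A1_pred and audio_embedding_A1_gt.
audio_embedding_A1_pred = "pred"
audio_embedding_A1_gt = "gt"

chosen = select_audio_embedding(
    audio_embedding_A1_pred, audio_embedding_A1_gt, gt_percentage=0.5
)
```

Under this sketch, setting `gt_percentage` to 0 would always use the predicted features, which is what I expected from the paper.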