audio_manual interface duplicates audio_spans when video field is present

Hi Prodigy team,

When I use the audio_manual interface with a task that contains both audio_spans and a video field, each audio_spans region is rendered twice in the interface.

Here is my recipe function:

import prodigy
from prodigy.components.stream import Stream

@prodigy.recipe('video.example')
def example():
    stream = [{
        'video': 'VIDEO_FILE_WITH_AUDIO.mp4',
        'audio_spans': [
            {'start': 1.89, 'end': 2.83, 'label': 'Segment_1'}
        ]
    }]
    stream = Stream.from_iterable(stream)

    view_id = 'audio_manual'
    dataset = 'test'

    return {
        'view_id': view_id,
        'dataset': dataset,
        'stream': stream,
        'config': {
            'labels': ['Example']
        }
    }
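
In case it helps narrow this down, here is a minimal sketch in plain Python (it is not part of the recipe, and the helper name count_duplicate_spans is hypothetical) that checks whether a task dict itself contains repeated audio_spans entries. It assumes two spans count as duplicates when their start, end, and label all match. The task above contains a single span, so the input has no repeats, which would suggest the duplication happens when the interface renders the task rather than in the stream.

```python
from collections import Counter


def count_duplicate_spans(task):
    """Return how many audio_spans entries repeat an earlier (start, end, label)."""
    spans = task.get("audio_spans", [])
    # Assumption: spans are duplicates when start, end, and label all match.
    counts = Counter((s["start"], s["end"], s.get("label")) for s in spans)
    return sum(n - 1 for n in counts.values())


# Same task dict as in the recipe above.
task = {
    "video": "VIDEO_FILE_WITH_AUDIO.mp4",
    "audio_spans": [
        {"start": 1.89, "end": 2.83, "label": "Segment_1"},
    ],
}

print(count_duplicate_spans(task))
```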

My questions:

  • Is this a known bug with audio_manual when a video field is present?
  • Is there a recommended approach for combined video + audio span annotation?

Thank you!