Submission Guidelines

Rules for Participation

To ensure a fair and standardized evaluation, all participants must adhere to the following rules:

1. Training Data Regulations

  • This is an open-condition challenge, and participants are permitted to use any public or private datasets to train their systems, in addition to the provided TidyVoiceX training partition. The use of all non-challenge data must be fully disclosed in the system description paper.
  • Strict restriction on Common Voice data: The only data permitted from the Mozilla Common Voice (MCV) dataset is the official TidyVoiceX dataset, as defined in the Challenge Task and Dataset section. The use of any other data from the MCV corpus is strictly forbidden.

2. Prohibition of Re-identification and Data Recombination

Any attempt to determine the real-world identity of a speaker, or to link speakers in the challenge data to individuals or records in external datasets, services, or metadata (“data recombination”), is strictly forbidden. This includes, but is not limited to, voiceprint matching against external corpora, cross-dataset record linkage, and the use of personally identifying metadata. Violations will result in disqualification, and the organizers will notify the relevant hosting institutions.

3. Scope of Tasks: Verification Only

This challenge evaluates speaker verification (same/different identity) on the provided splits. Speaker identification (closed-set or open-set classification among K speakers) is out of scope and will not be scored.
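
Concretely, a system must output one similarity score per trial pair, where higher scores indicate greater confidence that the two recordings come from the same speaker. A minimal sketch in Python (assuming your system already maps a recording to a fixed-dimensional speaker embedding; the scoring back-end is otherwise up to you):

import numpy as np

def cosine_score(enroll_emb: np.ndarray, test_emb: np.ndarray) -> float:
    # Cosine similarity between two speaker embeddings;
    # higher values suggest the same speaker.
    return float(np.dot(enroll_emb, test_emb) /
                 (np.linalg.norm(enroll_emb) * np.linalg.norm(test_emb)))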

4. Data Integrity

Manual correction or re-labeling of the officially provided challenge data is strictly prohibited.

5. Use of Pre-trained Models

The use of publicly available, pre-trained models is permitted and encouraged. This includes (but is not limited to):

  • Pre-trained speaker recognition models (e.g., models trained on VoxCeleb or VoxBlink)
  • Self-supervised learning (SSL) models (e.g., wav2vec2, HuBERT, WavLM)
  • Any other publicly available pre-trained models

All pre-trained models used must be explicitly and thoroughly declared in the system description paper, including their source, training data, and how they were integrated into the system.
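
For illustration only, one common integration pattern is to use an SSL checkpoint as a frozen feature extractor. The sketch below assumes the Hugging Face transformers library and the microsoft/wavlm-base-plus checkpoint; neither is required or endorsed by the challenge, and the mean pooling is a deliberately naive choice:

import torch
from transformers import AutoFeatureExtractor, WavLMModel

extractor = AutoFeatureExtractor.from_pretrained("microsoft/wavlm-base-plus")
model = WavLMModel.from_pretrained("microsoft/wavlm-base-plus").eval()

waveform = torch.randn(16000)  # placeholder: 1 second of 16 kHz audio
inputs = extractor(waveform.numpy(), sampling_rate=16000, return_tensors="pt")
with torch.no_grad():
    frames = model(**inputs).last_hidden_state   # (1, num_frames, hidden_dim)
embedding = frames.mean(dim=1).squeeze(0)        # naive mean-pooled utterance embedding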

6. Data Augmentation

The use of external, non-speech data for data augmentation (e.g., noise or reverberation from public corpora such as MUSAN) is permitted and encouraged, but must be explicitly and thoroughly disclosed in the system description paper.
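
As an example of an augmentation that would need to be disclosed, additive noise at a sampled SNR is a typical MUSAN-style recipe. The sketch below is illustrative only, not an official baseline; file loading and the SNR range are up to each team:

import numpy as np

def add_noise(speech: np.ndarray, noise: np.ndarray, snr_db: float) -> np.ndarray:
    # Mix a noise signal into speech at a target signal-to-noise ratio (dB).
    noise = np.resize(noise, speech.shape)        # tile/trim noise to speech length
    speech_power = np.mean(speech ** 2) + 1e-12
    noise_power = np.mean(noise ** 2) + 1e-12
    scale = np.sqrt(speech_power / (noise_power * 10 ** (snr_db / 10)))
    return speech + scale * noise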

7. Submission Process

Each participating team must submit its results on the evaluation set via the CodaBench platform; the platform link will be shared during the evaluation phase.

8. Eligibility for Final Ranking

To be eligible for inclusion in the final ranking and challenge results, participants must submit a system description paper to the dedicated Interspeech 2026 special session.

9. System Description

Each submission must be accompanied by a detailed system description paper in the Interspeech format that specifies:

  • All data usage (including external sets) and preprocessing techniques
  • Model architecture and all of its components
  • Computational resources and a comprehensive description of the training setup

Submission Format

Trial File Format

The evaluation trial list will be provided as a tab-separated text file. Each line in this file represents a single trial pair to be scored. The format for each line is:

enrollment_file    test_file

Example:

spk_A_enroll.wav    test_001.wav
spk_A_enroll.wav    test_002.wav
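
The trial list can be read line by line, keeping the pairs in their original order; here trials.txt is a placeholder for the released file name:

trials = []
with open("trials.txt") as f:
    for line in f:
        enrollment_file, test_file = line.rstrip("\n").split("\t")
        trials.append((enrollment_file, test_file))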

System Output Format

Participants must submit an output file (.txt format) containing a score for every trial specified in the trial file. The output file must follow the same order as the trial file and add a third column containing the similarity score. The format for each line is:

enrollment_file    test_file    score

Example:

spk_A_enroll.wav    test_001.wav    0.862
spk_A_enroll.wav    test_002.wav    0.124
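
Given the parsed trial list, the output file can be written in the same pass so the order is preserved automatically; score_pair below stands in for your system's scoring function, and scores.txt is a placeholder name:

with open("scores.txt", "w") as out:
    for enrollment_file, test_file in trials:
        score = score_pair(enrollment_file, test_file)  # your system's scorer
        out.write(f"{enrollment_file}\t{test_file}\t{score:.3f}\n")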

Contact for Technical Issues

Should any technical issues arise during the submission process, participants may alternatively submit their results via email. Please contact the organizers (aref.farhadipour@uzh.ch) for further assistance.