I want to evaluate a sequential tone detection system. The predictions look similar to what is expected, but the predicted sequence and the actual (ground-truth) sequence have different lengths. Because the two cannot be compared element by element, a confusion matrix or F1-score cannot be computed directly. What evaluation method is recommended for this scenario?
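To make the length mismatch concrete, here is a minimal sketch of one candidate approach: an edit-distance (Levenshtein) error rate, similar in spirit to word error rate in speech recognition. It is an assumption on my part that this fits the task, not an established answer. The function names `edit_distance` and `tone_error_rate` and the integer tone labels are hypothetical, chosen only for illustration.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two label sequences.

    Counts the minimum number of substitutions, insertions, and
    deletions needed to turn `hyp` into `ref`.
    """
    # prev[j] = distance between the first i-1 items of ref and first j of hyp
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # item of ref missing from hyp (deletion)
                curr[j - 1] + 1,     # extra item in hyp (insertion)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def tone_error_rate(ref, hyp):
    """Edit distance normalised by the reference length (hypothetical metric name)."""
    if len(ref) == 0:
        raise ValueError("reference sequence must not be empty")
    return edit_distance(ref, hyp) / len(ref)


# Hypothetical tone labels: the prediction has one tone fewer than the reference.
actual = [1, 4, 2, 3, 3]
predicted = [1, 4, 3, 3]

print(edit_distance(actual, predicted))    # 1 (one missing tone)
print(tone_error_rate(actual, predicted))  # 0.2
```

Unlike a confusion matrix, this does not require the two sequences to have the same length, because the alignment between them is found as part of the computation.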