Viterbi algorithm does not apply to activation probabilities

I would like to use the output of CREPE to determine whether the singer is active or silent at the perceptual level. That state should change on the scale of seconds, not milliseconds. Setting a hard threshold on the confidence, though, results in rapid alternation between the two states. The alternation shows up as the thick vertical lines in the plots below.

Viterbi decoding would be a straightforward way to smooth this out. The current version, though, only applies Viterbi smoothing to the pitch. I wrote an extension and opened a pull request in case it is useful for others: https://github.com/marl/crepe/pull/26.
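To illustrate the idea, here is a minimal sketch of two-state Viterbi smoothing over the per-frame confidence. It is not the code from the pull request. The function name `smooth_voicing` and the `p_stay` parameter are hypothetical, and it assumes the confidence can be treated as the probability that a frame is active, which may not hold exactly in practice.

```python
import numpy as np

def smooth_voicing(confidence, p_stay=0.99):
    """Two-state Viterbi decode (0 = silent, 1 = active).

    Assumption: `confidence` is read as P(active) for each frame.
    `p_stay` is a hypothetical self-transition probability; values
    closer to 1 make state changes rarer.
    """
    conf = np.clip(np.asarray(confidence, dtype=float), 1e-6, 1 - 1e-6)
    n = len(conf)
    if n == 0:
        return np.zeros(0, dtype=int)

    # Emission log-likelihoods: column 0 = silent, column 1 = active.
    log_emit = np.log(np.stack([1.0 - conf, conf], axis=1))
    # log_trans[i, j] is the log-probability of moving from state i to j.
    log_trans = np.log(np.array([[p_stay, 1.0 - p_stay],
                                 [1.0 - p_stay, p_stay]]))

    score = np.log(0.5) + log_emit[0]      # uniform prior over the two states
    back = np.zeros((n, 2), dtype=int)     # best predecessor for each state
    for t in range(1, n):
        cand = score[:, None] + log_trans  # cand[i, j]: arrive in j from i
        back[t] = np.argmax(cand, axis=0)
        score = cand[back[t], [0, 1]] + log_emit[t]

    # Trace the best path backwards from the final frame.
    path = np.zeros(n, dtype=int)
    path[-1] = np.argmax(score)
    for t in range(n - 1, 0, -1):
        path[t - 1] = back[t, path[t]]
    return path
```

Calling `smooth_voicing(conf)` on the confidence column returns a 0/1 array that can be plotted in place of the hard-thresholded one. With `p_stay` close to 1, a single low-confidence frame inside a sung passage no longer flips the state, while a sustained drop in confidence still does.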

<img width="644" alt="Screen Shot 2020-06-02 at 4 59 50 PM" src="https://user-images.githubusercontent.com/12517650/83581473-bf32b100-a4f3-11ea-8ecb-6bd97b0a9d68.png">

<img width="643" alt="Screen Shot 2020-06-02 at 5 15 31 PM" src="https://user-images.githubusercontent.com/12517650/83581849-c5755d00-a4f4-11ea-9855-b33d7f4f4806.png">


Code for this plot:
```python
import csv
import matplotlib.pyplot as plt
import numpy as np

f0 = []
conf = []
thresh = 0.5  # hard confidence threshold for the active/silent decision

# The CREPE CSV has one row per frame: time, frequency, confidence
with open('MUSDB18HQ/train/Music Delta - Hendrix/vocals.f0.csv') as csv_file:
    csv_reader = csv.reader(csv_file, delimiter=',')
    header = next(csv_reader)  # first row holds the column names
    print(f'Column names are {", ".join(header)}')
    for row in csv_reader:
        f0.append(float(row[1]))
        conf.append(float(row[2]))
    print(f'Processed {len(f0) + 1} lines.')  # data rows plus the header

# 1 where the confidence exceeds the threshold, 0 otherwise
voiced = [1 if c > thresh else 0 for c in conf]
# plt.plot(np.array(f0) * np.array(voiced))
plt.plot(np.array(voiced))
plt.show()
```
