The main obstacle is that parallelization is hard-coded in the `cnmf` module. To implement this, we will need to copy that code and modify it.
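
As a rough sketch of what "copy and modify" could look like, the copied function can take the parallel backend as an argument instead of creating it internally. Everything below is hypothetical: `fit_patch`, `run_hardcoded`, and `run_pluggable` are illustrative names, not functions from `cnmf`, and `ThreadPoolExecutor` only stands in for whatever backend the module actually uses.

```python
from concurrent.futures import Executor, ThreadPoolExecutor
from typing import List, Optional, Sequence


def fit_patch(patch: Sequence[float]) -> float:
    """Hypothetical stand-in for the per-chunk work done in parallel."""
    return sum(x * x for x in patch)


def run_hardcoded(patches: Sequence[Sequence[float]]) -> List[float]:
    # Assumed shape of the existing code: the pool is created inside
    # the function, so callers cannot swap or disable it.
    with ThreadPoolExecutor(max_workers=4) as pool:
        return list(pool.map(fit_patch, patches))


def run_pluggable(
    patches: Sequence[Sequence[float]],
    executor: Optional[Executor] = None,
) -> List[float]:
    # Modified copy: the caller supplies the executor.
    # With no executor, the work runs serially in the calling thread.
    if executor is None:
        return [fit_patch(p) for p in patches]
    return list(executor.map(fit_patch, patches))
```

With this shape, the caller decides how the work is distributed, for example `run_pluggable(patches)` for a serial run or `run_pluggable(patches, executor=pool)` with any `concurrent.futures.Executor`.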