Text this: Note-based alignment using score-driven non-negative matrix factorisation for audio recordings.