I edit a podcast and have more than a decade of pro audio editing experience in the film industry, and I was already using a commercial AI transcription service to render the content to text and sometimes edit it as such (outputting edited audio).

Existing (and affordable) offerings are so good that they can cope with shitty recordings off a phone speaker and maintain ~97% accuracy over hour-long conversations. I'm sure it's been an absolute godsend for law enforcement and other people who need to gather poor-quality audio at scale, though much less great for the targets of repressive authority.

Having this fully open is a big deal though - now that level of transcription ability can be wrapped as an audio plugin and just used wherever. Given the parallel advances in resynthesis and understanding idiomatic speech, in a year or two I probably won't need to cut out all those "uuh", "like", "um" and "y'know" by hand ever again, and every recording can be given a noise reduction bath and come out sounding like it was recorded in a room full of soft furniture.

That just means 2-3% of your content needs to be double-checked by a person at the audio level, saving huge amounts of time - equally true of human transcription, in which individual words are often […].

Would you want to review this fully before going into court? Absolutely - because you'd want to play the recording to a jury for emotional impact. Can you rely on it when you want to quickly read through hours of conversation and make decisions about whether to invest further resources (which might just mean another hour of listening back to the original audio)? Also absolutely.

Bear in mind that a lot of these errors have little to no semantic impact, being on the same level as typos or misspellings in a written communication. Bear in mind too that if law enforcement (honest or not) is so interested in you that they're willing to record your conversations, your day is already ruined, you just don't know it yet.

By the time you're prosecuting someone in court, yes of course you double, triple, quadruple check everything. But yes, you can identify which content probably has errors and flag it as such. That's why lawyers get paid the big bucks (for now). The change here is one of scale rather than quality.
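For what it's worth, here is a rough sketch of how flagging the parts of a transcript that probably have errors could work, so a person only listens back to those few percent. It assumes the transcription tool hands you per-word timestamps and some kind of confidence score. The `Word` fields, the `flag_for_review` helper, the threshold and the padding are all made up for illustration and are not any particular product's API.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    start: float       # seconds into the recording
    end: float         # seconds into the recording
    confidence: float  # hypothetical score in [0, 1] from the transcriber


def flag_for_review(words, threshold=0.6, padding=0.5):
    """Return merged (start, end) ranges around low-confidence words.

    Assumes `words` is sorted by start time. `padding` adds a little
    audio on each side so the reviewer hears the word in context.
    """
    ranges = []
    for w in words:
        if w.confidence >= threshold:
            continue  # confident enough, no need to listen back
        start = max(0.0, w.start - padding)
        end = w.end + padding
        if ranges and start <= ranges[-1][1]:
            # Overlaps the previous flagged range, so extend it.
            ranges[-1] = (ranges[-1][0], max(ranges[-1][1], end))
        else:
            ranges.append((start, end))
    return ranges


# Illustrative data only, not real transcription output.
words = [
    Word("the", 0.0, 0.25, 0.98),
    Word("defendant", 0.25, 1.0, 0.41),
    Word("said", 1.0, 1.25, 0.95),
    Word("uh", 5.0, 5.25, 0.30),
    Word("maybe", 5.5, 6.0, 0.55),
]
print(flag_for_review(words))  # [(0.0, 1.5), (4.5, 6.5)]
```

Merging nearby ranges means the reviewer gets a handful of short clips to check rather than one clip per dubious word, which is where most of the time saving would come from.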