Project Goal
- Use the latest AI/ML tools and technologies available to automate Audiobook Quality Control (QC) tasks to save time and money.
- Reduce time spent on QC tasks from 2-3 days to a few minutes.
- A system that can be used by PRH divisions in all territories, not just the US.
Project Description
Orion QC (Quality Control) is a system to automate the testing and validation process of Audio Books produced by the PRH Audio Production Group. This system will supplement the existing manual Proofreading and QC process.
It is difficult for a human to pay attention to every detail, so letting a computer take over the tasks is beneficial.
Manual QC |
Orion QC |
|
General |
||
Unique, custom-built system. (There is nothing like this out there!) |
❌ |
✅ |
Manually listen to the audiobook and compares it to the manuscript |
✅ |
❌ |
Does not lead to fatigue, no human errors |
❌ |
✅ |
Proofread / QC in a few minutes |
❌ |
✅ |
Inputs |
||
Manuscript: PDF or eBook |
✅ |
(partial support) |
Manuscript: Text format |
✅ |
✅ |
Audio Files: WAV / MP3 / FLAC |
✅ |
✅ |
Proofreading / QC tasks |
||
Detect missing content |
✅ |
✅ |
Identify additional or repeated content |
✅ |
✅ |
Detect misread audio content |
✅ |
✅ |
Detect long silence (over 3 sec) |
✅ (difficult) |
✅ |
Detect Incidental noise and digital glitches |
✅ |
❌ (roadmap) |
Output |
||
Structured Excel document with discrepancies |
✅ |
✅ |
Problem location as a timestamp in the audio file |
✅ |
✅ |
Surrounding text at the problem location (before and after) |
✅ |
✅ |
Color-coded HTML file to clearly display discrepancies |
❌ |
✅ |
Play a short clip of the discrepancy |
❌ |
✅ |
Project Dates
Research and Proof of concept 2021
Alpha release Q3 2021
Beta release Q1 2022
Team
PRH US IT Content Applications Group:
* Srinivasan Muruganandam
* Justin Branco
* Dzmitry Kasinets
PRH US Audio Production Group:
* Ok Hee Kolwitz
Top 3 Benefits
- Takes a few minutes to QC hours of Audio Books, compared to 2-3 days for manual QC for a 5-hour audiobook
- Cost savings for the company
- Can proofread in any of the 37 languages supported by Amazon Transcribe
Top 3 Challenges
- Identifying true differences (that the audio team really cares about) between the manuscript and the audio file. Minimize false negatives/differences.
- Reference document / Manuscript text does not "exactly" match the output of Amazon Transcribe service. For example, the manuscript may have a date of 2/24/2022 and the audio transcript may have February 24, 2020, and the application should not show them as a difference
- Text matches should be exact, not fuzzy
Further Resources
More information about this project (Please contact me if you are unable to access the file)