Voice Transcription Feature
Status: Requirements In Progress ready for Work In Progress Done
Technical Complexity: Easy Medium Hard / Complex
Summary:
Key point 1
Key point 2
Picture showing what this project is about (e.g. a key mockup, drawing; anything visual)
1. Problem
Research consistently shows that EHR use accounts for a disproportionate share of physician time — time that could otherwise be spent with patients. A study by the American Medical Association and Dartmouth-Hitchcock found that physicians spent 49% of their working day on EHRs and desk work, compared to just 27% in direct face time with patients, with an additional 1–2 hours of data entry after hours (Source). Supporting findings from separate studies estimate that physicians spend nearly 45% of their time on the EHR overall (Source), and approximately 4.5 hours per day on documentation alone (Source).
This administrative burden has a direct impact on the quality of care. Every patient visit generates significant documentation overhead, forcing clinicians to act as both caregivers and data entry operators simultaneously — splitting their attention between the patient in front of them and the screen and keyboard. The result is reduced face time, divided focus, and a clinical environment where the demands of the system compete directly with the needs of the patient.
Desired State:
In the desired state, clinicians are freed from the burden of real-time documentation, allowing them to focus entirely on the patient in front of them. Rather than dividing their attention between the conversation and the keyboard, doctors can maintain eye contact, build rapport, and engage fully in the clinical encounter.
In the background, an AI system listens to the consultation, transcribes the conversation, and automatically structures the relevant information into a clinical note. Once the visit is complete, the clinician simply reviews the generated note and approves it — reducing documentation time to a fraction of what it is today, without compromising accuracy or completeness.
2. User Stories
As a clinician, I want to maintain full eye contact and engagement with my patient during a consultation so that I can build trust and deliver better care without being distracted by documentation.
I want the consultation to be automatically transcribed so that I no longer need to type during patient visits.
I want the AI to structure the transcription into a clinical note so that I do not have to manually organise or format documentation after the visit.
I want to review and approve an AI-generated note so that I can confirm accuracy and make any corrections before it is saved to the patient record.
As a patient, I want my clinician to be fully present and attentive during my visit so that I feel heard and receive a higher quality of care.
3. Market Analysis
Solutions | Visuals | Key Features |
|---|---|---|
|
| |
Nabla Ambient Clinical Voice: |
|
|
Oracle Clinical AI Assistant: |
|
|
Oracle Health Clinical AI Assistant:
|
|
|
Generating SOAP Notes with AI: |
| How medical Large Language Models (LLMs) can automate the creation of SOAP notes—a standardized clinical documentation format (Subjective, Objective, Assessment, and Plan)
|
Athena Voice to Text: |
|
|
Market Opportunity for OpenMRS
OpenMRS serves the healthcare system primarily in low-to middle-income countries (LMICS) where:
Clinician-to-patient ratios are significantly high
Documentation burden is proportionately higher with fewer support staff
4. Technical Considerations & Dependencies
Choice of transcription engine (Speech-to-Text)
Selection of the AI/LLM model responsible for structuring the transcription into a clinical note
Ability to map transcribed content to standard clinical note formats (e.g. SOAP notes)
Mapping AI-generated content to the correct fields, concepts, and data structures within the EMR
Handling of structured data (e.g. diagnoses, medications, orders) vs. free text
Patient consent mechanisms before recording begins
Where data is processed and stored — on-device, or in the cloud etc
User interface for reviewing, editing, and approving generated notes
Ability to make corrections and have the AI learn from feedback over time
Reliable internet connectivity requirements, particularly in low-resource settings
Offline or low-bandwidth fallback options
Support for multiple languages and local dialects, particularly relevant for diverse or low-resource settings
5. Sketches