In a speech recognition system, a method and system for updating a language model during a correction session can include automatically comparing dictated text to replacement text, determining if the replacement text is on an alternative word list if the comparison is close enough to indicate that the replacement text represents correction of a mis-recognition error rather than an edit, and updating the language model without user interaction if the replacement text is on the alternative word list. If the replacement text is not on the alternative word list, a comparison is made between dictated word digital information and replacement word digital information, and the language model is updated if the digital comparison is close enough to indicate that the replacement text represents correction of a mis-recognition error rather than an edit.
8. A system for updating a language model during a correction session, comprising:
a means for automatically comparing a dictated word to a replacement word; if said comparison is close enough, within a predetermined statistical quantity, to indicate that said replacement word represents correction of a misrecognition error rather than an edit, a means for determining if said replacement word is on an alternative word list; and if said replacement word is on said alternative word list, a means for updating said language model without user interaction.
1. In a speech recognition system, a method of updating a language model during a correction session, comprising the steps of:
automatically comparing a dictated word to a replacement word; if said comparison is close enough, within a predetermined statistical quantity, to indicate that said replacement word represents correction of a misrecognition error rather than an edit, determining if said replacement word is on an alternative word list; and if said replacement word is on said alternative word list, updating said language model without user interaction.
15. A machine readable storage, having stored thereon a computer program having a plurality of code sections executable by a machine for causing the machine to perform the steps of:
automatically comparing a dictated word to a replacement word; if said comparison is close enough, within a predetermined statistical quantity, to indicate that said replacement word represents correction of a misrecognition error rather than an edit, determining if said replacement word is on an alternative word list; and if said replacement word is on said alternative word list, updating said language model without user interaction.
4. In a speech recognition system, a method of updating a language model during a correction session, comprising the steps of:
automatically comparing a dictated word to a replacement word; if said comparison is close enough, within a predetermined statistical quantity, to indicate that said replacement word represents correction of a misrecognition error rather than an edit, determining if said replacement word is on an alternative word list; and if said replacement word is not on said alternative word list, comparing dictated word digital information to replacement word digital information, and if said digital comparison is close enough, within a predetermined statistical quantity, to indicate that said replacement word represents correction of a misrecognition error rather than an edit, updating said language model.
11. A system for updating a language model during a correction session, comprising:
a means for automatically comparing a dictated word to a replacement word; if said comparison is close enough, within a predetermined statistical quantity, to indicate that said replacement word represents correction of a misrecognition error rather than an edit, a means for determining if said replacement word is on an alternative word list; and if said replacement word is not on said alternative word list, a means for comparing dictated word digital information to replacement word digital information, and if said digital comparison is close enough, within a predetermined statistical quantity, to indicate that said replacement word represents correction of a misrecognition error rather than an edit, a means for updating said language model.
18. A machine readable storage, having stored thereon a computer program having a plurality of code sections executable by a machine for causing the machine to perform the steps of:
automatically comparing a dictated word to a replacement word; if said comparison is close enough, within a predetermined statistical quantity, to indicate that said replacement word represents correction of a misrecognition error rather than an edit, determining if said replacement word is on an alternative word list; and if said replacement word is not on said alternative word list, comparing dictated word digital information to replacement word digital information, and if said digital comparison is close enough, within a predetermined statistical quantity, to indicate that said replacement word represents correction of a misrecognition error rather than an edit, updating said language model.
2. The method of
3. The method of
5. The method of
converting audio of said dictated word into dictated word digital information; converting said replacement word into replacement word digital information; and using said dictated word digital information and said replacement word digital information in said digital comparison step.
6. The method of
7. The method of
9. The system of
10. The system of
12. The system of
a means for converting audio of said dictated word into dictated word digital information; a means for converting said replacement word into replacement word digital information; and a means for using said dictated word digital information and said replacement word digital information in said digital comparing means.
13. The system of
14. The system of
16. The machine readable storage of
17. The machine readable storage of
19. The machine readable storage of
converting audio of said dictated word into dictated word digital information; converting said replacement word into replacement word digital information; and using said dictated word digital information and said replacement word digital information in said digital comparison step.
20. The machine readable storage of
21. The machine readable storage of
1. Field of the Invention
This invention relates generally to speech dictation systems, and more particularly to a method of updating language models in speech recognition engines of speech applications during sessions in which speech misrecognitions are corrected.
2. Description of Related Art
Speech recognition is the process by which an acoustic signal received by a transducive element, such as a microphone, is converted to a set of text words by a computer. These recognized words may then be used in a variety of computer software applications for purposes such as document preparation, data entry, and command and control. Improvements to speech dictation systems provide an important way to enhance user productivity. One style of improvement is to offer users the ability to make changes directly to dictated text, bypassing interaction with correction dialogs. Unless the system monitors changes and decides which are corrections to be sent to the speech engine for processing as corrections, and which are edits to be ignored by the system, the user will not receive the benefit of continual improvement in recognition accuracy that occurs when the engine receives correction information.
In a speech recognition system, a method of updating a language model for use when correcting dictated text comprises the steps of dictating a dictated word, providing a replacement word, and automatically comparing the dictated word to the replacement word using any suitable comparison means, such as using an algorithm to compare phonetics, grammar, spelling, or the context of surrounding words. If the comparison is close enough, within a predetermined statistical quantity, to indicate that the replacement word is a correction of a mis-recognition error rather than an edit, the method further comprises the step of determining if the replacement word is on an alternative word list. The alternative word list can be preexisting or can be generated by any suitable method, including by the use of an algorithm which identifies words which have similar phonetics, grammar, and/or spelling. The method further comprises updating the language model without user interaction if the replacement word is on the alternative word list. If the replacement word is not on the alternative word list, dictated word digital information is compared to replacement word digital information, and the language model is updated if the digital comparison is close enough, within a predetermined statistical quantity, to indicate that the replacement word represents correction of a mis-recognition error rather than an edit.
The method can further comprise the steps of, prior to the digital comparison step, converting the audio of the dictated word into dictated word digital information and the text of the replacement word into replacement word digital information, and using the dictated word digital information and the replacement word digital information in the digital comparison step.
In the method, the replacement word can be generated by any suitable method, such as typing over the dictated word, pasting over the dictated word, or deleting the dictated word and replacing it with the replacement word. The dictated word can consist of a single word or a plurality of words, but is generally a single word. Similarly, the replacement word can consist of a single word or a plurality of words, but is generally a single word.
According to a second aspect of the invention, the invention comprises a system for updating a language model during a correction session, which comprises a means for automatically comparing a dictated word to a replacement word using any suitable comparison means, such as using an algorithm to compare phonetics, grammar, spelling, and/or the context of surrounding words. If the comparison is close enough, within a predetermined statistical quantity, to indicate that the replacement word represents correction of a misrecognition error rather than an edit, the system further comprises a means for determining if the replacement word is on an alternative word list, and a means for updating the language model without user interaction if the replacement word is on the alternative word list. The alternative word list can be preexisting or can be generated by any suitable means, including by the use of an algorithm which identifies words which have similar phonetics, grammar, and/or spelling. If the replacement word is not on the alternative word list, the system further comprises a means for comparing dictated word digital information to replacement word digital information, and if the digital comparison is close enough, within a predetermined statistical quantity, to indicate that the replacement word represents correction of a misrecognition error rather than an edit, a means for updating the language model.
According to a third aspect, the invention comprises a machine readable storage, having stored thereon a computer program having a plurality of code sections executable by a machine for causing the machine to perform a series of steps. The machine readable storage causes the machine to perform the step of automatically comparing a dictated word to a replacement word using any suitable comparison means, including using an algorithm to compare phonetics, grammar, spelling, and/or the context of surrounding words. Further, the machine readable storage causes the machine to perform the steps of determining if the replacement word is on an alternative word list if the comparison is close enough, within a predetermined statistical quantity, to indicate that the replacement word represents correction of a misrecognition error rather than an edit, and updating the language model without user interaction if the replacement word is on the alternative word list. If the replacement word is not on the alternative word list, the machine readable storage causes the machine to perform the step of comparing dictated word digital information to replacement word digital information, and if the digital comparison is close enough, within a predetermined statistical quantity, to indicate that the replacement word represents correction of a misrecognition error rather than an edit, updating the language model.
There are presently shown in the drawings embodiments which are presently preferred, it being understood, however, that the invention is not limited to the precise arrangements and instrumentalities shown, wherein:
The various hardware requirements for the computer system as described herein can generally be satisfied by any one of many commercially available high speed multimedia personal computers offered by manufacturers such as International Business Machines Corporation (IBM).
In a preferred embodiment which shall be discussed herein, operating system 24 is one of the Windows family of operating systems, such as Windows NT, Windows 95, or Windows 98, all of which are available from Microsoft Corporation of Redmond, Washington. The system is not limited in this regard, however, and the invention can also be used with any other type of computer operating system. The system as disclosed herein can be implemented by a programmer, using commercially available development tools for the operating systems described above. As shown in
Audio signals representative of sound received in microphone 30 or contained in a recording on a transcription device are processed within computer 20 using conventional computer audio circuitry so as to be made available to the operating system 24 in digitized form. The audio signals received by the computer are conventionally provided to the speech recognition engine application 26 via the computer operating system 24 in order to perform speech recognition functions. As in conventional speech recognition systems, the audio signals are processed by the speech recognition engine 26 to identify words spoken by a user into microphone 30 or words spoken by a user and recorded on a transcription device.
Audio that is recorded on a transcription device can be transferred to the speech recognition system in a number of ways. The transcription device is connected to the computer system using a suitable cable. In the case of digital transcription devices, a digital output on the transcription device can be connected to a digital input on the computer system. Alternatively, in the case of analog transcription devices, a cable can be connected from the transcription device's analog output to the analog input of the computer system's sound board. One variety of transcription device contains software which cooperates with the speech recognition system. Such software enables the speech recognition system to view dictation recordings located on transcription devices as computer files similar to the manner in which files can be viewed on a magnetic disk drive. For example, when properly connected to the computer system, the transcription device can appear to the speech recognition application as a bulk data storage medium, such as a magnetic disk drive. In this case, the user may open a dialog box while in the speech recognition application and select the dictation recording to transfer to the speech recognition system. The dictation recording is then transferred from the dictation device, to the computer system, and to the speech recognition system as a computer file.
Another variety of transcription device comes with software tools that copy the dictation recording to the computer system. In this case, the transcription device is connected to the computer system in the manner described above. The software tools provided with the transcription device can be used to transfer the dictation recording from the transcription device to the computer system, storing the dictation recording as a computer file. Then, through the use of a dialog box in the speech recognition application, the user can select the desired dictation recording which appears as a computer file from the computer system's hard drive.
Regardless of how the dictation recording is transferred, it should be appreciated that either a digital recording or an analog recording can be transferred. In the case of an analog recording, as the transcription device plays the dictation recording, the computer system can digitally record the dictation recording. The resulting computer file containing the dictation recording can then be made available to the speech recognition system.
A method for automatically updating language models in a speech recognition application, in accordance with an inventive arrangement, is illustrated by flow chart 50 in FIG. 4. From start block 52, a speaker undertakes a speech recognition session with a speech application in accordance with the step of block 54.
According to a preferred embodiment of the invention, the system monitors whether a dictated word is replaced by a replacement word. It should be understood that the dictated word can be a plurality of dictated words and the replacement word can be a plurality of replacement words. In most cases, however, the dictated word and the replacement word will each consist of a single word.
There are numerous situations in which the system will determine that a dictated word has been replaced by a replacement word. For example, if a new word is typed or otherwise inserted into a document, a determination is made as to whether the user has removed text immediately contiguous to the new word which has been inserted. If such removal has occurred, the system presumes that a misrecognition error has occurred and that the new word is a replacement word. Similarly, if the backspace key or the delete key has been used to remove characters immediately contiguous to new text, the system again concludes that a misrecognition error has occurred and that the new text is considered a replacement word. In contrast, if new text is inserted without overwriting dictated text, the system can conclude that the new text is simply being added and that no speech misrecognition error has occurred. In such a case, the new text is not characterized as a replacement word.
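The replacement heuristic described above can be sketched as follows. This is a minimal illustration only: the `EditEvent` record, its field names, and the assumption that edits have already been normalized to whole words (so that deleting one "e" from "steep" is reported as "steep" replaced by "step") are hypothetical and are not taken from the specification.

```python
from dataclasses import dataclass


@dataclass
class EditEvent:
    """Hypothetical word-level record of one user edit."""
    new_text: str             # text present after the edit
    removed_text: str         # dictated text that the edit removed or overwrote
    removal_contiguous: bool  # True if the removal was immediately adjacent to the new text


def is_replacement(event: EditEvent) -> bool:
    """Treat the edit as a replacement only if dictated text was removed
    immediately contiguous to the newly entered text."""
    return bool(event.new_text) and bool(event.removed_text) and event.removal_contiguous


# Overwriting dictated text counts as a replacement.
print(is_replacement(EditEvent("step", "steep", True)))
# Inserting new text without removing dictated text does not.
print(is_replacement(EditEvent("very", "", False)))
```

Text that is merely added, with nothing removed next to it, falls through as a plain edit and is never passed on to the comparison steps.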
In the step of block 56, the system initially detects whether a dictated word has been replaced by a replacement word. Such replacement may occur by typing over all or a portion of a dictated word, pasting over all or a portion of a dictated word, or deleting all or a portion of a dictated word and replacing it with a replacement word. It should be understood, however, that the invention is not limited to these specific replacement methods, and that replacement may occur by any suitable replacement method known in the art. The dictated word may consist of a single word or a plurality of words. Similarly, the replacement word may consist of a single word or a plurality of words.
If a replacement is not made in accordance with block 56, the system branches to the step of block 74, which detects whether additional input is available for evaluation. If more input is available for evaluation, the system branches back to the step of block 54. Otherwise, the method branches on to the step of block 76, in accordance with which the algorithm of the present invention stops and awaits a signal to return to the starting step of block 52.
If a determination is made that a dictated word has been replaced with a replacement word, in accordance with the step of block 56, the method branches to the step of block 58, which compares the dictated word to the replacement word. Afterwards, according to block 60, a determination is made as to whether the replacement word is on an alternative word list.
The alternative word list can be preexisting or can be generated by any suitable method, including by the use of an algorithm which identifies words which have similar phonetics, grammar, and/or spelling to the dictated word. The alternative word list is typically comprised of words which may sound similar to the words the speech recognition engine has identified. Essentially, the words contained in the alternative word list are the less preferred word identification choices which were also considered by the speech recognition engine when it attempted to identify a particular word or words spoken by the user. In some cases, an identified word selected by the speech recognition engine is an error, and one of the words contained on the alternative word list is actually the word spoken by the user.
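One way to picture an alternative word list is as the engine's less preferred candidates for the same utterance. The sketch below assumes the engine exposes its candidates as (word, score) pairs; that interface, the function name, and the scores shown are illustrative assumptions, not part of the specification.

```python
def alternative_word_list(hypotheses, chosen_word):
    """Return the candidates the engine considered but did not select,
    ordered from most to least likely.

    hypotheses: list of (word, score) pairs (hypothetical engine output).
    """
    ranked = sorted(hypotheses, key=lambda pair: pair[1], reverse=True)
    return [word for word, _ in ranked if word != chosen_word]


# Illustrative scores only.
hypotheses = [("steep", 0.52), ("step", 0.31), ("steppe", 0.17)]
alternatives = alternative_word_list(hypotheses, "steep")
print(alternatives)  # ['step', 'steppe']
```

With such a list in hand, the check of block 60 reduces to a membership test on the replacement word.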
If the replacement word is on the alternative word list, the system concludes that a speech misrecognition error occurred and continues on to the step of block 72, in accordance with which a language model is updated with a correction. As is known by those skilled in the art, it should be understood that the language model consists of statistical information about word patterns. Accordingly, correcting the language model is not an acoustic correction, but a statistical correction. After the language model is updated, the system proceeds to the step of block 74, described above.
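To illustrate what a statistical, rather than acoustic, correction might look like, the following sketch keeps bigram counts and shifts one observation from the misrecognized word to the replacement word. The bigram representation, the class and method names, and the choice to decrement the count of the misrecognized word are all assumptions made for illustration; the specification does not prescribe a particular model or update rule.

```python
from collections import defaultdict


class BigramModel:
    """Toy language model: counts of word pairs (previous word, word)."""

    def __init__(self):
        self.counts = defaultdict(lambda: defaultdict(int))

    def observe(self, previous_word, word):
        self.counts[previous_word][word] += 1

    def apply_correction(self, previous_word, dictated_word, replacement_word):
        # Assumed update rule: withdraw one observation of the misrecognized
        # word (if any) and credit the replacement word instead.
        if self.counts[previous_word][dictated_word] > 0:
            self.counts[previous_word][dictated_word] -= 1
        self.counts[previous_word][replacement_word] += 1

    def probability(self, previous_word, word):
        total = sum(self.counts[previous_word].values())
        return self.counts[previous_word][word] / total if total else 0.0


model = BigramModel()
model.observe("next", "steep")                   # engine output "next steep"
model.apply_correction("next", "steep", "step")  # user corrected it to "next step"
print(model.probability("next", "step"))
```

No audio is involved in this update; only the word-pattern statistics change.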
By way of example, if a user of a speech recognition system dictates the word "step" but the system interprets it to be the word "steep," a speech misrecognition error has occurred. The user may choose to correct the error by simply using a backspace or delete key to remove an "e" from the word "steep." The system recognizes this change, classifies the word "steep" as the dictated word and the word "step" as a replacement word, and compares the dictated word to the replacement word, in accordance with the step of block 58.
The system then determines whether the replacement word is on an alternative word list, according to block 60. If the replacement word is on an alternative word list, the language model is updated with the correction, according to block 72, so that the system learns how to properly recognize the user's dictation of the word "step."
In some cases, the replacement word is not found on an alternative word list. In those situations, according to the step of block 62, the method determines whether a close match, within a predetermined statistical quantity, exists between the dictated word and the replacement word. This determination can be made through the use of any suitable comparison process, such as using an algorithm to compare the phonetics, grammar, spelling, and/or context of surrounding words of the dictated word and the replacement word. For certain words, such as the word "two," the context of surrounding words can be particularly useful in the comparison step. For example, if a user dictates "divide two by three," the surrounding words "divide" and "three" dramatically increase the statistical probability that the user dictated the word "two," as opposed to "to" or "too."
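As a hedged illustration of the close-match test of block 62, the sketch below compares spelling only, using the similarity ratio from Python's standard `difflib` module. The threshold of 0.8 stands in for the "predetermined statistical quantity" and is an arbitrary assumption; a fuller comparison could also weigh phonetics, grammar, and surrounding context as described above.

```python
from difflib import SequenceMatcher


def is_close_match(dictated_word, replacement_word, threshold=0.8):
    """Return True if the spellings are similar enough to suggest a
    misrecognition correction rather than an unrelated edit.

    threshold is an assumed stand-in for the predetermined statistical quantity.
    """
    ratio = SequenceMatcher(None, dictated_word.lower(), replacement_word.lower()).ratio()
    return ratio >= threshold


print(is_close_match("steep", "step"))      # similar spelling
print(is_close_match("steep", "mountain"))  # unrelated word
```

A replacement that fails this test is treated as an ordinary edit, and the more expensive audio comparison is skipped.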
If a close match does not exist between the dictated word and the replacement word, as determined by a predetermined statistical quantity, the method branches along to the step of block 74, described above. If a close match does exist, the system needs to compare the audio of the dictated speech to the replacement word in order to determine whether the correction is an edit or a speech misrecognition error. A direct comparison cannot be made because the audio of the dictated word is a waveform, whereas the replacement word is a series of characters. Therefore, the audio of the dictated word and the characters of the replacement word must both be converted into information that can be directly compared.
Accordingly, when a close match exists, in accordance with the step of block 62, the method proceeds to the step of block 64. In this step, audio of the dictated word is converted into dictated word digital information. The invention then branches on to the step of block 66, in which the characters of the replacement word are converted into replacement word digital information. Methods of converting speech to text and/or text to speech are well known in the art. Speech to text methods typically comprise a two step process, in which the speech is first converted into a form of computer generated digital information, and the computer generated digital information is then converted into text. Similarly, in text to speech methods, text is typically first converted into a form of computer generated digital information, after which the system provides audio that is consistent with the computer generated digital information. In the invention, any text to speech conversion method suitable for converting a replacement word into replacement word digital information may be employed. Additionally, any speech to text conversion method suitable for converting a dictated word into dictated word digital information may be used.
Subsequently, during the step of block 68, the dictated word digital information is compared to the replacement word digital information. According to the step of block 70, if a close match exists within a predetermined statistical quantity, the method proceeds to block 72, described above, in accordance with which the language model is updated with the correction. The method then continues on to block 74, where the system determines whether additional information is available for evaluation. If a close match does not exist, within a predetermined statistical quantity, the method proceeds to block 74, described above.
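The digital comparison of blocks 68 and 70 can be sketched under one strong assumption: that both the dictated audio and the replacement text have already been converted into sequences of phoneme labels. The specification does not state what form the digital information takes, so the phoneme representation, the labels, the function names, and the threshold of 0.2 below are all hypothetical.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences."""
    previous = list(range(len(b) + 1))
    for i, item_a in enumerate(a, start=1):
        current = [i]
        for j, item_b in enumerate(b, start=1):
            cost = 0 if item_a == item_b else 1
            current.append(min(previous[j] + 1,          # deletion
                               current[j - 1] + 1,       # insertion
                               previous[j - 1] + cost))  # substitution
        previous = current
    return previous[-1]


def digital_match(dictated_info, replacement_info, max_normalized_distance=0.2):
    """Return True if the two representations are close enough.

    max_normalized_distance is an assumed stand-in for the predetermined
    statistical quantity.
    """
    longest = max(len(dictated_info), len(replacement_info))
    if longest == 0:
        return False
    return edit_distance(dictated_info, replacement_info) / longest <= max_normalized_distance


audio_phonemes = ["S", "T", "EH", "P"]  # assumed result of converting the saved audio
step_phonemes = ["S", "T", "EH", "P"]   # assumed result of converting the text "step"
stop_phonemes = ["S", "T", "AA", "P"]   # assumed result of converting the text "stop"

print(digital_match(audio_phonemes, step_phonemes))  # the audio agrees with the replacement
print(digital_match(audio_phonemes, stop_phonemes))  # the audio does not agree
```

When the audio agrees with the replacement text, the change is taken to be a correction of a misrecognition; otherwise it is treated as an edit and the language model is left alone.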
For example, if the user dictates the word "step," the system erroneously identifies it as the word "steep," and the user corrects the error by removing an "e" from the word "steep," the system uses a comparison method, as described above, to compare the dictated word "steep" to the replacement word "step." The system can then determine whether the replacement text "step" is on an alternative word list. If "step" is not on the alternative word list, a determination is made as to whether a close match exists between "step" and "steep" within a predetermined statistical quantity, according to the step of block 62. If a close match exists, the system converts audio of the user's dictated word into dictated word digital information according to the step of block 64, and converts the word "step" into replacement word digital information according to the step of block 66. Afterwards, a digital comparison occurs, according to the step of block 68. If the comparison reveals that there is a close match within a predetermined statistical quantity, the language model is updated so that the system will learn to properly recognize the user's dictation of the word "step," according to the step of block 72.
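The decision path walked through in this example can be summarized in a short sketch. The function name and the use of callables for the two comparisons are illustrative assumptions; the block numbers in the comments refer to flow chart 50.

```python
def should_update_language_model(replacement_word, alternative_words,
                                 text_is_close, digital_is_close):
    """Decide whether a replacement is treated as a misrecognition correction.

    text_is_close and digital_is_close are zero-argument callables, so the
    costlier digital comparison runs only when the earlier checks require it.
    """
    if replacement_word in alternative_words:  # block 60
        return True                            # block 72: update the language model
    if not text_is_close():                    # block 62
        return False                           # block 74: treat as an edit
    return digital_is_close()                  # blocks 64 to 70


# "step" is on the alternative word list: update without further comparison.
print(should_update_language_model("step", ["step", "steppe"],
                                   lambda: False, lambda: False))
# Not on the list, but both comparisons are close: update.
print(should_update_language_model("step", [], lambda: True, lambda: True))
# Not on the list and the text comparison is not close: no update.
print(should_update_language_model("mountain", [], lambda: False, lambda: True))
```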
After a user dictates a single word or a plurality of words, audio of that dictation is automatically saved by the system. The audio can remain stored until the user requests deletion of the saved audio. The system can be configured to automatically ask a user whether stored audio should be removed. Saving the audio until the user requests its deletion permits the user to edit dictation at a future point in time because audio of the user's dictated speech is available for conversion into dictated word digital information, which can then be compared to replacement word digital information.
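A minimal sketch of such audio retention is given below, assuming a simple in-memory store keyed by a word identifier. The class, its method names, and the identifier scheme are hypothetical; a practical system would more likely persist the audio alongside the document.

```python
class DictationAudioStore:
    """Keeps the audio of each dictated word until the user asks for deletion."""

    def __init__(self):
        self._audio = {}

    def save(self, word_id, samples):
        # Called automatically after each dictated word or phrase.
        self._audio[word_id] = bytes(samples)

    def get(self, word_id):
        # Returns the saved audio, or None if it was never saved or was deleted.
        return self._audio.get(word_id)

    def delete_all(self):
        # Invoked only when the user requests removal of the stored audio.
        self._audio.clear()


store = DictationAudioStore()
store.save("word-17", b"\x00\x01\x02")
print(store.get("word-17") is not None)  # audio is still available for a later correction
store.delete_all()
print(store.get("word-17"))
```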
It should be understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application. The invention can take other specific forms without departing from the spirit or essential attributes thereof.
Nassiff, Amado, Ortega, Kerry A.
8635243, | Mar 07 2007 | Microsoft Technology Licensing, LLC | Sending a communications header with voice recording to send metadata for use in speech recognition, formatting, and search mobile search application |
8768706, | Jul 22 2005 | 3M Innovative Properties Company | Content-based audio playback emphasis |
8775176, | Aug 31 2006 | Microsoft Technology Licensing, LLC | Method and system for providing an automated web transcription service |
8838457, | Mar 07 2007 | Nuance Communications, Inc | Using results of unstructured language model based speech recognition to control a system-level function of a mobile communications facility |
8880405, | Mar 07 2007 | Nuance Communications, Inc | Application text entry in a mobile environment using a speech processing facility |
8886540, | Mar 07 2007 | Nuance Communications, Inc | Using speech recognition results based on an unstructured language model in a mobile communication facility application |
8886545, | Mar 07 2007 | Microsoft Technology Licensing, LLC | Dealing with switch latency in speech recognition |
8914292, | Mar 07 2007 | Vlingo Corporation | Internal and external speech recognition use with a mobile communication facility |
8914402, | Mar 07 2007 | Vlingo Corporation | Multiple web-based content category searching in mobile search application |
8949130, | Mar 07 2007 | Nuance Communications, Inc | Internal and external speech recognition use with a mobile communication facility |
8949266, | Mar 07 2007 | Microsoft Technology Licensing, LLC | Multiple web-based content category searching in mobile search application |
8996379, | Mar 07 2007 | Nuance Communications, Inc | Speech recognition text entry for software applications |
9070368, | Aug 31 2006 | Microsoft Technology Licensing, LLC | Method and system for providing an automated web transcription service |
9111540, | Jun 09 2009 | Microsoft Technology Licensing, LLC | Local and remote aggregation of feedback data for speech recognition |
9218811, | Jun 28 2013 | Google Technology Holdings LLC | Electronic device and method for managing voice entered text using gesturing |
9245522, | Apr 17 2006 | III Holdings 1, LLC | Methods and systems for correcting transcribed audio files |
9431015, | Jun 28 2013 | Google Technology Holdings LLC | Electronic device and method for managing voice entered text using gesturing |
9460710, | Mar 07 2007 | Nuance Communications, Inc | Dealing with switch latency in speech recognition |
9495956, | Mar 07 2007 | Nuance Communications, Inc | Dealing with switch latency in speech recognition |
9619572, | Mar 07 2007 | Nuance Communications, Inc | Multiple web-based content category searching in mobile search application |
9715876, | Dec 04 2007 | III Holdings 1, LLC | Correcting transcribed audio files with an email-client interface |
9858256, | Apr 17 2006 | III Holdings 1, LLC | Methods and systems for correcting transcribed audio files |
Patent | Priority | Assignee | Title |
5027406, | Dec 06 1988 | Nuance Communications, Inc | Method for interactive speech recognition and training |
5884258, | Oct 31 1996 | Microsoft Technology Licensing, LLC | Method and system for editing phrases during continuous speech recognition |
5909667, | Mar 05 1997 | Nuance Communications, Inc | Method and apparatus for fast voice selection of error words in dictated text |
Executed on | Assignor | Assignee | Conveyance | Reel | Frame | Doc |
Sep 14 1999 | ORTEGA, KERRY A | International Business Machines Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 010306 | /0796 | |
Sep 27 1999 | | International Business Machines Corporation | (assignment on the face of the patent) | / | | |
Sep 27 1999 | NASSIFF, AMADO | International Business Machines Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 010306 | /0796 | |
Sep 30 2008 | International Business Machines Corporation | Nuance Communications, Inc | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 021924 | /0158 |
Date | Maintenance Fee Events |
Nov 18 2005 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity. |
Jan 11 2010 | M1552: Payment of Maintenance Fee, 8th Year, Large Entity. |
Dec 11 2013 | M1553: Payment of Maintenance Fee, 12th Year, Large Entity. |
Date | Maintenance Schedule |
Jul 09 2005 | 4 years fee payment window open |
Jan 09 2006 | 6 months grace period start (with surcharge) |
Jul 09 2006 | patent expiry (for year 4) |
Jul 09 2008 | end of 2-year period to revive if unintentionally abandoned (for year 4) |
Jul 09 2009 | 8 years fee payment window open |
Jan 09 2010 | 6 months grace period start (with surcharge) |
Jul 09 2010 | patent expiry (for year 8) |
Jul 09 2012 | end of 2-year period to revive if unintentionally abandoned (for year 8) |
Jul 09 2013 | 12 years fee payment window open |
Jan 09 2014 | 6 months grace period start (with surcharge) |
Jul 09 2014 | patent expiry (for year 12) |
Jul 09 2016 | end of 2-year period to revive if unintentionally abandoned (for year 12) |