A voicemail platform which provides a voicemail to text conversion service to a user includes a storage system which stores username data for a user of a voicemail to text conversion service, and a processing system. The processing system receives a voicemail message for the user, sends the voicemail message and the username data to a speech engine, receives text from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and sends the converted text to a device of the user.
|
1. A voicemail platform which provides a voicemail to text conversion service, comprising:
a memory which stores an email address and a name of a user for the user of a voicemail to text conversion service, the email address comprising the name of the user including at least a first and last name of the user; and
a processor which:
receives a voicemail message for the user,
sends the voicemail message and the email address together to a speech engine external to the voicemail platform, the speech engine configured to parse the email address and extract a correct spelling of the name of the user from the email address, recognize every occurrence of the name of the user within the voicemail message, and use the email address to correctly spell each corresponding occurrence of the name of the user within converted text of the voicemail message,
receives the converted text from the speech engine which is converted from the voicemail message using the email address to correctly spell all occurrences of the name of the user within the voicemail message, and
sends the converted text to a device of the user.
4. A method for providing a voicemail to text conversion service, comprising:
storing an email address and a name of a user for the user of a voicemail to text conversion service in a memory of a voicemail platform, the email address comprising the name of the user including at least a first and last name of the user;
receiving a voicemail message for the user at the voicemail platform;
sending the voicemail message and the email address together from the voicemail platform to a speech engine external to the voicemail platform, the speech engine configured to parse the email address and extract a correct spelling of the name of the user from the email address, recognize every occurrence of the name of the user within the voicemail message, and use the email address to correctly spell each corresponding occurrence of the name of the user within converted text of the voicemail message;
receiving the converted text at the voicemail platform from the speech engine which is converted, by a processor, from the voicemail message using the email address to correctly spell all occurrences of the name of the user within the voicemail message; and
sending the converted text from the voicemail platform to a device of the user.
7. A non-transitory computer-readable storage medium encoded with an executable computer program for providing a voicemail to text conversion service and that, when executed by a processor, causes the processor to perform operations comprising:
storing an email address and a name of a user for the user of a voicemail to text conversion service in a memory of a voicemail platform, the email address comprising the name of the user including at least a first and last name of the user;
receiving a voicemail message for the user at the voicemail platform;
sending the voicemail message and the email address together from the voicemail platform to a speech engine external to the voicemail platform, the speech engine configured to parse the email address and extract a correct spelling of the name of the user from the email address, recognize every occurrence of the name of the user within the voicemail message, and use the email address to correctly spell each corresponding occurrence of the name of the user within converted text of the voicemail message;
receiving the converted text at the voicemail platform from the speech engine which is converted from the voicemail message using the email address to correctly spell all occurrences of the name of the user within the voicemail message; and
sending the converted text from the voicemail platform to a device of the user.
2. The voicemail platform according to
3. The voicemail platform according to
5. The method according to
6. The method according to
8. The non-transitory computer-readable storage medium according to
9. The non-transitory computer-readable storage medium according to
10. The voicemail platform according to
|
1. Field of the Disclosure
The present disclosure relates to voicemail to text conversion. More particularly, the present disclosure relates to a manner of improving the accuracy of a voicemail to text conversion of a user's name.
2. Background Information
A voicemail to text conversion service is a service which converts a voicemail message to text. Typically, a user's name (i.e., a called party's name) will be mentioned at least once in a voicemail message. However, a name is a difficult word for a speech engine to accurately convert to text. Thus, a user of a voicemail to text conversion service may find his or her name to be constantly misspelled in their converted text messages.
In view of the foregoing, the present disclosure, through one or more of its various aspects, embodiments and/or specific features or sub-components, is thus intended to bring out one or more of the advantages as specifically noted below.
In a networked deployment, the computer system may operate in the capacity of a server or as a client user computer in a server-client user network environment, or as a peer computer system in a peer-to-peer (or distributed) network environment. The computer system 100, or portions thereof, can also be implemented as or incorporated into various devices, such as a personal computer (PC), a tablet PC, a set-top box (STB), a personal digital assistant (PDA), a mobile device, a palmtop computer, a laptop computer, a desktop computer, a communications device, a wireless telephone, a personal trusted device, a web appliance, or any other machine capable of executing a set of instructions (sequential or otherwise) that specify actions to be taken by that machine. In a particular embodiment, the computer system 100 can be implemented using electronic devices that provide voice, video or data communication. Further, while a single computer system 100 is illustrated, the term “system” shall also be taken to include any collection of systems or sub-systems that individually or jointly execute a set, or multiple sets, of instructions to perform one or more computer functions.
As illustrated in
In a particular embodiment, as depicted in
In an alternative embodiment, dedicated hardware implementations, such as application specific integrated circuits, programmable logic arrays and other hardware devices, can be constructed to implement one or more of the methods described herein. Applications that may include the apparatus and systems of various embodiments can broadly include a variety of electronic and computer systems. One or more embodiments described herein may implement functions using two or more specific interconnected hardware modules or devices with related control and data signals that can be communicated between and through the modules, or as portions of an application-specific integrated circuit. Accordingly, the present system encompasses software, firmware, and hardware implementations, or combinations thereof.
In accordance with various embodiments of the present disclosure, the methods described herein may be implemented by software programs executable by a computer system. Further, in an exemplary, non-limited embodiment, implementations can include distributed processing, component/object distributed processing, and parallel processing. Alternatively, virtual computer system processing can be constructed to implement one or more of the methods or functionality as described herein.
The present disclosure contemplates a computer-readable medium 182 that includes instructions 184 or receives and executes instructions 184 responsive to a propagated signal, so that a device connected to a network 101 can communicate voice, video or data over the network 101. Further, the instructions 184 may be transmitted or received over the network 101 via the network interface device 140.
The voicemail platform 201 receives a voicemail message from a calling party which is directed to a voicemail user (a called party). The voicemail platform 201 may be a centralized computer system which stores incoming voicemail messages in personal mailboxes associated with user phone numbers. The voicemail messages may be stored in a storage system which includes storage media such as, but not limited to, hard disk drives.
The voicemail platform 201 may also store data pertinent to each voicemail user. For example, the voicemail platform 201 may store an email address associated with each voicemail user. For a voicemail user who utilizes a voicemail to text conversion service, the converted text can be delivered to the user in the form of an email message addressed to the email address stored in the voicemail platform 201. The voicemail platform 201 may also store other data pertinent to each voicemail user, such as a user's name.
The voicemail platform 201 sends the voicemail, as well as username data of the voicemail user, to the speech engine 202. In this regard, the voicemail platform 201 may include a processing system including one or more processors programmed to perform the algorithm illustrated in
The speech engine 202 determines the correct spelling of the user's name from the username data. The username data may be an email address of the user. In this regard, a user's email address typically contains all or part of a user's name. For example, a person named ‘Rick Jones’ who is an employee of XYZ corporation may have an email address of ‘rick.jones@xyz.com’. The speech engine 202 may be capable of parsing the email address and extracting the correct spelling of the user's name from the email address.
Alternatively, the voicemail platform 201 may parse the user's email address and extract the correct spelling of the user's name from the email address, and then send the correct spelling of the user's name to the speech engine 202 as the username data.
The speech engine 202 performs a voicemail to text conversion algorithm to convert the voicemail to text. In this regard, the speech engine 202 may include a processing system including one or more processors programmed to perform the algorithm illustrated in
The voicemail platform 201 then receives the converted text from the speech engine 202 (S303), and delivers the converted text to a device 203 of the user, such as, but not limited to, a phone, a PDA, a tablet device or a PC (S304). The converted text may be delivered to the voicemail user in a variety of formats, such as, but not limited to, an email or a Short Message Service (SMS) text message.
Although the invention has been described with reference to several exemplary embodiments, it is understood that the words that have been used are words of description and illustration, rather than words of limitation. Changes may be made within the purview of the appended claims, as presently stated and as amended, without departing from the scope and spirit of the invention in its aspects. Although the invention has been described with reference to particular means, materials and embodiments, the invention is not intended to be limited to the particulars disclosed; rather the invention extends to all functionally equivalent structures, methods, and uses such as are within the scope of the appended claims.
According to an aspect of the present disclosure, a voicemail platform which provides a voicemail to text conversion service to a user includes a storage system which stores username data for a user of a voicemail to text conversion service, and a processing system. The processing system receives a voicemail message for the user, sends the voicemail message and the username data to a speech engine, receives text from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and sends the converted text to a device of the user.
The converted text may be delivered to the device of the user in the form of an email message. The username data may be an email address of the user. The email address may contain at least part of the user's name.
The voicemail platform may determine the username data from an email address of the user. The device of the user may be a phone, a PDA, a tablet device or a PC. The voicemail platform may be operated by a telephone carrier, and the speech engine may be operated by a third-party vendor.
According to another aspect of the present disclosure, a method for providing a voicemail to text conversion service to a user includes storing username data for a user of a voicemail to text conversion service in a storage system of a voicemail platform, receiving a voicemail message for the user at the voicemail platform, sending the voicemail message and the username data from the voicemail platform to a speech engine, receiving text at the voicemail platform from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and sending the converted text from the voicemail platform to a device of the user.
According to another aspect of the present disclosure, a non-transitory computer-readable medium storing a program for providing a voicemail to text conversion service to a user includes code for storing username data for a user of a voicemail to text conversion service in a storage system of a voicemail platform, code for receiving a voicemail message for the user at the voicemail platform, code for sending the voicemail message and the username data from the voicemail platform to a speech engine, code for receiving text at the voicemail platform from the speech engine which is converted from the voicemail message using the username data to correctly spell all occurrences of the user's name within the voicemail message, and code for sending the converted text from the voicemail platform to a device of the user.
While a computer-readable medium herein may be shown to be a single medium, the term “computer-readable medium” includes a single medium or multiple media, such as a centralized or distributed database, and/or associated caches and servers that store one or more sets of instructions. The term “computer-readable medium” shall also include any medium that is capable of storing, encoding or carrying a set of instructions for execution by a processor or that cause a computer system to perform any one or more of the methods or operations disclosed herein.
In a particular non-limiting, exemplary embodiment, the computer-readable medium can include a solid-state memory such as a memory card or other package that houses one or more non-volatile read-only memories. Further, the computer-readable medium can be a random access memory or other volatile re-writable memory. Additionally, the computer-readable medium can include a magneto-optical or optical medium, such as a disk or tapes or other storage device to capture carrier wave signals such as a signal communicated over a transmission medium. Accordingly, the disclosure is considered to include any computer-readable medium or other equivalents and successor media, in which data or instructions may be stored.
Although the present specification describes components and functions that may be implemented in particular embodiments with reference to particular standards and protocols, the disclosure is not limited to such standards and protocols. For example, standards for power over ethernet represent an example of the state of the art. Such standards are periodically superseded by faster or more efficient equivalents having essentially the same functions. Accordingly, replacement standards and protocols having the same or similar functions are considered equivalents thereof.
The illustrations of the embodiments described herein are intended to provide a general understanding of the structure of the various embodiments. The illustrations are not intended to serve as a complete description of all of the elements and features of apparatus and systems that utilize the structures or methods described herein. Many other embodiments may be apparent to those of skill in the art upon reviewing the disclosure. Other embodiments may be utilized and derived from the disclosure, such that structural and logical substitutions and changes may be made without departing from the scope of the disclosure. Additionally, the illustrations are merely representational and may not be drawn to scale. Certain proportions within the illustrations may be exaggerated, while other proportions may be minimized. Accordingly, the disclosure and the figures are to be regarded as illustrative rather than restrictive.
One or more embodiments of the disclosure may be referred to herein, individually and/or collectively, by the term “invention” merely for convenience and without intending to voluntarily limit the scope of this application to any particular invention or inventive concept. Moreover, although specific embodiments have been illustrated and described herein, it should be appreciated that any subsequent arrangement designed to achieve the same or similar purpose may be substituted for the specific embodiments shown. This disclosure is intended to cover any and all subsequent adaptations or variations of various embodiments. Combinations of the above embodiments, and other embodiments not specifically described herein, will be apparent to those of skill in the art upon reviewing the description.
The Abstract of the Disclosure is provided to comply with 37 C.F.R. §1.72(b) and is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims. In addition, in the foregoing Detailed Description, various features may be grouped together or described in a single embodiment for the purpose of streamlining the disclosure. This disclosure is not to be interpreted as reflecting an intention that the claimed embodiments require more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive subject matter may be directed to less than all of the features of any of the disclosed embodiments. Thus, the following claims are incorporated into the Detailed Description, with each claim standing on its own as defining separately claimed subject matter.
The above disclosed subject matter is to be considered illustrative, and not restrictive, and the appended claims are intended to cover all such modifications, enhancements, and other embodiments which fall within the true spirit and scope of the present disclosure. Thus, to the maximum extent allowed by law, the scope of the present disclosure is to be determined by the broadest permissible interpretation of the following claims and their equivalents, and shall not be restricted or limited by the foregoing detailed description.
Zubas, Michael, Jackson, Jacqueline
Patent | Priority | Assignee | Title |
10893013, | Aug 22 2017 | Recipient notification of electronic message generated by voice-to-text engine |
Patent | Priority | Assignee | Title |
6775651, | May 26 2000 | Nuance Communications, Inc | Method of transcribing text from computer voice mail |
7542552, | Jun 24 2005 | INCONTACT INC | Method and apparatus for generating default personal greetings using text to speech conversion |
7562552, | Nov 10 2005 | Tyco Electronics Corporation | Crimp height adjustment mechanism |
7590550, | Sep 08 2006 | American Well Corporation | Connecting consumers with service providers |
7693267, | Dec 30 2005 | Microsoft Technology Licensing, LLC | Personalized user specific grammars |
8126718, | Jul 11 2008 | Malikie Innovations Limited | Facilitating text-to-speech conversion of a username or a network address containing a username |
8204486, | Dec 19 2008 | COX COMMUNICATIONS, INC | Dynamic messaging routing and audio-to-text linking for visual voicemail |
8249568, | Jun 15 2006 | STRIPE, INC | System and method for processing a voice mail |
20050021339, | |||
20070293272, | |||
20080167007, | |||
20110022386, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Jun 29 2010 | ZUBAS, MICHAEL | AT&T MOBILITY II LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 024625 | /0085 | |
Jul 01 2010 | AT&T MOBILITY II LLC. | (assignment on the face of the patent) | / | |||
Jul 01 2010 | JACKSON, JACQUELINE | AT&T MOBILITY II LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 024625 | /0085 | |
Dec 14 2016 | AT&T MOBILITY II LLC | Nuance Communications, Inc | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 041494 | /0590 | |
Sep 20 2023 | Nuance Communications, Inc | Microsoft Technology Licensing, LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 065578 | /0676 |
Date | Maintenance Fee Events |
Mar 17 2016 | ASPN: Payor Number Assigned. |
Aug 19 2019 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity. |
Jun 19 2023 | M1552: Payment of Maintenance Fee, 8th Year, Large Entity. |
Date | Maintenance Schedule |
Feb 23 2019 | 4 years fee payment window open |
Aug 23 2019 | 6 months grace period start (w surcharge) |
Feb 23 2020 | patent expiry (for year 4) |
Feb 23 2022 | 2 years to revive unintentionally abandoned end. (for year 4) |
Feb 23 2023 | 8 years fee payment window open |
Aug 23 2023 | 6 months grace period start (w surcharge) |
Feb 23 2024 | patent expiry (for year 8) |
Feb 23 2026 | 2 years to revive unintentionally abandoned end. (for year 8) |
Feb 23 2027 | 12 years fee payment window open |
Aug 23 2027 | 6 months grace period start (w surcharge) |
Feb 23 2028 | patent expiry (for year 12) |
Feb 23 2030 | 2 years to revive unintentionally abandoned end. (for year 12) |