A motor vehicle has a speech interface for an acoustic input of commands for operating the motor vehicle or a module of the motor vehicle. The speech interface includes a speech recognition database in which a substantial portion of commands or command components, which can be input, are stored in a version according to a pronunciation in a first language and in a version according to a pronunciation in at least a second language, and a speech recognition engine for automatically comparing an acoustic command to commands and/or command components, which are stored in the speech recognition database, in a version according to the pronunciation in the first language and to commands and/or command components, which are stored in the speech recognition database, in a version according to the pronunciation in the second language.
14. A motor vehicle comprising:
a speech interface configured to provide an acoustic output of information;
said speech input interface including a speech identification module with a speech recognition database storing a substantial portion of at least one of commands and command components which can be input, in a version according to a pronunciation in a first language and in a version according to a pronunciation in at least a second language, such that each given one of the substantial portion of the at least one of commands and command components is stored in a version according to a pronunciation in the first language and in a version according to a pronunciation in at least the second language;
said speech interface including a language selection module configured to automatically select one of the first language and the at least second language for the output of information;
said speech interface including a speech recognition acoustic model trained in the first language and in the second language; and
said speech interface including a multilingual grammar module, said multilingual grammar module including grammar and phrases in the first language and in the second language.
11. A motor vehicle comprising:
a speech interface including a speech input interface configured to receive an acoustic input of commands for operating one of a motor vehicle and a module of the motor vehicle;
said speech input interface including a speech identification module with a speech recognition database storing a substantial portion of at least one of commands and command components which can be input, in a version according to a pronunciation in a first language and in a version according to a pronunciation in at least a second language, such that each given one of the substantial portion of the at least one of commands and command components is stored in a version according to a pronunciation in the first language and in a version according to a pronunciation in at least the second language;
said speech identification module being configured to assign a pronunciation of at least one of a command and a command component to one of the first language and the at least second language;
said speech interface including a speech recognition acoustic model trained in the first language and in the second language; and
said speech interface including a multilingual grammar module, said multilingual grammar module including grammar and phrases in the first language and in the second language.
1. A motor vehicle comprising:
a speech interface configured to receive an acoustic input of commands for operating one of a motor vehicle and a module of the motor vehicle;
said speech interface including a speech recognition database and a speech recognition engine;
said speech recognition database storing a substantial portion of at least one of commands and command components which can be input, in a version according to a pronunciation in a first language and in a version according to a pronunciation in at least a second language, such that each given one of the substantial portion of the at least one of commands and command components is stored in a version according to a pronunciation in the first language and in a version according to a pronunciation in at least the second language;
said speech recognition engine being configured to automatically compare an acoustic command to at least one of commands and command components, which are stored in said speech recognition database, in a version according to the pronunciation in the first language and to at least one of commands and command components, which are stored in said speech recognition database, in a version according to the pronunciation in the second language;
said speech interface including a speech recognition acoustic model trained in the first language and in the second language; and
said speech interface including a multilingual grammar module, said multilingual grammar module including grammar and phrases in the first language and in the second language.
2. The motor vehicle according to
3. The motor vehicle according to
said grapheme-to-phoneme module assigned to the first language is a first grapheme-to-phoneme module; and
said speech interface further includes a second grapheme-to-phoneme module assigned to the second language, said second grapheme-to-phoneme module is configured to generate a new entry in said speech recognition database for at least one of the new word and the new name.
4. The motor vehicle according to
5. The motor vehicle according to
6. The motor vehicle according to
7. The motor vehicle according to
8. The motor vehicle according to
9. The motor vehicle according to
10. The motor vehicle according to
12. The motor vehicle according to
13. The motor vehicle according to
15. The motor vehicle according to
16. The motor vehicle according to
17. The motor vehicle according to
18. The motor vehicle according to
19. The motor vehicle according to
The invention relates to a motor vehicle with a speech interface for an acoustic output of information and/or for an acoustic input of commands for operating the motor vehicle or a module of the motor vehicle.
International Publication No. WO 01/28187 A1 discloses a system which is implemented in a vehicle and which is operated by acoustic inputs. Acoustic inputs in the context of motor vehicles are furthermore disclosed in German Patent Application Publication Nos. DE 10 2004 055 609 A1 and DE 10 2004 061 782 A1.
It is accordingly an object of the invention to provide a motor vehicle with a speech interface which overcomes disadvantages of the heretofore-known motor vehicles of this general type and which improves the operation of the motor vehicle and makes the operation of the motor vehicle easier and more convenient.
With the foregoing and other objects in view there is provided, in accordance with the invention, a motor vehicle including:
a speech interface configured to receive an acoustic input of commands for operating a motor vehicle or a module of the motor vehicle;
the speech interface including a speech recognition database and a speech recognition engine;
the speech recognition database storing a substantial portion of commands and/or command components which can be input, in a version according to a pronunciation in a first language and in a version according to a pronunciation in at least a second language; and
the speech recognition engine being configured to automatically compare an acoustic command to commands and/or command components, which are stored in the speech recognition database, in a version according to the pronunciation in the first language and to commands and/or command components, which are stored in the speech recognition database, in a version according to the pronunciation in the second language.
In other words, according to the invention, there is provided a motor vehicle with a speech interface for an acoustic input of commands for operating the motor vehicle or a module of the motor vehicle, wherein the speech interface includes a speech recognition database in which a substantial portion of commands or command components, which can be input, are stored in a version according to a pronunciation in a first language and in a version according to a pronunciation in at least a second language, and a speech recognition engine for automatically comparing an acoustic command to commands and/or command components, which are stored in the speech recognition database, in a version according to the pronunciation in the first language and to commands and/or command components, which are stored in the speech recognition database, in a version according to the pronunciation in the second language.
A speech recognition database in accordance with the invention may also be set up in a modular manner separated according to languages. A speech recognition database that is set up in a modular manner separated according to languages is in particular then a speech recognition database in accordance with the invention if the speech recognition database interacts in accordance with the invention with a speech recognition engine such that the speech recognition engine automatically compares an acoustically input command to the commands and/or command components, which are stored in the speech recognition database, in the version according to the pronunciation in the first language as well as to the commands and/or command components, which are stored in the speech recognition database, in the version according to the pronunciation in the second language.
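By way of illustration only, the interaction between a speech recognition database that is separated according to languages and a speech recognition engine that compares an input against every language module may be sketched as follows. The phoneme strings, the command identifiers and the similarity measure (Python's `difflib.SequenceMatcher`) are assumptions made for this sketch and are not taken from the disclosure.

```python
from difflib import SequenceMatcher

# Hypothetical modular database: one sub-lexicon per language. Each entry maps
# a made-up phoneme string to a language-neutral command identifier.
SPEECH_RECOGNITION_DB = {
    "en": {"d ay ax l": "DIAL", "s eh v ax n": "DIGIT_7", "f ao r": "DIGIT_4"},
    "es": {"m a r k a r": "DIAL", "s j e t e": "DIGIT_7", "k w a t r o": "DIGIT_4"},
}


def recognize(phonemes):
    """Compare one phonetic command component to the entries of every
    language module and return the best (command, language, score)."""
    best = None
    for language, lexicon in SPEECH_RECOGNITION_DB.items():
        for pronunciation, command in lexicon.items():
            # Similarity of the two phoneme sequences, between 0.0 and 1.0.
            score = SequenceMatcher(
                None, phonemes.split(), pronunciation.split()
            ).ratio()
            if best is None or score > best[2]:
                best = (command, language, score)
    return best
```

In this sketch the same command identifier is reachable through either pronunciation, so the language of the best match is obtained as a by-product of the comparison.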
According to another feature of the invention, the speech interface includes a speech recognition acoustic model trained in the first language and in the second language.
According to yet another feature of the invention, the speech interface includes a first grapheme-to-phoneme module assigned to the first language, the first grapheme-to-phoneme module is configured to generate a new entry in the speech recognition database for a new word and/or a new name.
According to another feature of the invention, the speech interface further includes a second grapheme-to-phoneme module assigned to the second language, the second grapheme-to-phoneme module is configured to generate a new entry in the speech recognition database for the new word and/or the new name.
According to a further feature of the invention, the speech interface includes a multilingual grammar module, the multilingual grammar module includes grammar and phrases in the first language and in the second language.
According to another feature of the invention, the speech interface includes a speech output interface configured to provide an acoustic output of information.
According to a further feature of the invention, the speech output interface includes a language selection module configured to automatically select the first language or the second language for an output of information.
According to yet another feature of the invention, the language selection module includes a counter configured to count words and/or components of a command having been input, distinguished according to a use of the first language and the second language for individual words and components.
According to another feature of the invention, the language selection module includes a language switch configured to automatically select the first language or the second language for the output of the information in dependence of a number of words and/or components of the command having been input in the first language and the second language within a counting interval.
According to another feature of the invention, the language selection module includes a counter configured to count words and/or components of a command having been input, distinguished according to a use of the first language and the second language for individual words and components within a counting interval.
According to another feature of the invention, the speech output interface includes a text-to-speech module trained with the first language and the second language for transforming a text command into a speech output.
According to another feature of the invention, the speech output interface includes a speech output database storing a substantial portion of information which can be output, in a version according to a pronunciation in the first language and in a version according to a pronunciation in the second language.
As defined above, in accordance with an embodiment of the invention, the speech interface further includes a speech recognition acoustic model trained at least in the first language as well as in the second language.
In accordance with a further embodiment of the invention, the speech interface includes a first grapheme-to-phoneme module assigned to the first language for generating a new entry in the speech recognition database for a new word or a new name (in the first language) as well as, in accordance with a further embodiment of the invention, a second grapheme-to-phoneme module assigned to the second language for generating a new entry in the speech recognition database for the new word or the new name (in the second language). Details relating to the grapheme-to-phoneme process are for example disclosed in the sources cited in German Patent Application Publication Nos. DE 10 2004 055 609 A1 and DE 10 2004 061 782 A1, such as the article with the title “Grapheme-to-phoneme conversion, a knowledge-based approach” by Niklas Torstenson, Dept. of Languages, Högskolan i Skövde, TMH-QPRS Vol. 44—Fonetik 2002, available at the web address www.speech.kth.se/qprs/tmh/2002/02-44-117-120.pdf.
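For illustration, the generation of one new database entry per language for a new name may be sketched as follows. The letter-to-sound rules below are a deliberately tiny, hypothetical stand-in for a grapheme-to-phoneme module; they are not the knowledge-based approach of the cited article, and the function names are assumptions.

```python
# Hypothetical letter-to-sound rules per language. Letters without a rule are
# passed through unchanged; a real module would be considerably richer.
G2P_RULES = {
    "en": {"a": "ae", "e": "eh", "i": "ih", "o": "aa", "u": "ah", "j": "jh"},
    "es": {"a": "a", "e": "e", "i": "i", "o": "o", "u": "u", "j": "x", "v": "b"},
}


def grapheme_to_phoneme(word, language):
    """Convert a written word into a phoneme string for one language."""
    rules = G2P_RULES[language]
    return " ".join(rules.get(ch, ch) for ch in word.lower() if ch.isalpha())


def add_new_entry(database, word):
    """Store the new word once per language, keyed by its pronunciation."""
    for language in G2P_RULES:
        pronunciation = grapheme_to_phoneme(word, language)
        database.setdefault(language, {})[pronunciation] = word
    return database
```

Calling `add_new_entry({}, "Javier")` in this sketch yields one entry under the first language and one under the second, each keyed by the pronunciation that the respective rule set produces.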
In a further embodiment of the invention, the speech interface further includes a speech output interface for an acoustic output of information, wherein, in accordance with a further embodiment of the invention, the speech output interface includes a language selection module for an automatic selection of the first language or the second language for an output of information. In accordance with a further embodiment of the invention, the language selection module includes a counter for counting words or components of a command having been input, distinguished according to a use of the first and the second language for individual words or components of a command that has been input (in particular within a counting interval) and, in accordance with another embodiment of the invention, a language switch for an automatic selection of the first or second language for an output of the information in dependence of a number of words or components of the command, which has been input, in the first and the second language within the counting interval. A counting interval according to the invention may for example be a given number of words or word groups.
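A minimal sketch of such a counter and language switch is given below, assuming that the counting interval is a fixed number of most recently input words. The class name, the default language and the tie-breaking rule (keep the current output language) are assumptions of the sketch, not features stated in the disclosure.

```python
from collections import Counter, deque


class LanguageSelectionModule:
    """Counter plus language switch over a sliding counting interval."""

    def __init__(self, counting_interval=8, default_language="en"):
        # Only the last `counting_interval` language tags are retained.
        self._window = deque(maxlen=counting_interval)
        self._current = default_language

    def count(self, language_tags):
        """Counter: record the language used for each word or component."""
        self._window.extend(language_tags)

    def select_output_language(self):
        """Language switch: pick the language used most often in the interval."""
        counts = Counter(self._window)
        if counts:
            top = max(counts.values())
            leaders = [lang for lang, n in counts.items() if n == top]
            if len(leaders) == 1:  # assumption: a tie keeps the current language
                self._current = leaders[0]
        return self._current
```

Because older tags fall out of the window, the selected output language follows the operator when the operator changes language over the course of several commands.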
In a further embodiment of the invention, the speech output interface includes a text-to-speech module trained by means of the first language and the second language for transforming a text command into a speech output.
In another embodiment of the invention, the speech output interface includes a speech output database in which an essential part of the information which can be output is stored in a version according to the pronunciation in the first language and in a version according to the pronunciation in the second language.
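As a rough sketch, such a speech output database may be pictured as a mapping from a language-neutral message identifier to one stored version per language. The identifier, the phoneme strings and the fallback behavior are hypothetical and serve only to illustrate the lookup.

```python
# Hypothetical speech output database: each message that can be output is
# stored once per language as a made-up phoneme string.
SPEECH_OUTPUT_DB = {
    "INPUT_CONFIRMATION": {"en": "th ae ng k s", "es": "g r a s j a s"},
}


def lookup_output(message_id, language, fallback="en"):
    """Return the stored version of a message for the selected language.

    Falls back to `fallback` when no version exists for `language`
    (an assumption of this sketch).
    """
    versions = SPEECH_OUTPUT_DB[message_id]
    return versions.get(language, versions[fallback])
```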
With the foregoing and other objects in view there is provided, in accordance with the invention, a motor vehicle including:
a speech input interface configured to receive an acoustic input of commands for operating a motor vehicle and/or a module of the motor vehicle; and
the speech input interface including a speech identification module configured to assign a pronunciation of a command and/or a command component to a language.
In other words, the above-stated object of the invention is furthermore achieved by a motor vehicle, which in particular includes one or more of the above-described features, with a speech input interface for an acoustic input of commands for operating the motor vehicle or a module of the motor vehicle, wherein the speech input interface includes a speech identification module for assigning or allocating the pronunciation of a command and/or a command component that has been input to a language. The motor vehicle may furthermore include a speech output interface for an acoustic output of information as well as, in accordance with a further embodiment of the invention, a language selection module for automatically selecting the aforementioned language as the language in which the output of information is performed.
According to another feature of the invention, the motor vehicle includes a speech output interface which is operatively connected to the speech input interface and configured to provide an acoustic output of information.
According to another feature of the invention, the speech output interface includes a language selection module configured to automatically select the proper language as a language to be used for the acoustic output of information.
With the foregoing and other objects in view there is provided, in accordance with the invention, a motor vehicle including a speech interface configured to provide an acoustic output of information; and the speech interface including a language selection module configured to automatically select a language for the output of information.
In other words, the above-stated object of the invention is furthermore achieved by a motor vehicle, which in particular includes one or more of the above-described features, with a speech interface for an acoustic output of information, wherein the speech interface includes a language selection module for an automatic selection of the language in which the output of information is performed.
According to another feature of the invention, the language selection module includes a counter configured to count words and/or components of a command having been input, distinguished according to a language used for individual words and/or components.
According to a further feature of the invention, the language selection module includes a language switch configured to automatically select a language for the output of information in dependence of a respective number of words and/or components of a command having been input in a respective language within a counting interval.
According to another feature of the invention, the language selection module includes a counter configured to count words and/or command components having been input, distinguished according to a language used for individual words and/or components within a counting interval.
According to yet another feature of the invention, the speech interface includes a multilingual text-to-speech module configured to transform a text command into a speech output.
According to another feature of the invention, the speech interface includes a multilingual speech output database.
As explained above, in a further embodiment of the invention, the language selection module includes a counter for counting words or components of a command having been input, distinguished according to a language used for individual words and/or components (in particular within a counting interval) and, in a further embodiment of the invention, a language switch for an automatic selection of the language in which the output of information occurs, in dependence of the number of words or components of a command, having been input within a counting interval, that were spoken in each language. In a further embodiment of the invention, the speech interface includes a text-to-speech module, which is trained as a multilingual text-to-speech module, for a transformation of a text command into a speech output.
A motor vehicle in accordance with the invention is in particular a land vehicle which is operated individually in road traffic. Motor vehicles for the purpose of the invention are in particular not limited to land vehicles having a combustion engine.
Other features which are considered as characteristic for the invention are set forth in the appended claims.
Although the invention is illustrated and described herein as embodied in a motor vehicle having a speech interface, it is nevertheless not intended to be limited to the details shown, since various modifications and structural changes may be made therein without departing from the spirit of the invention and within the scope and range of equivalents of the claims.
The construction and method of operation of the invention, however, together with additional objects and advantages thereof will be best understood from the following description of specific embodiments when read in connection with the accompanying drawings.
Referring now to the figures of the drawings in detail and first, particularly, to
The speech interface 2 includes a speech input interface 10, which is illustrated in detail in
The speech input interface 10 furthermore includes a speech recognition engine 21 for automatically comparing an acoustic command in the form of an output signal mic of the microphone 4 to the commands and/or command components, which are stored in the speech recognition database, in the version according to the pronunciation in the first language as well as to the commands and/or command components, which are stored in the speech recognition database, in the version according to the pronunciation in the second language. For this purpose, speech components in the output signal mic of the microphone 4 are identified by a speech recognition acoustic model 22, which is trained in the first language as well as in the second language, wherein the speech components are divided or organized into (phonetic) command components (e.g. individual words or groups of words, such as “the destination is” or “Miranda Avenue”) by a multilingual grammar module 23, which includes grammar and phrases of the first language and of the second language.
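The division into command components may be illustrated by the following sketch, which groups recognized words into the longest phrase known to a multilingual phrase list. The greedy longest-match strategy, the phrase table and the Spanish phrase “el destino es” are assumptions made for this sketch; the disclosure does not specify how the multilingual grammar module 23 operates internally.

```python
# Hypothetical multilingual phrase table: phrase (as a word tuple) -> language.
MULTILINGUAL_PHRASES = {
    ("the", "destination", "is"): "en",
    ("el", "destino", "es"): "es",
    ("miranda", "avenue"): "en",
    ("dial",): "en",
}


def split_into_components(words):
    """Greedily group words into the longest known phrases.

    Returns a list of (component_text, language) pairs; words not covered
    by any phrase become single-word components with language None.
    """
    words = [w.lower() for w in words]
    max_len = max(len(phrase) for phrase in MULTILINGUAL_PHRASES)
    components, i = [], 0
    while i < len(words):
        for n in range(min(max_len, len(words) - i), 0, -1):
            candidate = tuple(words[i:i + n])
            if candidate in MULTILINGUAL_PHRASES:
                components.append(
                    (" ".join(candidate), MULTILINGUAL_PHRASES[candidate])
                )
                i += n
                break
        else:
            # No phrase starts here: treat the word as its own component.
            components.append((words[i], None))
            i += 1
    return components
```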
The (phonetic) command components are compared to the entries of the speech recognition database 24 which means with respect to
The speech input interface 10 furthermore includes a grapheme-to-phoneme module 25, which is assigned to the first language, for generating a new entry in the speech recognition database 24 for a new word or a new name in the first language as well as a grapheme-to-phoneme module 26, which is assigned to the second language, for generating a new entry in the speech recognition database 24 for the new word or the new name in the second language. Details relating to the grapheme-to-phoneme process are for example disclosed in the sources cited in German Patent Application Publication Nos. DE 10 2004 055 609 A1 and DE 10 2004 061 782 A1, such as the article with the title “Grapheme-to-phoneme conversion, a knowledge-based approach” by Niklas Torstenson, Dept. of Languages, Högskolan i Skövde, TMH-QPRS Vol. 44-Fonetik 2002, available at the web address www.speech.kth.se/qprs/tmh/2002/02-44-117-120.pdf.
The speech output interface 20 includes a counter 31 for counting words or components of a command, which has been input, distinguished according to a use of the first and second language for individual words or components of a command, which has been input, within a counting interval. The speech output interface 20 furthermore includes a language switch 32, which is controlled by the counter 31, for an automatic selection of the first or the second language for outputting the information in dependence of the number of words or components of the command, which has been input, in the first and the second language within the counting interval. When, for example, the majority of the words or components of the command, which has been input, or of several commands within the counting interval are pronounced in Spanish, then a setting is performed by the language switch 32 such that a confirmation or answer is given in Spanish. The counter 31 and the language switch 32 form an exemplary embodiment of a language selection module as defined in the claims.
If, for example, the operator 6 gives the instruction “Dial cuatro siete seis dos ocho cinco siete,” then the speech input interface 10 provides “Dial the telephone number 4762857” as the input command ci and provides “English Spanish Spanish Spanish Spanish Spanish Spanish Spanish” as the output information lng. The counter 31 determines once “English” and seven times “Spanish” and controls the language switch 32 such that an input confirmation is output in Spanish.
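The above example may be traced with the following sketch. The word table and the function name are hypothetical; only the instruction, the resulting input command ci, the output information lng and the counts are taken from the example.

```python
from collections import Counter

# Hypothetical word table: spoken word -> (language, meaning).
LEXICON = {
    "dial": ("English", "DIAL"),
    "cuatro": ("Spanish", "4"),
    "siete": ("Spanish", "7"),
    "seis": ("Spanish", "6"),
    "dos": ("Spanish", "2"),
    "ocho": ("Spanish", "8"),
    "cinco": ("Spanish", "5"),
}


def interpret(utterance):
    """Return the input command ci and the per-word language list lng."""
    tags, digits = [], []
    for word in utterance.lower().split():
        language, meaning = LEXICON[word]
        tags.append(language)
        if meaning.isdigit():
            digits.append(meaning)
    ci = "Dial the telephone number " + "".join(digits)
    return ci, tags


ci, lng = interpret("Dial cuatro siete seis dos ocho cinco siete")
counts = Counter(lng)                          # role of the counter 31
output_language = counts.most_common(1)[0][0]  # role of the language switch 32
```

With the instruction above, `counts` holds one “English” and seven “Spanish”, so `output_language` is “Spanish”.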
The speech output interface 20 includes translation modules 33 and, respectively 34, which are provided downstream of the language switch 32, for translating the output information co into the first and, respectively, second language. If the content of the output information co reads “input confirmation,” then, in accordance with an exemplary embodiment, this output information is converted or translated by the translation module 33 into “thanks” or by the translation module 34 into “gracias.” The corresponding translation for the case when the confirmation is performed in Spanish, namely “gracias,” is a text command and is transformed through the use of a text-to-speech module 39, trained by means of the first and the second language, for a speech output into an input signal sp for the loudspeaker 5. A speech output database 37 and a multilingual voice 38 are provided for this purpose. The speech output database 37, a detail of which is shown in an exemplary manner in
In order to supplement the speech output database 37 with new entries, a grapheme-to-phoneme module 35, which is assigned to the first language, and a grapheme-to-phoneme module 36, which is assigned to the second language, are provided.
Although, for reasons of clarity, the invention has been described only in conjunction with two languages, the invention is not intended to be limited to a bilingual system. Rather, the invention is to be used in particular with more than two languages.
Bergmann, Carsten, Imam, M. Kashif, Prieto, Ramon, Williams, Carly, Cheung, Wai Yin
Patent | Priority | Assignee | Title |
10388269, | Sep 10 2013 | Hyundai Motor Company; Kia Corporation | System and method for intelligent language switching in automated text-to-speech systems |
10839793, | Apr 16 2018 | GOOGLE LLC | Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface |
10896672, | Apr 16 2018 | GOOGLE LLC | Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface |
11017766, | Apr 16 2018 | GOOGLE LLC | Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface |
11195510, | Sep 10 2013 | Hyundai Motor Company; Kia Corporation | System and method for intelligent language switching in automated text-to-speech systems |
11530930, | Sep 19 2017 | VOLKSWAGEN AKTIENGESELLSCHAFT | Transportation vehicle control with phoneme generation |
11735173, | Apr 16 2018 | GOOGLE LLC | Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface |
11798541, | Apr 16 2018 | GOOGLE LLC | Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface |
11817084, | Apr 16 2018 | GOOGLE LLC | Adaptive interface in a voice-based networked system |
11817085, | Apr 16 2018 | GOOGLE LLC | Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface |
7949517, | Dec 01 2006 | Deutsche Telekom AG | Dialogue system with logical evaluation for language identification in speech recognition |
9536521, | Jun 30 2014 | Xerox Corporation | Voice recognition |
9640173, | Sep 10 2013 | Hyundai Motor Company; Kia Corporation | System and method for intelligent language switching in automated text-to-speech systems |
Patent | Priority | Assignee | Title |
6085160, | Jul 10 1998 | Nuance Communications, Inc | Language independent speech recognition |
6243675, | Sep 16 1999 | Denso Corporation | System and method capable of automatically switching information output format |
6272464, | Mar 27 2000 | Alcatel-Lucent USA Inc | Method and apparatus for assembling a prediction list of name pronunciation variations for use during speech recognition |
7149688, | Nov 04 2002 | Microsoft Technology Licensing, LLC | Multi-lingual speech recognition with cross-language context modeling |
7181395, | Oct 27 2000 | Nuance Communications, Inc | Methods and apparatus for automatic generation of multiple pronunciations from acoustic data |
7277846, | Apr 14 2000 | Alpine Electronics, Inc | Navigation system |
7292980, | Apr 30 1999 | Alcatel Lucent | Graphical user interface and method for modifying pronunciations in text-to-speech and speech recognition systems |
7328155, | Sep 25 2002 | TOYOTA INFOTECHNOLOGY CENTER CO , LTD | Method and system for speech recognition using grammar weighted based upon location information |
7415411, | Mar 04 2004 | CLUSTER, LLC; Optis Wireless Technology, LLC | Method and apparatus for generating acoustic models for speaker independent speech recognition of foreign words uttered by non-native speakers |
7457755, | Jan 19 2004 | SAMSUNG ELECTRONICS CO , LTD | Key activation system for controlling activation of a speech dialog system and operation of electronic devices in a vehicle |
20020095282, | |||
20050033575, | |||
20050197842, | |||
20050273337, | |||
20060100871, | |||
20070005206, | |||
DE102004055609, | |||
DE102004061782, | |||
EP953896, | |||
EP1693828, | |||
WO231814, | |||
WO250817, | |||
WO3060877, | |||
WO128187, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Oct 17 2006 | BERGMANN, CARSTEN | VOLKSWAGEN OF AMERICA, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 025474 | /0938 | |
Oct 18 2006 | PRIETO, RAMON | VOLKSWAGEN OF AMERICA, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 025474 | /0938 | |
Oct 18 2006 | CHEUNG, WAI YIN | VOLKSWAGEN OF AMERICA, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 025474 | /0938 | |
Oct 18 2006 | WILLIAMS, CARLY | VOLKSWAGEN OF AMERICA, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 025474 | /0938 | |
Oct 31 2006 | IMAM, M KASHIF | VOLKSWAGEN OF AMERICA, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 025474 | /0938 | |
Nov 09 2006 | Volkswagen of America, Inc. | (assignment on the face of the patent) | / |
Date | Maintenance Fee Events |
Jul 14 2014 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity. |
Jul 13 2018 | M1552: Payment of Maintenance Fee, 8th Year, Large Entity. |
Jul 05 2022 | M1553: Payment of Maintenance Fee, 12th Year, Large Entity. |
Date | Maintenance Schedule |
Jan 18 2014 | 4 years fee payment window open |
Jul 18 2014 | 6 months grace period start (w surcharge) |
Jan 18 2015 | patent expiry (for year 4) |
Jan 18 2017 | 2 years to revive unintentionally abandoned end. (for year 4) |
Jan 18 2018 | 8 years fee payment window open |
Jul 18 2018 | 6 months grace period start (w surcharge) |
Jan 18 2019 | patent expiry (for year 8) |
Jan 18 2021 | 2 years to revive unintentionally abandoned end. (for year 8) |
Jan 18 2022 | 12 years fee payment window open |
Jul 18 2022 | 6 months grace period start (w surcharge) |
Jan 18 2023 | patent expiry (for year 12) |
Jan 18 2025 | 2 years to revive unintentionally abandoned end. (for year 12) |