A sound collection apparatus for far-field voice includes a multi-channel analog sound receiver configured to convert an obtained sound signal into an electrical signal; a first analog-to-digital converter coupled to the multi-channel analog sound receiver and configured to convert the electrical signal into a digital signal; and an interface controller coupled to the analog-to-digital converter and configured to transmit the digital signal to a control device via a preset interface. Using the above solutions, the technical problems of high hardware cost and unguaranteed performance of existing sound collection devices can be solved, and the technical effects of effectively reducing the hardware cost and the difficulties of development are achieved.
16. A method comprising:
converting, by a multi-channel analog sound receiver, a sound signal into an electrical signal;
converting, by a first analog-to-digital converter coupled to the multi-channel analog sound receiver, the electrical signal into a first digital signal;
converting, by a second analog-to-digital converter coupled to a control device, a playback reference signal to a second digital signal, the playback reference signal indicating an additional sound signal generated by the control device;
de-noising, by an interface controller coupled to the first analog-to-digital converter and the second analog-to-digital converter, the first digital signal using the second digital signal to remove the additional sound signal; and
transmitting, by the interface controller, the de-noised first digital signal to the control device via a preset interface.
1. An apparatus comprising:
a multi-channel analog sound receiver configured to convert a sound signal into an electrical signal;
a first analog-to-digital converter coupled to the multi-channel analog sound receiver and configured to convert the electrical signal into a first digital signal;
a second analog-to-digital converter coupled to a control device and configured to convert a playback reference signal to a second digital signal, the playback reference signal indicating an additional sound signal generated by the control device; and
an interface controller coupled to the first analog-to-digital converter and the second analog-to-digital converter and configured to de-noise the first digital signal using the second digital signal to remove the additional sound signal and transmit the de-noised first digital signal to the control device via a preset interface.
2. The apparatus of
3. The apparatus of
4. The apparatus of
5. The apparatus of
6. The apparatus of
9. The apparatus of
10. The apparatus of
11. The apparatus of
14. The apparatus of
15. The apparatus of
17. The method of
This application claims priority to Chinese Patent Application No. 201711107934.5, filed on 10 Nov. 2017, entitled “Sound Collection Apparatus for Far-Field Voice,” which is hereby incorporated by reference in its entirety.
The present disclosure relates to the technical field of hardware devices, and particularly to sound collection apparatuses for far-field voice.
With the rapid development of smart devices and the increasing demand for human-computer interactions, original control methods for smart devices appear to be relatively complicated.
As a relatively simple interaction and control method, voice interaction can greatly improve the convenience of controlling smart devices. Voice interaction is a method of communication using natural voice. This type of method enables all smart devices to communicate in a unified and simple manner, which reduces the complexity of controlling the smart devices.
However, voice interaction necessarily involves the acquisition of voice, and existing collection structures generally suffer from high hardware development cost and low performance, which makes them unsuitable for general household devices or relatively low-cost devices.
No effective solution has yet been proposed in response to the above problems.
This Summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This Summary is not intended to identify all key features or essential features of the claimed subject matter, nor is it intended to be used alone as an aid in determining the scope of the claimed subject matter. The term “techniques,” for instance, may refer to device(s), system(s), method(s) and/or processor-readable/computer-readable instructions as permitted by the context above and throughout the present disclosure.
Embodiments of the present disclosure provide a sound collection apparatus for far-field voice, so as to achieve the technical effects of reducing the cost of a sound collection device while fully guaranteeing the system performance. The apparatus includes a multi-channel analog sound receiver configured to convert an obtained sound signal into an electrical signal; a first analog-to-digital converter coupled to the multi-channel analog sound receiver and configured to convert the electrical signal into a digital signal; and an interface controller coupled to the analog-to-digital converter and configured to transmit the digital signal to a control device via a preset interface. In the embodiments of the present disclosure, an obtained sound signal is converted into an electrical signal by a multi-channel analog sound receiver, and the electrical signal is converted into a digital signal by an analog-to-digital converter. The converted digital signal is transmitted to a control device through an interface controller for processing sound data. The sound collection apparatus is relatively simple to implement and has a lower cost. As such, the technical problems of high hardware cost and unguaranteed performance of existing sound collection devices can be solved, and the technical effects of effectively reducing the hardware cost and the difficulties of development are achieved.
Accompanying drawings described herein are provided for a further understanding of the present disclosure, and constitute a part of the present disclosure, which are not intended to limit the present disclosure. In the drawings:
In order to make the goals, technical solutions, and advantages of the present disclosure more comprehensible, the present disclosure will be further described in detail hereinafter in conjunction with implementations and drawings. The illustrative implementations of the present disclosure and the description thereof are intended to explain the present disclosure, but are not intended to limit the present disclosure.
As shown in
In an implementation, a multi-channel analog sound receiver 101 is configured to convert an acquired sound signal into an electrical signal, and is an energy conversion device.
The multi-channel analog sound receiver 101 can be a two-channel, four-channel or eight-channel analog sound receiver, or the like. The specific number of channels may be selected according to actual needs, which is not limited in the present disclosure. For example, the above multi-channel analog sound receiver may be a microphone array.
For example, a 4 MIC, i.e., a 4-channel analog sound receiver, may be selected. The MIC is selected for sound acquisition mainly to ensure a sufficient far-field collection effect. The MIC has advantages such as a high signal-to-noise ratio and a high sensitivity. When selecting a multi-channel analog sound receiver, one having a signal-to-noise ratio of 65 dB or more may be selected.
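The 65 dB selection criterion above can be illustrated with a short calculation. The following is a minimal sketch only: the function names `rms`, `snr_db` and `meets_snr_floor` are hypothetical, and the sketch assumes that RMS amplitudes of the signal and of the noise floor are already available (for example, from a vendor datasheet or a bench measurement), which the present disclosure does not specify.

```python
import math

def rms(samples):
    """Root-mean-square amplitude of a sequence of samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))

def snr_db(signal_rms, noise_rms):
    """Signal-to-noise ratio in decibels, computed from two RMS amplitudes."""
    return 20.0 * math.log10(signal_rms / noise_rms)

def meets_snr_floor(signal_rms, noise_rms, floor_db=65.0):
    """True if the ratio reaches the floor (65 dB is the figure given in the text)."""
    return snr_db(signal_rms, noise_rms) >= floor_db

# Illustrative amplitudes only (assumed values, not measurements).
candidate_ok = meets_snr_floor(1.0, 0.0005)   # ratio of 2000:1
candidate_low = meets_snr_floor(1.0, 0.001)   # ratio of 1000:1
```

A 1000:1 amplitude ratio corresponds to 60 dB, so the second candidate would fall short of the 65 dB floor, while the 2000:1 candidate would pass.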
A first analog-to-digital converter 102 is coupled to multi-channel analog sound receiver 101, and is configured to convert the electrical signal into a digital signal. For example, the following types of analog-to-digital converters can be used: Conexant Semiconductor Company's CX20810-11Z, Conexant Semiconductor Company's CX20811-11Z, Nuvoton Technology Corporation Limited's NAU85L40YG, Nuvoton Technology Corporation Limited's NAU85L40BYG, X-Powers Technology's AC108.
It should be noted that the listed models of the analog-to-digital converter are only exemplary. In practical implementations, other types of analog-to-digital converters may be used, and any chip or device capable of performing analog-to-digital conversion of a voice signal or the like can be selected, which is not limited by the present disclosure.
The interface controller 103 is coupled to the analog-to-digital converter 102, and is configured to transmit the digital signal to the control device 104 via a preset interface. Preferably, the ALC4042 chip of Realtek Semiconductor can be selected, and correspondingly, the preset interface is a USB interface. In practical implementations, other types of interfaces may also be selected according to actual conditions, for example, a headphone interface, etc.
Because sounds are sometimes generated by the control device 104 itself and constitute noise, the accuracy of the microphone signal is affected. In order to eliminate the influence of this noise, the corresponding data can be obtained and used to de-noise the sound signal obtained by the multi-channel analog sound receiver 101. As such, as shown in
The playback reference signal can be used as an analog reference signal for de-noising the sound signal obtained by the multi-channel analog sound receiver. The playback reference signal is a signal from the control device, and can be transmitted to the sound collection apparatus in a variety of ways, for example, through a wired means or a wireless means. The wired means may be a transmission to the sound collection apparatus through a dedicated connection line, and the wireless means may be a direct transmission to the sound collection apparatus through a wireless signal, which are not limited in the present disclosure. In this example, the wired means is used as an example. As shown in
In an embodiment, as shown in
Signals arrive from two directions: one from the multi-channel analog sound receiver 101 and the other from the control device 104. For analog-to-digital conversion, the second analog-to-digital converter 201 and the first analog-to-digital converter 102 may therefore be deployed separately, each independently converting the signal from one direction, so that the signals from the two directions do not affect each other during analog-to-digital conversion.
Considering that the actual analog reference channel can be a two-channel analog reference channel and that the analog microphone can be a 4-channel analog microphone, the second analog-to-digital converter 201 and the first analog-to-digital converter 102 can each be a four-channel analog-to-digital conversion chip in order to meet these requirements. Serial transmission of 8 channels of data can be realized through a TDM (Time Division Multiplexing) mode of an I2S (Inter-IC Sound, a serial bus for digital audio between integrated circuits) interface, and two extra analog-to-digital conversion chips can be used for expansion.
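The 8-channel TDM transmission described above can be sketched on the receiving side as a de-interleaving step. This is an illustrative sketch under stated assumptions, not the disclosed implementation: the helper names `deinterleave_tdm` and `split_mic_and_reference` are hypothetical, and the slot assignment (slots 0 to 3 for the microphones, slots 4 and 5 for the playback reference, slots 6 and 7 unused) is assumed for illustration only, since the present disclosure does not fix a slot order.

```python
def deinterleave_tdm(samples, slots=8):
    """Split a flat, slot-interleaved sample stream into one list per TDM slot."""
    if len(samples) % slots != 0:
        raise ValueError("stream length must be a whole number of frames")
    return [list(samples[slot::slots]) for slot in range(slots)]

def split_mic_and_reference(channels, mic_slots=(0, 1, 2, 3), ref_slots=(4, 5)):
    """Separate microphone channels from reference channels (assumed slot layout)."""
    mics = [channels[i] for i in mic_slots]
    refs = [channels[i] for i in ref_slots]
    return mics, refs

# Two frames of 8 slots; each value encodes frame * 10 + slot for readability.
stream = [0, 1, 2, 3, 4, 5, 6, 7,
          10, 11, 12, 13, 14, 15, 16, 17]
channels = deinterleave_tdm(stream)
mics, refs = split_mic_and_reference(channels)
```

Here `mics` holds four per-microphone sample lists and `refs` holds the two reference channels, which is the form an echo-cancellation stage would typically consume.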
In the above example, performing an analog-to-digital conversion of a playback reference signal by the second analog-to-digital converter 201 is used as an example. In practical implementations, the playback reference signal may also be the one as shown in
However, it is worth noting that the number of channels of an analog reference channel and the number of channels of an analog microphone are only described as an example, and may be selected as needed in practical implementations, which is not limited in the present disclosure. Further, the number of channels of an analog-to-digital conversion chip can be selected according to the number of channels of a corresponding analog reference channel and the number of channels of an analog microphone.
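The relationship between channel counts and converter chips can be worked through numerically. The sketch below is only an illustration: the function names `adc_chips_needed` and `tdm_bit_rate` are hypothetical, and the 16 kHz sample rate and 16-bit slot width used in the example are assumed values chosen for the arithmetic.

```python
import math

def adc_chips_needed(channels, channels_per_chip=4):
    """Number of converter chips required to cover the given channel count."""
    return math.ceil(channels / channels_per_chip)

def tdm_bit_rate(slots, sample_rate_hz, bits_per_slot):
    """Raw serial bit rate of a TDM stream: slots x sample rate x slot width."""
    return slots * sample_rate_hz * bits_per_slot

mic_chips = adc_chips_needed(4)          # 4 microphone channels
ref_chips = adc_chips_needed(2)          # 2 reference channels
slots = (mic_chips + ref_chips) * 4      # each 4-channel chip contributes 4 slots
rate_bps = tdm_bit_rate(slots, 16_000, 16)
```

With these assumed figures, one chip per signal direction yields 8 slots and a raw rate of 2,048,000 bit/s; moving to an 8-channel microphone would raise the microphone chip count to two.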
The control device 104 may include, but is not limited to, at least one of the following: a computer, a television, a set top box, a robot, or a smart speaker.
In an embodiment, the sound collection apparatus for far-field voice may be provided as an external device or integrated in the control device. Which method is specifically adopted may be selected according to actual requirements, which is not limited in the present disclosure.
The obtained sound signal is converted into an electrical signal by the multi-channel analog sound receiver. The electrical signal is converted into a digital signal by the analog-to-digital converter, and the converted digital signal is transmitted to the control device through the interface controller for processing sound data. The sound collection apparatus is simpler to implement and has a lower cost, thus solving the technical problems of high hardware cost and unguaranteed performance of existing sound collection devices, and achieving the technical effects of effectively reducing the hardware cost and the difficulties of development.
Based on the sound collection apparatus for far-field voice as shown in
S501: Obtain a sound signal through the multi-channel analog sound receiver, and obtain a playback reference signal from the control device through the analog reference channel formed by the connection line.
S502: Convert the sound signal and the playback reference signal into digital signals through the analog-to-digital converter.
S503: Transmit the digital signals to the control device through the interface controller.
S504: Perform operations such as signal processing, wake-up term detection, and voice recognition through the control device.
The sound collection apparatus for far-field voice will be described hereinafter in conjunction with particular embodiments. However, it is worth noting that the particular embodiments are only used for better explanation of the present disclosure and do not constitute an improper limitation to the present disclosure.
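Steps S502 and S503 above can be sketched in software as quantization followed by framing. This is a minimal sketch under stated assumptions: real conversion happens in the converter hardware, the function names `quantize_16bit`, `pack_frames` and `unpack_frames` are hypothetical, and the 16-bit little-endian frame layout is assumed for illustration rather than taken from the present disclosure.

```python
import struct

def quantize_16bit(analog_samples):
    """S502 (sketch): map floats in [-1.0, 1.0] to signed 16-bit integers, clipping overflow."""
    out = []
    for x in analog_samples:
        x = max(-1.0, min(1.0, x))
        out.append(int(round(x * 32767)))
    return out

def pack_frames(mic_pcm, ref_pcm):
    """S503 (sketch): interleave all channels frame by frame into little-endian bytes."""
    channels = mic_pcm + ref_pcm
    payload = bytearray()
    for frame in zip(*channels):
        payload += struct.pack("<%dh" % len(frame), *frame)
    return bytes(payload)

def unpack_frames(payload, channel_count):
    """Control-device side: recover per-channel sample lists from the byte stream."""
    total = len(payload) // 2  # two bytes per 16-bit sample
    flat = struct.unpack("<%dh" % total, payload)
    return [list(flat[c::channel_count]) for c in range(channel_count)]

# Two microphone channels and one reference channel, two samples each.
mic = [quantize_16bit([0.0, 1.0]), quantize_16bit([-1.0, 0.0])]
ref = [quantize_16bit([2.0, 0.0])]  # 2.0 is out of range and is clipped
payload = pack_frames(mic, ref)
recovered = unpack_frames(payload, 3)
```

The round trip returns the same per-channel lists that were packed, so the control device can hand microphone and reference data to its signal-processing stage (S504) unchanged.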
In the present example, voice data is acquired through a microphone array module. Specifically, as shown in
Each module unit in
1) 4MIC denotes four analog microphones, which are used for converting sound signals into electrical signals. The MIC is selected mainly to ensure a sufficient far-field collection effect, because it has advantages such as a high signal-to-noise ratio and a high sensitivity. The signal-to-noise ratio is required to be above 65 dB.
However, it is worth noting that the above only uses 4MIC as an example. In practical implementations, other types of MICs, such as 8MIC, can be used, and the number of channels in MIC can be selected according to actual requirements.
2) An analog reference channel can obtain 2Analog Ref, that is, two analog references. 2Analog Ref is an external sound emitted by the control device itself, and can be called an echo signal. Based on an acquired echo signal, an echo cancellation algorithm can eliminate the external sound and retain a user's voice, thereby eliminating the interference of the control device on the user's voice.
3) ALC4042, which is used for implementing audio data stream transmission over a USB interface. In implementations, ALC4042 can be used to support a 16 kHz sample rate and 16-bit, 24-bit and 32-bit data sample transmissions.
4) An analog-to-digital converter, which can be the one as shown in
The ADC can be, but is not limited to, one of the following models: CX20810, NAU85L40, AC108, etc.
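The echo cancellation mentioned in item 2) above can be illustrated with a small adaptive filter. The present disclosure does not name a specific algorithm; the sketch below swaps in a normalized least-mean-squares (NLMS) filter as one common choice. The function name `nlms_echo_cancel`, the tap count, the step size and the synthetic two-tap echo path are all assumptions made for illustration.

```python
import random

def nlms_echo_cancel(mic, ref, taps=4, mu=0.5, eps=1e-8):
    """Subtract an adaptively filtered copy of `ref` from `mic`; return the residual."""
    weights = [0.0] * taps
    history = [0.0] * taps  # most recent reference samples, newest first
    residual = []
    for m, r in zip(mic, ref):
        history = [r] + history[:-1]
        echo_estimate = sum(w * h for w, h in zip(weights, history))
        e = m - echo_estimate
        norm = sum(h * h for h in history) + eps
        # Normalized update: step size scaled by the reference signal power.
        weights = [w + mu * e * h / norm for w, h in zip(weights, history)]
        residual.append(e)
    return residual

# Synthetic test: the microphone hears only the device's own playback,
# passed through an assumed two-tap echo path (0.6 direct, 0.3 delayed).
rng = random.Random(0)
ref = [rng.uniform(-1.0, 1.0) for _ in range(2000)]
mic = [0.6 * ref[n] + (0.3 * ref[n - 1] if n > 0 else 0.0) for n in range(2000)]
residual = nlms_echo_cancel(mic, ref)
```

In this synthetic case the residual should decay toward zero once the filter has adapted, because the microphone signal contains nothing but echo; with a user's voice mixed in, the residual would ideally retain that voice while the playback component is suppressed.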
The sound collection apparatus can be deployed as an external device, and can be plugged and unplugged through a USB interface or other form of interface, which is relatively convenient to use and has a better compatibility. The playback reference signal can be transmitted to the sound collection apparatus through a connection line drawn from the control device.
Furthermore, the sound collection apparatus has a relatively high level of integration, good sound collection performance, and a relatively low hardware cost.
The sound collection apparatus will be described hereinafter using a specific implementation setting.
The sound collection apparatus can be applied in a television set. The sound collection apparatus is integrated as a separate modular device with a USB plug. A playback reference signal can be transmitted to the sound collection apparatus in various ways. For example, transmission to the sound collection apparatus can be made through a wired means or a wireless means. For example, the wired means may be transmission to the sound collection apparatus through a dedicated connection line and the wireless means may be transmission of a playback reference signal to the sound collection apparatus directly through a wireless signal, which are not limited in the present disclosure. In the present example, transmitting a playback reference signal through a dedicated connection line is used as an example. An interface and a USB plug in the sound collection apparatus that are connected to this connection line can be integrated together. As shown in
The sound collection apparatus may also be plugged into an external device of the television set, such as a set top box or the like. The sound collection apparatus is inserted into the set top box, and interactions between the sound collection apparatus and the television set are realized through interactions between the set top box and the television set.
In an embodiment, the sound collection apparatus may also be a functional module integrated in the television set, which is built in when the television is shipped from the factory.
By deploying the sound collection apparatus in the television, a user's remote voice can be acquired by the sound collection apparatus and then transmitted to a main control chip of the television. Operations such as signal processing, wake-up term detection, and voice recognition are performed by the main control chip.
In the embodiments of the present disclosure, an obtained sound signal is converted into an electrical signal by a multi-channel analog sound receiver, and the electrical signal is converted into a digital signal by an analog-to-digital converter. The converted digital signal is transmitted to a control device through an interface controller for processing sound data. This type of sound collection apparatus is relatively simple to implement and has a lower cost. As such, the technical problems of high hardware cost and unguaranteed performance of existing sound collection devices can be solved, and the technical effects of effectively reducing the hardware cost and the difficulties of development are achieved.
The above description presents only exemplary embodiments of the present disclosure, and is not intended to limit the present disclosure. For one skilled in the art, various changes and modifications can be made to the embodiments of the present disclosure. Any modifications, equivalent replacements, improvements, etc. that are made within the spirit and scope of the present disclosure should be included within the scope of protection of the present disclosure.
The present disclosure can be further understood using the following clauses.
Clause 1: A sound collection apparatus for far-field voice comprising: a multi-channel analog sound receiver configured to convert an obtained sound signal into an electrical signal; a first analog-to-digital converter coupled to the multi-channel analog sound receiver and configured to convert the electrical signal into a digital signal; and an interface controller coupled to the analog-to-digital converter and configured to transmit the digital signal to a control device via a preset interface.
Clause 2: The apparatus of Clause 1, further comprising a second analog-to-digital converter, the second analog-to-digital converter being coupled between the control device and the interface controller, and configured to receive and convert a playback reference signal of the control device to a digital signal, and transmit the digital signal to the interface controller, the playback reference signal being used for de-noising the sound signal.
Clause 3: The apparatus of Clause 2, wherein the first analog-to-digital converter is a 4-channel analog-to-digital conversion chip, and the second analog-to-digital converter is a 4-channel analog-to-digital conversion chip.
Clause 4: The apparatus of Clause 1, wherein the multi-channel analog sound receiver comprises a 4-channel analog sound receiver.
Clause 5: The apparatus of Clause 1, wherein a model of the analog-to-digital converter comprises at least one of: CX20810, NAU85L40, or AC108.
Clause 6: The apparatus of Clause 1, wherein the preset interface comprises a USB interface.
Clause 7: The apparatus of Clause 1, wherein the interface controller is an ALC4042 chip.
Clause 8: The apparatus of any one of Clauses 1-7, wherein the control device comprises at least one of: a computer, a television, a set top box, a robot, or a smart speaker.
Clause 9: The apparatus of any one of Clauses 1-7, wherein the multi-channel analog sound receiver comprises a microphone array.
Fu, Qiang, Yang, Zhihui, Yan, Zhijie
Patent | Priority | Assignee | Title |
4013841, | Dec 23 1971 | Matsushita Electric Industrial Co., Ltd. | Four-channel stereo receiver |
8005682, | Oct 31 2007 | AVAGO TECHNOLOGIES INTERNATIONAL SALES PTE LIMITED | Control of a non-active channel in a multi-channel receiver |
8184822, | Apr 28 2009 | Bose Corporation | ANR signal processing topology |
9659555, | Feb 09 2016 | Amazon Technologies, Inc. | Multichannel acoustic echo cancellation |
20090112603, | |||
20160260441, | |||
20170243577, | |||
20190096398, | |||
CN106601225, | |||
CN1375178, | |||
CN201781567, | |||
CN203522988, | |||
CN2590317, | |||
WO2014040667, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Nov 09 2018 | Alibaba Group Holding Limited | (assignment on the face of the patent) | / | |||
Apr 10 2019 | FU, QIANG | Alibaba Group Holding Limited | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 051284 | /0179 | |
Apr 10 2019 | YAN, ZHIJIE | Alibaba Group Holding Limited | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 051284 | /0179 | |
Jun 10 2019 | YANG, ZHIHUI | Alibaba Group Holding Limited | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 051284 | /0179 |
Date | Maintenance Fee Events |
Nov 09 2018 | BIG: Entity status set to Undiscounted (note the period is included in the code). |
Aug 16 2024 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity. |
Date | Maintenance Schedule |
Feb 16 2024 | 4 years fee payment window open |
Aug 16 2024 | 6 months grace period start (w surcharge) |
Feb 16 2025 | patent expiry (for year 4) |
Feb 16 2027 | 2 years to revive unintentionally abandoned end. (for year 4) |
Feb 16 2028 | 8 years fee payment window open |
Aug 16 2028 | 6 months grace period start (w surcharge) |
Feb 16 2029 | patent expiry (for year 8) |
Feb 16 2031 | 2 years to revive unintentionally abandoned end. (for year 8) |
Feb 16 2032 | 12 years fee payment window open |
Aug 16 2032 | 6 months grace period start (w surcharge) |
Feb 16 2033 | patent expiry (for year 12) |
Feb 16 2035 | 2 years to revive unintentionally abandoned end. (for year 12) |