The present invention relates to a method for receiving data transmitted acoustically. The method includes receiving an acoustically transmitted signal encoding data; processing the received signal to minimise environmental interference within the received signal; and decoding the processed signal to extract the data. The data is encoded within the signal using a sequence of tones. A method for encoding data for acoustic transmission is also disclosed. This method includes encoding data into an audio signal using a sequence of tones. The audio signal in this method is configured to minimise environmental interference. A system and software are also disclosed.
18. A method for encoding data for acoustic transmission, including:
encoding the data into an audio signal comprising a sequence of tones encoding the data,
wherein the audio signal is configured to minimize environmental interference by configuring the sequence of tones to insert a space between at least some tones of the sequence of tones within the audio signal and to include a tone length that is proportional to a reverberation time of a room where the audio signal is to be transmitted.
32. A method for encoding data for acoustic transmission, including:
encoding the data into an audio signal comprising a sequence of tones encoding the data,
wherein the audio signal is configured to minimize environmental interference by sharpening an amplitude envelope of each tone of the sequence of tones within the audio signal and by configuring the sequence of tones to include a tone length that is proportional to a reverberation time of a room where the audio signal is to be transmitted.
30. An apparatus for encoding data for acoustic transmission, the apparatus comprising one or more processors configured to:
encode the data into an audio signal including a sequence of tones encoding the data,
wherein the audio signal is configured to minimize environmental interference by configuring the sequence of tones to insert a space between at least some tones of the sequence of tones within the audio signal and to include a tone length that is proportional to a reverberation time of a room where the audio signal is to be transmitted.
35. An apparatus for encoding data for acoustic transmission, the apparatus comprising one or more processors configured to:
encode the data into an audio signal including a sequence of tones encoding the data,
wherein the audio signal is configured to minimize environmental interference by sharpening an amplitude envelope of each tone of the sequence of tones within the audio signal and by configuring the sequence of tones to include a tone length that is proportional to a reverberation time of a room where the audio signal is to be transmitted.
1. A method for receiving data transmitted acoustically, including:
receiving an acoustically transmitted audio signal comprising a sequence of tones encoding the data;
processing the received audio signal to minimize environmental interference within the received audio signal, wherein the received audio signal includes a space between at least some tones of the sequence of tones and a tone length that is proportional to a reverberation time of a room where the audio signal was transmitted; and
decoding the processed signal to extract the data.
13. An apparatus for receiving data transmitted acoustically, the apparatus comprising one or more processors configured to:
receive an acoustically transmitted audio signal comprising a sequence of tones encoding the data;
process the received audio signal to minimize environmental interference within the received audio signal, wherein the received audio signal includes a space between at least some tones of the sequence of tones and a tone length that is proportional to a reverberation time of a room where the audio signal was transmitted; and
decode the processed signal to extract the data.
31. A non-transitory computer readable medium having stored therein computer-readable instructions that, when executed by one or more processors, cause the one or more processors to:
encode data into an audio signal including a sequence of tones encoding the data,
wherein the audio signal is configured to minimize environmental interference by configuring the sequence of tones to insert a space between at least some tones of the sequence of tones within the audio signal and to include a tone length that is proportional to a reverberation time of a room where the audio signal is to be transmitted.
36. A non-transitory computer readable medium having stored therein computer-readable instructions that, when executed by one or more processors, cause the one or more processors to:
encode data into an audio signal including a sequence of tones encoding the data,
wherein the audio signal is configured to minimize environmental interference by sharpening an amplitude envelope of each tone of the sequence of tones within the audio signal and by configuring the sequence of tones to include a tone length that is proportional to a reverberation time of a room where the audio signal is to be transmitted.
15. A non-transitory computer readable medium having stored therein computer-readable instructions that, when executed by one or more processors, cause the one or more processors to:
receive an acoustically transmitted audio signal comprising a sequence of tones encoding data;
process the received audio signal to minimize environmental interference within the received audio signal, wherein the received audio signal includes a space between at least some tones of the sequence of tones and a tone length that is proportional to a reverberation time of a room where the audio signal was transmitted; and
decode the processed signal to extract the data.
16. A system for receiving data transmitted acoustically, including:
a first device comprising a speaker for acoustically transmitting an audio signal comprising a sequence of tones encoding the data and one or more processors; and
a second device comprising a microphone for acoustically receiving an audio signal including the transmitted audio signal and environmental interference and one or more processors configured to:
process the received audio signal to minimize the environmental interference within the received audio signal, wherein the received audio signal includes a space between at least some tones of the sequence of tones and a tone length that is proportional to a reverberation time of a room where the transmitted audio signal was transmitted; and
decode the processed signal to extract the data.
2. A method as claimed in
3. A method as claimed in
5. A method as claimed in
6. A method as claimed in
7. A method as claimed in
8. A method as claimed in
9. A method as claimed in
10. A method as claimed in
11. A method as claimed in
12. The method as claimed in
14. The apparatus of
17. The system of
19. A method as claimed in
20. A method as claimed in
21. A method as claimed in
22. A method as claimed in
24. A method as claimed in
25. A method as claimed in
26. A method as claimed in
27. A method as claimed in
28. A method as claimed in
29. A method as claimed in
33. A method as claimed in
This application is the U.S. national phase of International Application No. PCT/GB2017/053112 filed 13 Oct. 2017, which designated the U.S. and claims priority to GB Patent Application No. 1617409.6 filed 13 Oct. 2016, the entire contents of each of which are hereby incorporated by reference.
The present invention is in the field of data communication. More particularly, but not exclusively, the present invention relates to a method and system for acoustic communication of data.
There are a number of solutions to communicating data wirelessly over a short range to and from devices. The most typical of these is WiFi. Other examples include Bluetooth and Zigbee.
An alternative solution for short-range data communication is described in U.S. Patent Publication Ser. No. 12/926,470, DATA COMMUNICATION SYSTEM. This system, invented by Patrick Bergel and Anthony Steed, involves the transmission of data using an audio signal transmitted from a speaker and received by a microphone. This system involves the encoding of data, such as a shortcode, into a sequence of tones within the audio signal.
This acoustic communication of data provides for novel and interesting applications. However, acoustic communication of data does involve unique problems. Specifically, because the signals are transmitted acoustically, the receiver receives a signal that may include substantial interference created by the environment in which the signal is transmitted, for example reverberation (including early/late reflections). At the point of receiving the audio, distortions caused by interference reduce reliable data rates because the decoder is less certain about a signal's original specification. For example, early reflections, which are coherent but delayed versions of the direct signal, usually created by an acoustic reflection from a hard surface, may make it more difficult for a decoder to confidently determine the precise start or end point of a signal feature/note. This decreases overall reliability. It is therefore preferable to reduce these effects at the receiver. Otherwise the data encoded within the signal can be difficult to accurately detect. This can result in non-communication of data in certain environments or under certain conditions within environments.
There is a desire to improve the acoustic communication of data.
It is an object of the present invention to provide a method and system for acoustic communication of data which overcomes the disadvantages of the prior art, or at least provides a useful alternative.
According to a first aspect of the invention there is provided a method for receiving data transmitted acoustically, including:
a) receiving an acoustically transmitted signal encoding data;
b) processing the received signal to minimise environmental interference within the received signal; and
c) decoding the processed signal to extract the data;
wherein the data is encoded within the transmitted signal using a sequence of tones.
The signal may be human-audible.
The environmental interference may be caused by/during transmission of the signal.
The environmental interference may be reverberation.
The received signal may be processed frame by frame. Each frame of the received signal may be processed to generate a Fast Fourier Transform (FFT).
The FFT for at least some of the frames may be processed to modify a magnitude in each bin of the FFT in accordance with a magnitude value of the corresponding bin in a preceding frame.
An impulse response of an acoustic environment may be calculated. The impulse response may be calculated via measurements of the acoustic space.
The impulse response may be processed to generate a transfer function. The received signal may be processed using the transfer function.
The signal may be received via a microphone.
According to a further aspect of the invention there is provided a method for encoding data for acoustic transmission, including:
encoding data into an audio signal using a sequence of tones;
wherein the audio signal is configured to minimise environmental interference.
Characteristics of at least some of the tones and/or sequence of tones may be modified to minimise the environmental interference. The characteristics may be modified based upon predictions of interference caused to the sequence of tones when received by a receiver. The predictions may relate to interference generated by acoustic transmission of the sequence of tones. The interference generated may be non-direct acoustic energy. The interference may be reverberation.
The audio signal may be configured by configuring the sequence of tones such that at least some of the tone frequencies are arranged from high to low. The at least some of the tone frequencies may correspond to a plurality of tones at the beginning of the signal.
The audio signal may be configured by configuring the sequence of tones to insert space between at least some of the tones within the signal.
The audio signal may be configured by sharpening the amplitude envelope of each tone signal within the audio signal.
The audio signal may be configured by configuring the sequence of tones to avoid repeating same or similar frequency tones one after the other.
The environmental interference may be reverberation.
The method of the above aspect may further include the step of acoustically transmitting the audio signal for receipt by a microphone.
Other aspects of the invention are described within the claims.
Embodiments of the invention will now be described, by way of example only, with reference to the accompanying drawings in which:
The present invention provides a method and system for the acoustic communication of data.
The inventors have discovered that, when the data is encoded in a sequence of tones, the received signal can be processed to minimise environmental interference before decoding, and that such processing enables more accurate decoding of the signal into the data. Furthermore, the inventors have discovered that the signal can be encoded before acoustic transmission to also minimise environmental interference, thereby improving the accuracy of data decoding by the recipient.
In
A first device 101 is shown. This device 101 may include a processor 101a and a speaker 102. The processor 101a may be configured to encode data into a sequence of tones within an audio signal. The signal may be encoded by the processor 101a to minimise environmental interference. The processor 101a may be configured to perform the method described in relation to
The device 101 may be configured to acoustically transmit the signal, for example, via the speaker 102.
The environmental interference may be that which would be generated by acoustic transmission of the signal by the speaker 102. The environmental interference may be distortion introduced by the speaker 102 or non-direct acoustic energies caused by this transmission such as reverberation. In this document, the term reverberation should be interpreted to cover first order reflections and echoes as well as true reverberation (e.g. later order reflections). The signal may be encoded by modifying characteristics of the tones and/or sequence of tones based upon, for example, predicting the environmental interference that would be caused to a signal received by a receiver.
The processor 101a and device 101 may encode and output the audio signal via a standard digital to analogue converter or via pulse-width modulation. Pulse-width modulation may be more efficient on very low power devices.
The audio signal may be encoded dynamically for immediate acoustic transmission or precomputed and stored in memory for later playback.
In embodiments, the processor 101a and speaker 102 may not be co-located at the same device. For example, the processor 101a may encode the data into the audio signal and transmit the audio signal to a device for acoustic transmission at the speaker 102. The audio signal may be stored at a memory before acoustic transmission.
A second device 103 is shown. This second device 103 may include or be connected to a microphone 104. The microphone 104 may be configured to receive signals acoustically transmitted, for example, by the first device 101, and to forward those signals to one or more processors 105 within the second device 103. In embodiments, the processor(s) 105 are not located within the second device 103. For example, the processor(s) 105 may be remotely located.
The microphone 104 and the processor(s) 105 may be connected via a communications bus or via a wired or wireless network connection.
The processor(s) 105 may be configured to process the signal to minimise environmental interference and to decode the signal to extract data. The data may have been encoded within the signal as a sequence of tones. The environmental interference may have been generated by acoustic transmission of the signal by a speaker (such as speaker 102) including, for example, distortion caused by the speaker or playback media (e.g. tape/vinyl/compression codecs) or non-direct acoustic energies such as reverberation.
The processor(s) 105 may be configured to perform the method described in relation to
In some embodiments, the microphone 104 may be configured with a narrow polar response to further mitigate environmental interference such as reverberation and any other non-direct acoustic energies.
In some embodiments, the second device may include multiple microphones 104 coordinated in a phased-array or beam-forming implementation to further mitigate environmental interference.
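By way of illustration only, one common beam-forming arrangement is delay-and-sum, sketched below. The description does not specify which beam-forming technique is used, so this is a swapped-in example under stated assumptions: the function name `delay_and_sum`, the use of whole-sample delays, and the idea that the delays are already known for the wanted direction are all hypothetical.

```python
from typing import List, Sequence


def delay_and_sum(channels: Sequence[Sequence[float]],
                  delays: Sequence[int]) -> List[float]:
    """Align each microphone channel by an integer sample delay and average.

    delays[i] is the assumed number of samples by which channel i lags the
    reference channel for sound arriving from the wanted direction.
    """
    if len(channels) != len(delays):
        raise ValueError("one delay per channel is required")
    if any(d < 0 for d in delays):
        raise ValueError("delays must be non-negative sample counts")
    # Only produce output where every shifted channel still has samples.
    length = min(len(ch) - d for ch, d in zip(channels, delays))
    if length <= 0:
        return []
    output = []
    for n in range(length):
        total = sum(ch[n + d] for ch, d in zip(channels, delays))
        output.append(total / len(channels))
    return output


# Two microphones: the second hears the same direct signal two samples later.
direct = [0.0, 1.0, 0.0, -1.0, 0.0, 1.0, 0.0, -1.0]
mic_a = direct + [0.0, 0.0]
mic_b = [0.0, 0.0] + direct
aligned = delay_and_sum([mic_a, mic_b], [0, 2])
```

Sound from the wanted direction adds coherently after alignment, whereas reflections arriving from other directions are misaligned and tend to be attenuated by the averaging.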
It will be appreciated by those skilled in the art that the above embodiments of the invention may be deployed on different devices and in differing architectures.
Referring to
In step 201, an acoustically transmitted signal is received (for example, via microphone 104). The signal encodes data. The data is encoded as a sequence of tones. The encoding format of the signal may include a header, error correction and a payload. An example of an encoding format is shown in
The signal may be human-audible, either fully or at least in part. For example, data may be encoded within the signal across a frequency spectrum which includes human-audible frequencies.
The inventors note that human-audible frequencies are particularly vulnerable to environmental interference caused by reverberation of the acoustically transmitted signal within the environment due to the sound absorption coefficient of materials being generally proportional to frequency (causing reverberation at human-audible frequencies but little reverberation at higher frequencies).
In step 202, the signal is processed to minimise environmental interference. The environmental interference may be non-direct acoustic energy having originally emanated from the signal transmitting device such as reverberation. The signal may be processed to minimise interference by artificially compounding the decay of non-direct energy.
In one embodiment, the signal may be processed using a fast Fourier transform (FFT) to produce bins of magnitudes across the spectrum. The FFT can be calculated on a per-frame basis. With the reverb cancellation values, the value passed to a decoding engine at a given frame $t$ ($Z_t$) is a combination of the current FFT magnitude ($X_t$) and a function of previous output values ($Y_{t-1}$):
$$Y_t = \alpha_b Y_{t-1} + (1 - \alpha_b) X_t$$
$$Z_t = X_t - \beta Y_{t-1}$$
Where the reverb cancellation is characterised by:
In embodiments, the value may be passed to one or more of a plurality of decoding engines, or all of a plurality of decoding engines. The decoding engines may be voters as defined in UK Patent Application No. 1617408.8 and a process for decoding the signal may proceed as outlined in that document. For example, each of the voters may be tuned to decode the value in a different way (for example, assuming different acoustic characteristics of the environment) and the decoded value may be decided as that which the most voters agree with.
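The agreement rule described above (the decoded value being that which the most voters agree with) can be sketched as a simple majority vote. This is a minimal sketch and not the process of the referenced UK application: the names `make_peak_voter` and `decode_by_vote`, and the idea that each voter is a threshold-based peak picker, are hypothetical and chosen only for illustration.

```python
from collections import Counter
from typing import Callable, List, Optional, Sequence

# A voter maps one frame of per-bin magnitudes to a symbol index, or None.
Voter = Callable[[Sequence[float]], Optional[int]]


def make_peak_voter(threshold: float) -> Voter:
    """Hypothetical voter: pick the strongest bin if it exceeds a threshold."""
    def vote(frame: Sequence[float]) -> Optional[int]:
        if not frame:
            return None
        best = max(range(len(frame)), key=lambda b: frame[b])
        return best if frame[best] >= threshold else None
    return vote


def decode_by_vote(frame: Sequence[float],
                   voters: Sequence[Voter]) -> Optional[int]:
    """Return the symbol chosen by the most voters, ignoring abstentions."""
    ballots: List[int] = []
    for voter in voters:
        choice = voter(frame)
        if choice is not None:
            ballots.append(choice)
    if not ballots:
        return None
    # Ties are resolved in favour of the symbol that was voted for first.
    symbol, _count = Counter(ballots).most_common(1)[0]
    return symbol


voters = [make_peak_voter(0.5), make_peak_voter(0.8), make_peak_voter(0.95)]
decoded = decode_by_vote([0.1, 0.2, 0.9, 0.3], voters)
```

In this sketch the differently tuned thresholds stand in for voters that assume different acoustic characteristics of the environment.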
In one embodiment, as illustrated in
In one embodiment, values of α and β can be altered dynamically to increase the system's efficacy during operation or due to changing environmental factors such as different locations or changes to a single space which may affect its reverberation characteristics, such as the materials in it or its layout. Parameters α and β may be changed, for example, by observing the sound energy decay following an encoded tone of known length, or by applying successive values of each and observing and maximising the decoder's tone detection confidence.
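The per-frame recurrence given above for the smoothed history Y and the decoder input Z can be sketched as follows. This is a minimal sketch under stated assumptions: the function name `cancel_reverb` is hypothetical, the history is assumed to be zero before the first frame, α is supplied per FFT bin, and negative outputs are left unclamped because the description does not say how they are handled.

```python
from typing import List, Sequence


def cancel_reverb(frames: Sequence[Sequence[float]],
                  alpha: Sequence[float],
                  beta: float) -> List[List[float]]:
    """Apply Y_t = a*Y_{t-1} + (1 - a)*X_t and Z_t = X_t - beta*Y_{t-1}.

    frames holds one list of FFT bin magnitudes (X_t) per frame;
    alpha holds one smoothing value per bin.
    """
    n_bins = len(alpha)
    y_prev = [0.0] * n_bins  # Y_{t-1}; assumed zero before the first frame
    output: List[List[float]] = []
    for x in frames:
        if len(x) != n_bins:
            raise ValueError("every frame needs one magnitude per bin")
        # Z_t uses the history from the previous frame, so compute it first.
        z = [x[b] - beta * y_prev[b] for b in range(n_bins)]
        # Then fold the current magnitudes into the history for the next frame.
        y_prev = [alpha[b] * y_prev[b] + (1.0 - alpha[b]) * x[b]
                  for b in range(n_bins)]
        output.append(z)
    return output


# A single bin whose magnitude decays like a reverberant tail after a tone.
cleaned = cancel_reverb([[1.0], [0.5], [0.25]], alpha=[0.5], beta=0.5)
```

In this example the decaying tail is pushed towards zero more quickly than in the raw magnitudes, which is the intended effect of compounding the decay. The dynamic adjustment described above could be layered on top by re-running such a function with candidate values of α and β and keeping the pair that gives the highest tone detection confidence.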
Referring to
In step 301, the data may be encoded into an audio signal using a sequence of tones. The encoding format of the signal may include a header, error correction and a payload. An example of an encoding format is described in relation to
The audio signal may be configured to minimise environmental interference. The environmental interference may be that which would be generated by acoustic transmission of the signal by the speaker (e.g. 102). The environmental interference may be non-direct acoustic energies caused by this transmission such as reverberation.
The signal may be configured to minimise environmental interference by modifying characteristics of the tones and/or sequence of tones based upon, for example, predicting the environmental interference that would be caused to the audio signal when acoustically received by a receiver (e.g. at a microphone 104). Characteristics of the tones that may be modified may include tone length, tone waveform (e.g. sharp edges to the waveform envelope), tone frequencies (e.g. avoiding resonant frequencies for the environment) or multi-frequency tones. Characteristics of the sequence that may be modified may include tone order (e.g. ordering a high frequency tone before a low frequency tone, and preventing proximity of the same or similar tones in the sequence) and gaps between tones in the sequence.
In embodiments, at least a portion of the audio signal is configured to sequence adjacent tones from high to low, so that frequency tails from a preceding tone are less likely to overlap with a subsequent tone in a reverberant space. In one example, the initial portion of the audio signal is configured in this way. This initial portion may comprise the header or a portion of the header. This portion may be identical for every signal and constitute the “front-door” sound for the protocol.
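The two sequence-level measures described above (ordering tones from high to low, and preventing proximity of the same or similar tones) can be sketched as follows. The function names, the example frequencies and the minimum separation value are hypothetical and serve only as an illustration.

```python
from typing import List, Sequence


def order_high_to_low(freqs_hz: Sequence[float]) -> List[float]:
    """Arrange a set of header tones from the highest frequency to the lowest."""
    return sorted(freqs_hz, reverse=True)


def has_adjacent_similar(freqs_hz: Sequence[float],
                         min_separation_hz: float) -> bool:
    """Report whether any two consecutive tones are closer than the separation."""
    return any(abs(a - b) < min_separation_hz
               for a, b in zip(freqs_hz, freqs_hz[1:]))


header = order_high_to_low([1760.0, 2093.0, 1864.7])
clash = has_adjacent_similar([2000.0, 2010.0, 1500.0], min_separation_hz=50.0)
no_clash = has_adjacent_similar([2000.0, 1500.0, 2010.0], min_separation_hz=50.0)
```

An encoder could use a check such as `has_adjacent_similar` to reject or re-map a candidate sequence before transmission; how that re-mapping is done is not specified here.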
In embodiments as shown in
In embodiments, at least a portion of the audio signal is configured to sharpen the amplitude envelopes of the tone signals within the portion. This may be done by altering the amplitude envelope of each note within the signal, typically by using very short duration attack and decay phases such that the note's total acoustic energy is maximised. Typically also a note will have a short amplitude decay such that the end of the note is clearly defined to have occurred at a specific time.
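A minimal sketch of these encoder-side measures is given below: each tone has very short attack and decay ramps, silence is inserted between tones, and the tone length is proportional to the reverberation time, as recited in the claims. The function names, the linear ramp shape, and the default values for `length_factor`, `gap_s` and `ramp_s` are assumptions made for illustration and are not taken from the description.

```python
import math
from typing import List, Sequence


def synthesise_tone(freq_hz: float, n_samples: int, ramp_samples: int,
                    sample_rate: int) -> List[float]:
    """Generate a sine tone with short linear attack and decay ramps."""
    samples = []
    for n in range(n_samples):
        # Gain rises over the first ramp_samples and falls over the last ones.
        gain = min(1.0,
                   (n + 1) / (ramp_samples + 1),
                   (n_samples - n) / (ramp_samples + 1))
        samples.append(gain * math.sin(2.0 * math.pi * freq_hz * n / sample_rate))
    return samples


def encode_sequence(freqs_hz: Sequence[float], reverb_time_s: float,
                    sample_rate: int = 44100, length_factor: float = 0.25,
                    gap_s: float = 0.05, ramp_s: float = 0.002) -> List[float]:
    """Concatenate tones whose length scales with the room's reverberation time."""
    tone_len = int(round(length_factor * reverb_time_s * sample_rate))
    gap_len = int(round(gap_s * sample_rate))
    ramp_len = int(round(ramp_s * sample_rate))
    signal: List[float] = []
    for i, freq in enumerate(freqs_hz):
        if i > 0:
            signal.extend([0.0] * gap_len)  # silence between consecutive tones
        signal.extend(synthesise_tone(freq, tone_len, ramp_len, sample_rate))
    return signal


signal = encode_sequence([2000.0, 1500.0, 1000.0], reverb_time_s=0.5,
                         sample_rate=8000)
```

With these example values each tone is 1000 samples long and each gap is 400 samples; doubling `reverb_time_s` doubles the tone length while leaving the gap unchanged.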
In embodiments, several steps at the encoding side of the transmission may be made to make the transmission more resilient to reverberation, by altering the signal to avoid temporal effects (acoustic energy remaining after an encoded tone) and spectral effects (specific frequencies being prone to resonance, for example at room modes).
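As an illustration of the spectral measure, the sketch below removes candidate tone frequencies that fall close to the axial modes of a rectangular room, using the standard relation f = n·c/(2L) for a room dimension L. The description does not state how resonant frequencies are identified, so the restriction to axial modes, the function names, the guard band and the room dimension used are all assumptions.

```python
from typing import List, Sequence


def axial_modes(dimensions_m: Sequence[float], max_hz: float,
                speed_of_sound: float = 343.0) -> List[float]:
    """List axial mode frequencies n*c/(2L) up to max_hz for each dimension."""
    modes = []
    for length in dimensions_m:
        n = 1
        while n * speed_of_sound / (2.0 * length) <= max_hz:
            modes.append(n * speed_of_sound / (2.0 * length))
            n += 1
    return sorted(modes)


def avoid_modes(candidates_hz: Sequence[float], modes_hz: Sequence[float],
                guard_hz: float) -> List[float]:
    """Keep only candidate frequencies further than guard_hz from every mode."""
    return [f for f in candidates_hz
            if all(abs(f - m) > guard_hz for m in modes_hz)]


# A single 3.43 m dimension gives modes near 50 Hz, 100 Hz, 150 Hz, ...
modes = axial_modes([3.43], max_hz=180.0)
usable = avoid_modes([100.0, 125.0, 149.0], modes, guard_hz=5.0)
```

Here only the 125 Hz candidate survives, since the other two lie within the guard band of a mode. In practice the resonant frequencies might instead be measured in the space rather than predicted from its dimensions.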
Referring to
Referring to
Potential advantages of some embodiments of the present invention include:
While the present invention has been illustrated by the description of the embodiments thereof, and while the embodiments have been described in considerable detail, it is not the intention of the applicant to restrict or in any way limit the scope of the appended claims to such detail. Additional advantages and modifications will readily appear to those skilled in the art. Therefore, the invention in its broader aspects is not limited to the specific details, representative apparatus and method, and illustrative examples shown and described. Accordingly, departures may be made from such details without departure from the spirit or scope of applicant's general inventive concept.
Jones, Daniel John, Nesfield, James Andrew
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Oct 13 2017 | Sonos Experience Limited | (assignment on the face of the patent) | / | |||
Apr 26 2019 | JONES, DANIEL JOHN | ASIO LTD | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 049460 | /0033 | |
Apr 26 2019 | NESFIELD, JAMES ANDREW | ASIO LTD | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 049460 | /0033 | |
Apr 29 2019 | ASIO LTD | ASIO LTD | CHANGE OF ADDRESS | 049462 | /0114 | |
Jan 05 2020 | ASIO LTD | Sonos Experience Limited | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 052896 | /0938 |
Date | Maintenance Fee Events |
Apr 15 2019 | BIG: Entity status set to Undiscounted (note the period is included in the code). |
Apr 22 2019 | SMAL: Entity status set to Small. |
Jun 12 2020 | BIG: Entity status set to Undiscounted (note the period is included in the code). |
Date | Maintenance Schedule |
Aug 09 2025 | 4 years fee payment window open |
Feb 09 2026 | 6 months grace period start (w surcharge) |
Aug 09 2026 | patent expiry (for year 4) |
Aug 09 2028 | 2 years to revive unintentionally abandoned end. (for year 4) |
Aug 09 2029 | 8 years fee payment window open |
Feb 09 2030 | 6 months grace period start (w surcharge) |
Aug 09 2030 | patent expiry (for year 8) |
Aug 09 2032 | 2 years to revive unintentionally abandoned end. (for year 8) |
Aug 09 2033 | 12 years fee payment window open |
Feb 09 2034 | 6 months grace period start (w surcharge) |
Aug 09 2034 | patent expiry (for year 12) |
Aug 09 2036 | 2 years to revive unintentionally abandoned end. (for year 12) |