Method for performing time-scale modification of speech information or speech signals

Method for performing time-scale modification of speech information or speech signals
US4864620

Pre-recorded speech is played back at a different rate, without pitch change. Adjacent signal segments are combined with best match processing. Method and apparatus process time domain speech signals containing speech information, the rate of reproduction of which is to be varied without changing pitch, wherein the input signal is processed by capturing input time domain speech samples in frames wherein the number of samples per frame is a function of a desired speech change factor, forming blocks from the frames, additively cross correlating input blocks with prior-processed or output blocks, preferably by means of an average magnitude difference function, to obtain a time relation of best match for the rate of reproduction, adding consecutive input and output blocks at the point of maximum correlation, and applying a window function between the overlapping portions of the output block and the input block to obtain a new output block. The method does not require multiplication or division. Relatively smooth transitions between superimposed segments of speech which become output blocks are realized by applying a graduated weighting.

PTO Wrapper PDF
Dossier Espace Google

Patent 4864620
Priority Dec 21 1987
Filed Feb 03 1988
Issued Sep 05 1989
Expiry Feb 03 2008
Inventors Bialick, L…
Assg.orig The DSP Gr…
Assg.curr DSP GROUP,…
Entity Small
Referenced by 99
References 4
Maint.: all paid

BACKGROUND OF THE IN…
SUMMARY OF THE INVEN…
DESCRIPTION OF DRAWI…
DESCRIPTION OF A PRE…

1. A method for processing time domain speech signals containing speech information to vary the rate of reproduction thereof without change of pitch comprising:

superimposing partially overlapping blocks of speech samples in a manner such that periodicity of pitch is maintained, the extent of superimposition being a function of a desired variance in rate of reproduction of said speech information;

applying an average magnitude difference of function to the overlapping blocks at each superimposition in a search range to determine a best match;

fixing a precise superimposition of the overlapping blocks in accordance with the best match; and

applying a smoothed weighted function to the superimposed portion of the overlapping blocks.

6. A system for processing time domain speech signals containing speech information to vary rate of reproduction thereof without changing pitch comprising:

means for superimposing partially overlapping blocks of speech samples in a manner such that periodicity of pitch is maintained, the extent of superimposition being a function of a desired variance in rate of reproduction of said speech information;

means for applying an average magnitude difference function to the overlapping blocks at each superimposition in a search range to determine a best match;

means for fixing a precise superimposition of the overlapping blocks in accordance with the best match; and

means for applying a smoothed weighting function to the superimposed portion of the overlapping blocks.

5. A method for varying the rate of reproduction of a time domain speech signal containing speech information without changing pitch comprising the steps for each frame of speech of:

capturing input time domain speech samples in a unit defined by said frame at a fixed sample rate, the number of samples per frame being a function of a desired speech change factor;

forming an input block from at least a portion of a first said frame;

comparing said input block with a prior-processed block by means of a multiplierless average magnitude difference function to obtain a time relation of maximum correlation at a preselected rate of reproduction indicated by a point in time where the average magnitude difference between said input block and said prior-processed block is of minimum magnitude;

adding said input block to said prior-processed block in overlap at said point of maximum correlation to obtain an intermediate block having a common portion between said input block and said prior processed block;

weighting said common portion by a smoothing window function to obtain an output block for output as well as for use as a next subsequent prior-processed block with a next subsequent input block; and

providing with said output block to an output utilization means for reproduction of a segment of said speech signal at a rate differing from said input rate and without a change of pitch.

3. A method for varying rate of reproduction of speech information comprising the steps, for each frame of speech information, of:

receiving speech samples representative of time domain speech information sufficient to form a frame, the number of speech samples being determined by a desired rate of reproduction, and duration of the frame being fixed;

placing said speech samples in an input block having a first portion and at least a second portion;

establishing a first search range and a second search range on an output block, specifically a high search range and a low search range, an output block being a block which was processed directly prior to said frame;

designating a first portion of the samples of said input block as a high search representation;

additively comparing between said input block and said output block for all samples between said low search range and said high search range according to an average magnitude difference function to obtain a point of maximum cross correlation of said output block with said input block;

at the point of maximum cross correlation; combining overlapping segments of said input block with said output block according to a preselected smoothing weighting function to form a next output block; and

providing said next output block as information to an output utilization means, said next output block also becoming said output block for a next iteration.

2. The method according to claim 1 wherein said superimposing step comprises defining a search range over which said best match is sought, said search range being a function of pitch frequency of said speech information.

4. The method according to claim 1 wherein said smoothing weighting function is a ramped window function having a maximum combination at commencement of said input block and minimum combination at termination of said output block.

7. The system according to claim 6 wherein said superimposing means includes means for applying a smoothed weighting function to the superimposed portion of the overlapping blocks.

8. The system according to claim 7 wherein said superimposing means further comprises means defining a search range over which said best match is sought, said search range being a function of pitch frequency of said speech information.

9. The system according to claim 6 wherein said superimposing means comprises means defining a search range over which said best match is sought, said search range being a function of pitch frequency of said speech information.

BACKGROUND OF THE INVENTION

This invention relates to digital signal processing and more particularly to time domain digital speech processing in order to vary the rate of reproduction of speech without changing pitch.

In recent years various techniques have been developed for achieving time compression/expansion of audio information, particularly speech information. In order to utilize time compression or expansion effectively, where the compression or expansion factor is significant, some mechanism is necessary to correct for changes in pitch which would normally follow a direct application of acceleration or deceleration techniques. Acceleration or deceleration of recorded speech is easily achieved by speeding or slowing the rate of reproduction, which in turn raises or lowers pitch, as is expected.

Time compression and expansion of speech is useful in many applications. Time compression allows matching of speech information to a desired playback time. Time expansion is particularly useful for example, in dictation equipment to speed up playback or in foreign language learning situations to slow down playback to improve comprehension, which may be difficult or otherwise impaired.

Numerous techniques have been developed to achieve time compression and/or expansion, particularly techniques which manipulate analog signal representations. Of the various prior art techniques, the following patents or publications are representative:

Roucos and Wilgus, "High Quality Time-Scale Modification for Speech," ICASSP 85. Proceedings of the IEEE International Conference of Acoustics, Speech, and Signal Processing, pp. 493-6, Volume 2, 1985 (26-29 March 1985), IEEE. This relatively recent paper represents a development in the algorithms for reproducing speech using digital techniques. The research group is Bolt, Beranek & Newman Inc. of Cambridge, Mass.

Makhoul, J. and El-Jaroudi, "Time-Scale Modification in Medium to Low Rate Speech Coding," ICASSP 86. Proceedings of the IEEE International Conference of Acoustics, Speech, and Signal Processing pp. 1705-1708, Volume 3, 1986, (Apr. 7-11, 1986), IEEE. This paper produced by the same research group related to the foregoing describes further development in digital signal processing techniques for rate modifying speech.

These two papers relate to description and implementation of the synchronous-overlap-and-add method of time-scale modification. The algorithm described therein allows arbitrary linear or nonlinear scaling of the time axis using a modified overlap-and-add procedure operating on the time domain waveform. The Makhoul paper describes the implementation of a technique involving generalized cross-correlaton between a normalized source signal (y(n)) and a normalized derived signal (x(n)). The technique was originally described in the Roucos paper.

Asada et al., U.S. Pat. No. 4,435,832 issued Mar. 6, 1984, to Hitachi, describes a speech synthesizer wherein LPC (linear predictive coding) techniques are employed to synthesize speech. Control is exercised over the rate of speech by lengthening or shortening the time interval of interpolation between the fetching of each of the LPC parameters to synthesize the speech. This technology is essentially unrelated to the present invention, since the present invention is unrelated to synthesized speech or parametrically-defined speech.

Klasco et al., U.S. Pat. No. 4,406,001 issued Sept. 20, 1983, to The Variable Speech Control Company of San Francisco, describes a time compression/expansion audio reproduction system of the type which relies on analog circuitry. It provides speech correction by repetitive variable time delay achieved by separating the reproduced signal from a recording into components which are separately delayed. The signal is separated into contiguous frequency bands, each of which is delayed synchronously. The signal is then recombined after delay, and low-pass filtering techniques are employed to remove high-frequency components introduced into the speech components by the signal processing technique. This technology is readily distinguishable from the present invention for at least two reasons. First, this technology relies on analog methods, whereas the present invention is digital in nature. Second, the present invention does not require filtering of speech components. Other distinctions will also be apparent to those of ordinary skill in this art.

Brantingham et al., U.S. Pat. No. 4,209,844, issued June 24, 1980, to Texas Instruments, describes a digital filter technique using a form of linear predictive coding (LPC). Specifically, the patent describes an invention embodied in a device implementing a lattice-type filter for generating complex waveforms suitable for implementation in semiconductor device technology. The invention appears to be unsuited to time-domain speech processing and further is not applicable to time scale modification in the time domain.

Kohut et al., U.S. Pat. No. 4,022,974, issued May 10, 1987, to Bell Telephone Laboratories, describes a predictive speech synthesizer having the capability of varying speech without changing pitch. The Bell technique is substantially unrelated to the present invention, since it relates primarily to parametric speech and does not deal with a actual time domain speech signal.

What is needed is a simple yet effective digital technique for providing time scale modification of real time or near real time speech signals.

SUMMARY OF THE INVENTION

According to the invention, method and apparatus are provided to process time domain speech signal containing speech information, the rate of reproduction of which is to be varied without changing pitch. The basic process comprises superimposing partially overlapping blocks of speech samples in a manner such that the pitch periodicity is maintained. The extent of superimposition is a function of the desired increase or decrease , or variance, in the time scale of the speech. In accordance with a preferred embodiment of the invention, maintenance of speech periodicity is achieved by fixing the precise superimposition in the time domain such that the superimposed waveforms achieve a best match using a technique which does not require multiplication or division.

Relatively smooth transition between superimposed speech signals are realized by applying a graduated weighting thereto.

In accordance with a preferred embodiment of the invention, if the extent of superimposition exceeds the amount of overlap, an accelerated speech output is provided, and if the extent of superimposition is less than the amount of overlap, a decelerated speech output is provided.

To minimize required computational load, the search range, that is, the range over which superimposition is varied in order to achieve a best match between speech segments, is selected as a function of pitch, thus ensuring that a sufficient number of samples are taken to assure that pitch pulses are contained in a sample set without requiring superfluous computations.

A specific embodiment of the invention allows for speech expansion of up to 150% and speech compression to as little as 40% of the duration of the source.

The method according to the invention may be incorporated into an embodiment using programmable digital signal processing hardware, such as a Texas Instruments TMS 320 Series device. Therefore it is not necessary to describe such devices in detail, since the combination of such components with programs in general are known to those of skill in the art. The application of such devices in accordance with the invention is nevertheless not apparent from the devices.

The method in accordance with the invention is substantially simpler, faster and more efficient than other methods which might be considered for purposes similar to the intended application. As one consequence, the method in accordance with the invention is more easily adapted to implementation in Very Large Scale Integration (VLSI) technology.

The method in accordance with the invention makes use of a waveform-segments-matching technique which takes advantage of the periodic nature of the signals produced by speech, and more specifically the existence of pitch pulses within a speech signal. Hence, in accordance with the invention, use is made of the maximum value of the pitch period of the input speech to reduce complexity, a technique not used heretofore.

The invention will be better understood by reference to the following detailed description in connection with the accompanying drawings.

DESCRIPTION OF DRAWINGS

FIG. 1 is a block diagram of a device which operates in accordance with the invention.

FIG. 2 is a flow chart of a method in accordance with the invention.

FIGS. 3A through 3D are illustrations showing operation of the method and apparatus according to the invention.

DESCRIPTION OF A PREFERRED EMBODIMENT

Referring to FIG. 1, a block diagram is shown of a signal processing apparatus 10 illustrating a typical environment of apparatus in accordance with the invention. Many variations will be apparent to those of ordinary skill in this art, including such variations as to the type of input devices and output components.

In the illustrative embodiment, the signal processing apparatus 10 includes a time-domain speech sampling means 12, the input port 11 of which receives live real-time or substantially real-time analog speech signals, and the output port 13 of which is coupled to digital storage means 14, such as a computer memory or set of digital storage registers. The digital storage means 14 has a digital signal output which is coupled to a digital signal processing means 16, such as a microcomputer constructed around a programmable microprocessor or special purpose digital signal processing device.

A suitable microprocessor is a Motorola 68000 series microprocessor or a Texas Instruments TMS 32020 DSP Chip preprogrammed to receive digital input data temporarily stored in the digital storage means 16, to process the digital input data in accordance with the method of the invention and to provide as a digital output signal digital output data to an output means such as a digital-to-analog converter means 18.

The digital-to-analog converter means 18 reconstructs an analog signal for audio reproduction and therefore has an output terminal which is coupled to an audio amplifier means 20 or the like, such as an analog recorder. In addition, output of the digital signal processor 16 is provided to interim storage means 22 which provides a second input to the digital signal processing means 16 for use in comparing the resultant digital output with subsequently received speech segments (frames or portions of frames) as explained hereinbelow.

Referring to FIG. 2, there is shown a flow chart for the relevant portion of a computer program for processing digitized input speech information in accordance with the invention. FIGS. 3A-3D, which are to be viewed as one diagram in connection with FIG. 2, illustrate the time relationship among block of speech samples. These blocks may represent the content of registers or temporary storage locations, each element of which contains data representing the amplitude of a given speech sample.

Phase information is for the most part ignored or otherwise only indirectly accounted for by the method according to the invention. It is known that the human ear is substantially immune to inaccuracies in phase information in speech.

In accordance with the invention, incoming speech is sampled at a selected sampling rate, and the samples are combined into blocks, herein termed "input blocks," the samples in each input block representing the amplitude of the speech i§ signal for such sample. Each input block overlaps the preceding input block by a predetermined number of samples. The number of samples by which each successive input block exceeds or extends beyond the preceding input block is termed the overlap value or OV and is a function of the sampling rate and of the number of samples contained in an input block.

Normally, the sample values are normalized to a range suitable for subsequent processing. (Automatic gain control may be employed independently of the normalized values.) In a specific embodiment, a maximum pitch period of no more than 17 ms is assumed, and each input block contains a uniform number of samples, selected to be between 80 and 120, representing a nominal 10-15 ms segment of speech information. A 10 ms segment is considered time invariant for the purpose of speech, which has a nominal spectrum of information of 200 Hz to 4000 Hz.

The method of the invention normally begins with initializing of variables and memory locations, which are set in accordance with preselected initializing values (Step A). The values to be initialized include user-selectable parameters, such as the number of samples which will be contained in each input block, the value of overlap value OV and the speed control value SCV, which indicates the amount by which it is desired to speed up or slow down speech (Step B).

The speed control value SCV is typically expressed as a number of samples. If the SCV is selected to exceed the overlap value OV, the output signal will be slowed relative to the input signal. If the SCV is selected to be less than the OV value, the output signal will be speeded up relative to the input signal.

FIG. 3A illustrates three successive input blocks on a continuing time scale, illustrating the overlapping thereof. In accordance with the present invention, an output block is defined and typically comprises an input block of speech samples which is stored in storage means 22. A superimposition reference pointer P is placed at a location along the output block in accordance with the SCV value (Step C).

FIG. 3B illustrates the pointer P at a location on an output block which produces speeding up of the output speech. Were the pointer P at the OV line, the output speech would be provided at exactly the same speed as the input speech.

A search range of a selected number of samples SR to either side of the pointer is selected as a function of the pitch frequency of the speech (Step D). The search range is requited to be approximately equal to the maximum pitch frequency. The selection of a search range is a particular feature of the present invention, as it enables preservation of pitch without requiring superfluous computations which require excess computing capability and computation time.

An input block, such as input block I, is defined (Step E). The first N samples of the input block (FIG. 3A) then undergo best fit matching to the portion of the output block within the above-defined search range, preferably by means of an Average Magnitude Difference Function (AMDF) adapted to the present invention, in order that the pitch pulses of the input block and the output block match as nearly as possible. Once the desired match has been found the input and output blocks are superimposed (FIG. 3C) at the location providing the best match, thereby preserving the pitch without creating undesired discontinuity between output blocks (Step F). In accordance with a preferred embodiment of the invention, the AMDF calculates the absolute value of the difference between the input block and the output block for each of a plurality of different possible superimpositions within the predetermined search range, thus identifying the superimposition having the lowest difference so that it may be selected for use in the subsequent processes. Use of the AMDF is a particular feature of the invention which represents a significant advance over the art and a departure from the prior art which employs cross-correlation functions. Such prior art functions involve multiplications which require substantial computation capabilities and computation time. Use of the AMDF increases capabilities without sacrificing computation power, which for example gives the method according to the invention an inherent bandwidth advantage over the prior art. A description of an Average Magnitude Difference Function suitable for implementation in the present invention is found in Digital Processing of Speech Signals, by L. R. Rabiner and R. W. Schafer, pp. 149-150 (Prentice-Hall, 1978), the content of which is incorporated herein by reference.

The superimposed portions of the output block and the input block are combined by a desired weighting arrangement or factor W (FIG. 3C) so as to provide a smooth transition from the sample values of the output block to those of the input block (Steps G and H). A substantially linear ramp is a suitable weighting factor, as illustrated in FIG. 3C.

The weighted combination of the input block with the overlapping portion of the output block becomes a new or next output block, herein indicated as output block II and shown in FIG. 3D. Output block II is stored in storage means 22.

According to the invention, that portion of the output block I which did not overlap the input block is output for the DAC 18 (FIG. 1) (Step I).

It is to be appreciated that the difference between the location of the pointer and the location at which superimposition begins is a potential source of distortions if combined over several output blocks. Accordingly, signal processor 16 operates to store the information on this difference (Step J) and to position the pointer on the subsequent output block so as to compensate for this difference.

Reference is made to the Appendix for a detailed technical description illustrating a specific embodiment of the invention.

The invention has now been explained with reference to specific embodiments. Other embodiments will be apparent to those of ordinary skill in the relevant art. It is therefore not intended that the invention be limited, except as indicated by the appended claims. ##SPC1##

INVENTORS:

Bialick, Leonid

THIS PATENT IS REFERENCED BY THESE PATENTS:

Patent	Priority	Assignee	Title
10134409,	Apr 13 2001	Dolby Laboratories Licensing Corporation	Segmenting audio signals into auditory events
5129036,	Mar 30 1990	GOOGLE LLC	Broadcast digital sound processing system
5175769,	Jul 23 1991	Virentem Ventures, LLC	Method for time-scale modification of signals
5216744,	Mar 21 1991	NICE SYSTEMS, INC	Time scale modification of speech signals
5285499,	Apr 27 1993	ED0 RECONNAISSANCE & SURVEILLANCE SYSTEMS	Ultrasonic frequency expansion processor
5303326,	Mar 30 1990	GOOGLE LLC	Broadcast digital sound processing system
5341432,	Oct 06 1989	Matsushita Electric Industrial Co., Ltd.	Apparatus and method for performing speech rate modification and improved fidelity
5353374,	Oct 19 1992	Lockheed Martin Corporation	Low bit rate voice transmission for use in a noisy environment
5444816,	Feb 23 1990	Universite de Sherbrooke	Dynamic codebook for efficient speech coding based on algebraic codes
5479564,	Aug 09 1991	Nuance Communications, Inc	Method and apparatus for manipulating pitch and/or duration of a signal
5491774,	Apr 19 1994	E DIGITAL CORPORATION	Handheld record and playback device with flash memory
5611002,	Aug 09 1991	Nuance Communications, Inc	Method and apparatus for manipulating an input signal to form an output signal having a different length
5630013,	Jan 25 1993	Matsushita Electric Industrial Co., Ltd.	Method of and apparatus for performing time-scale modification of speech signals
5694521,	Jan 11 1995	O HEARN AUDIO LLC	Variable speed playback system
5699482,	Feb 23 1990	Universite de Sherbrooke	Fast sparse-algebraic-codebook search for efficient speech coding
5701392,	Feb 23 1990	Universite de Sherbrooke	Depth-first algebraic-codebook search for fast coding of speech
5717823,	Apr 14 1994	THE CHASE MANHATTAN BANK, AS COLLATERAL AGENT	Speech-rate modification for linear-prediction based analysis-by-synthesis speech coders
5729657,	Nov 25 1993	Intellectual Ventures I LLC	Time compression/expansion of phonemes based on the information carrying elements of the phonemes
5751901,	Jul 31 1996	Qualcomm Incorporated	Method for searching an excitation codebook in a code excited linear prediction (CELP) coder
5754976,	Feb 23 1990	Universite de Sherbrooke	Algebraic codebook with signal-selected pulse amplitude/position combinations for fast coding of speech
5774837,	Sep 13 1995	VOXWARE, INC	Speech coding system and method using voicing probability determination
5787387,	Jul 11 1994	GOOGLE LLC	Harmonic adaptive speech coding method and system
5828995,	Feb 28 1995	Motorola, Inc.	Method and apparatus for intelligible fast forward and reverse playback of time-scale compressed voice messages
5832442,	Jun 23 1995	Transpacific IP Ltd	High-effeciency algorithms using minimum mean absolute error splicing for pitch and rate modification of audio signals
5842172,	Apr 21 1995	TensorTech Corporation	Method and apparatus for modifying the play time of digital audio tracks
5890108,	Sep 13 1995	Voxware, Inc.	Low bit-rate speech coding system and method using voicing probability determination
6178405,	Nov 18 1996	INNOMEDIA PTE , LTD	Concatenation compression method
6182042,	Jul 07 1998	Creative Technology, Ltd	Sound modification employing spectral warping techniques
6223153,	Sep 30 1995	IBM Corporation	Variation in playback speed of a stored audio data signal encoded using a history based encoding technique
6232540,	May 06 1999	Yamaha Corp.	Time-scale modification method and apparatus for rhythm source signals
6246752,	Jun 08 1999	NICE SYSTEMS, INC	System and method for data recording
6249570,	Jun 08 1999	NICE SYSTEMS, INC	System and method for recording and storing telephone call information
6252946,	Jun 08 1999	NICE SYSTEMS, INC	System and method for integrating call record information
6252947,	Jun 08 1999	NICE SYSTEMS, INC	System and method for data recording and playback
6360198,	Sep 12 1997	Nippon Hoso Kyokai	Audio processing method, audio processing apparatus, and recording reproduction apparatus capable of outputting voice having regular pitch regardless of reproduction speed
6418218,	Jun 02 1999	GMAC COMMERCIAL FINANCE LLC, AS AGENT	System and method for multi-stage data logging
6496794,	Nov 22 1999	Google Technology Holdings LLC	Method and apparatus for seamless multi-rate speech coding
6718309,	Jul 26 2000	SSI Corporation	Continuously variable time scale modification of digital audio signals
6728345,	Jun 08 1999	NICE SYSTEMS, INC	System and method for recording and storing telephone call information
6775372,	Jun 02 1999	NICE SYSTEMS, INC	System and method for multi-stage data logging
6785369,	Jun 08 1999	NICE SYSTEMS, INC	System and method for data recording and playback
6873954,	Sep 09 1999	Telefonaktiebolaget LM Ericsson (publ)	Method and apparatus in a telecommunications system
6901209,	Oct 12 1994	PIXEL INSTRUMENTS CORP	Program viewing apparatus and method
6937706,	Jun 08 1999	NICE SYSTEMS, INC	System and method for data recording
7283954,	Apr 13 2001	Dolby Laboratories Licensing Corporation	Comparing audio using characterizations based on auditory events
7313519,	May 10 2001	Dolby Laboratories Licensing Corporation	Transient performance of low bit rate audio coding systems by reducing pre-noise
7366659,	Jun 07 2002	GOOGLE LLC	Methods and devices for selectively generating time-scaled sound signals
7426221,	Feb 04 2003	Cisco Technology, Inc.	Pitch invariant synchronization of audio playout rates
7461002,	Apr 13 2001	Dolby Laboratories Licensing Corporation	Method for time aligning audio signals using characterizations based on auditory events
7524191,	Sep 02 2003	ROSETTA STONE LLC	System and method for language instruction
7610205,	Apr 13 2001	Dolby Laboratories Licensing Corporation	High quality time-scaling and pitch-scaling of audio signals
7711123,	Apr 13 2001	Dolby Laboratories Licensing Corporation	Segmenting audio signals into auditory events
7751804,	Jul 23 2004	WIDEORBIT OPCO INC ; WideOrbit LLC	Dynamic creation, selection, and scheduling of radio frequency communications
7826444,	Apr 13 2007	WIDEORBIT OPCO INC ; WideOrbit LLC	Leader and follower broadcast stations
7853447,	Dec 08 2006	Micro-Star Int'l Co., Ltd.	Method for varying speech speed
7889724,	Apr 13 2007	WIDEORBIT OPCO INC ; WideOrbit LLC	Multi-station media controller
7899678,	Jan 11 2007		Fast time-scale modification of digital signals using a directed search technique
7925201,	Apr 13 2007	WIDEORBIT OPCO INC ; WideOrbit LLC	Sharing media content among families of broadcast stations
8143620,	Dec 21 2007	SAMSUNG ELECTRONICS CO , LTD	System and method for adaptive classification of audio sources
8150065,	May 25 2006	SAMSUNG ELECTRONICS CO , LTD	System and method for processing an audio signal
8165888,	Mar 16 2007	The University of Electro-Communications; Funai Electric Co., Ltd.	Reproducing apparatus
8180064,	Dec 21 2007	SAMSUNG ELECTRONICS CO , LTD	System and method for providing voice equalization
8185929,	Oct 12 1994	PIXEL INSTRUMENTS CORP	Program viewing apparatus and method
8189766,	Jul 26 2007	SAMSUNG ELECTRONICS CO , LTD	System and method for blind subband acoustic echo cancellation postfiltering
8194880,	Jan 30 2006	SAMSUNG ELECTRONICS CO , LTD	System and method for utilizing omni-directional microphones for speech enhancement
8194882,	Feb 29 2008	SAMSUNG ELECTRONICS CO , LTD	System and method for providing single microphone noise suppression fallback
8195472,	Apr 13 2001	Dolby Laboratories Licensing Corporation	High quality time-scaling and pitch-scaling of audio signals
8204252,	Oct 10 2006	SAMSUNG ELECTRONICS CO , LTD	System and method for providing close microphone adaptive array processing
8204253,	Jun 30 2008	SAMSUNG ELECTRONICS CO , LTD	Self calibration of audio device
8259926,	Feb 23 2007	SAMSUNG ELECTRONICS CO , LTD	System and method for 2-channel and 3-channel acoustic echo cancellation
8345890,	Jan 05 2006	SAMSUNG ELECTRONICS CO , LTD	System and method for utilizing inter-microphone level differences for speech enhancement
8355511,	Mar 18 2008	SAMSUNG ELECTRONICS CO , LTD	System and method for envelope-based acoustic echo cancellation
8428427,	Oct 12 1994	PIXEL INSTRUMENTS CORP	Television program transmission, storage and recovery with audio and video synchronization
8488800,	Apr 13 2001	Dolby Laboratories Licensing Corporation	Segmenting audio signals into auditory events
8521530,	Jun 30 2008	SAMSUNG ELECTRONICS CO , LTD	System and method for enhancing a monaural audio signal
8570328,	Dec 12 2000	Virentem Ventures, LLC	Modifying temporal sequence presentation data based on a calculated cumulative rendition period
8744844,	Jul 06 2007	SAMSUNG ELECTRONICS CO , LTD	System and method for adaptive intelligent noise suppression
8769601,	Oct 12 1994	PIXEL INSTRUMENTS CORP	Program viewing apparatus and method
8774423,	Jun 30 2008	SAMSUNG ELECTRONICS CO , LTD	System and method for controlling adaptivity of signal modification using a phantom coefficient
8797329,	Dec 12 2000	Virentem Ventures, LLC	Associating buffers with temporal sequence presentation data
8842844,	Apr 13 2001	Dolby Laboratories Licensing Corporation	Segmenting audio signals into auditory events
8849231,	Aug 08 2007	SAMSUNG ELECTRONICS CO , LTD	System and method for adaptive power control
8867759,	Jan 05 2006	SAMSUNG ELECTRONICS CO , LTD	System and method for utilizing inter-microphone level differences for speech enhancement
8886525,	Jul 06 2007	Knowles Electronics, LLC	System and method for adaptive intelligent noise suppression
8934641,	May 25 2006	SAMSUNG ELECTRONICS CO , LTD	Systems and methods for reconstructing decomposed audio signals
8949120,	Apr 13 2009	Knowles Electronics, LLC	Adaptive noise cancelation
9008329,	Jun 09 2011	Knowles Electronics, LLC	Noise reduction using multi-feature cluster tracker
9035954,	Dec 12 2000	Virentem Ventures, LLC	Enhancing a rendering system to distinguish presentation time from data time
9076456,	Dec 21 2007	SAMSUNG ELECTRONICS CO , LTD	System and method for providing voice equalization
9165562,	Apr 13 2001	Dolby Laboratories Licensing Corporation	Processing audio signals with adaptive time or frequency resolution
9185487,	Jun 30 2008	Knowles Electronics, LLC	System and method for providing noise suppression utilizing null processing noise subtraction
9251782,	Mar 21 2007	OSR ENTERPRISES AG	System and method for concatenate speech samples within an optimal crossing point
9338492,	Sep 19 2006	RAI RADIOTELEVISIONE ITALIANA S P A ; S I SV EL S P A	Method for reproducing an audio and/or video sequence, a reproducing device and reproducing apparatus using the method
9536540,	Jul 19 2013	SAMSUNG ELECTRONICS CO , LTD	Speech signal separation and synthesis based on auditory scene analysis and speech modeling
9640194,	Oct 04 2012	SAMSUNG ELECTRONICS CO , LTD	Noise suppression for speech processing based on machine-learning mask estimation
9723357,	Oct 12 1994	PIXEL INSTRUMENTS CORP	Program viewing apparatus and method
9799330,	Aug 28 2014	SAMSUNG ELECTRONICS CO , LTD	Multi-sourced noise suppression
9830899,	Apr 13 2009	SAMSUNG ELECTRONICS CO , LTD	Adaptive noise cancellation
9961441,	Jun 27 2013	DSP Group Ltd	Near-end listening intelligibility enhancement

THIS PATENT REFERENCES THESE PATENTS:

Patent	Priority	Assignee	Title
4022974,	Jun 03 1976	Bell Telephone Laboratories, Incorporated	Adaptive linear prediction speech synthesizer
4209844,	Jun 17 1977	Texas Instruments Incorporated	Lattice filter for waveform or speech synthesis circuits using digital logic
4406001,	Aug 18 1980	VARIABLE SPEECH CONTROL COMPANY THE A LIMITED PARTNERSHIP OF CT	Time compression/expansion with synchronized individual pitch correction of separate components
4435832,	Oct 01 1979	Hitachi, Ltd.	Speech synthesizer having speech time stretch and compression functions

ASSIGNMENT RECORDS Assignment records on the USPTO

Executed on	Assignor	Assignee	Conveyance	Frame	Reel	Doc
Feb 03 1988		The DSP Group, Inc.	(assignment on the face of the patent)
Mar 23 1988	BIALICK, LEONID	DSP GROUP, INC , THE, A CA CORP	ASSIGNMENT OF ASSIGNORS INTEREST	004879	0799	pdf

MAINTENANCE FEES AND DATES: Maintenance records on the USPTO

Date	Maintenance Fee Events
Feb 01 1993	M283: Payment of Maintenance Fee, 4th Yr, Small Entity.
Mar 03 1993	ASPN: Payor Number Assigned.
Feb 21 1997	M284: Payment of Maintenance Fee, 8th Yr, Small Entity.
Mar 01 2001	M285: Payment of Maintenance Fee, 12th Yr, Small Entity.

Date	Maintenance Schedule
Sep 05 1992	4 years fee payment window open
Mar 05 1993	6 months grace period start (w surcharge)
Sep 05 1993	patent expiry (for year 4)
Sep 05 1995	2 years to revive unintentionally abandoned end. (for year 4)
Sep 05 1996	8 years fee payment window open
Mar 05 1997	6 months grace period start (w surcharge)
Sep 05 1997	patent expiry (for year 8)
Sep 05 1999	2 years to revive unintentionally abandoned end. (for year 8)
Sep 05 2000	12 years fee payment window open
Mar 05 2001	6 months grace period start (w surcharge)
Sep 05 2001	patent expiry (for year 12)
Sep 05 2003	2 years to revive unintentionally abandoned end. (for year 12)