Speech signal compression and/or decompression method, medium, and apparatus

Speech signal compression and/or decompression method, medium, and apparatus
US8019600

A speech signal compression and/or decompression method, medium, and apparatus in which the speech signal is transformed into the frequency domain for quantizing and dequantizing information of frequency coefficients. The speech signal compression apparatus includes a transform unit to transform a speech signal into the frequency domain and obtain frequency coefficients, a magnitude quantization unit to transform magnitudes of the frequency coefficients, quantize the transformed magnitudes and obtain magnitude quantization indices, a sign quantization unit to quantize signs of the frequency coefficients and obtain sign quantization indices, and a packetizing unit to generate the magnitude and sign quantization indices as a speech packet.

PTO Wrapper PDF
Dossier Espace Google

Patent 8019600
Priority May 13 2004
Filed May 13 2005
Issued Sep 13 2011
Expiry Apr 27 2028 Extension 1080 days
Inventors Park, Hoch…
Assg.orig Samsung El…
Assg.curr SAMSUNG EL…
Entity Large
Referenced by 1
References 21
Maint.: EXPIRED

CROSS-REFERENCE TO R…
BACKGROUND OF THE IN…
SUMMARY OF THE INVEN…
BRIEF DESCRIPTION OF…
DETAILED DESCRIPTION…

20. A speech signal compression method comprising:

transforming a speech signal including a plurality of subframes into a frequency domain to obtain frequency coefficients;

transforming magnitudes of the frequency coefficients for each of the subframes of the speech signal and quantizing the transformed magnitudes to obtain magnitude quantization indices;

quantizing each sign of each of the frequency coefficients to obtain sign quantization indices; and

generating the magnitude quantization indices and the signs quantization indices as a speech packet,

wherein the frequency coefficients for each of the subframes are combined into a plurality of groups according to a time-varying property of the speech signal and a two-dimensional transform is performed on each group according to the time-varying property of the speech signal, the frequency coefficients being combined into groups based on a uniformity of the subframes in relation to neighboring subframes.

38. A computer-readable non-transitory medium encoded with instructions capable of being executed on a computer and implementing a speech signal compression method, comprising:

transforming a speech signal including a plurality of subframes into a frequency domain to obtain frequency coefficients;

transforming magnitudes of the frequency coefficients for each of the subframes of the speech signal and quantizing the transformed magnitudes to obtain magnitude quantization indices;

quantizing each sign of each of the frequency coefficients to obtain sign quantization indices; and

generating the magnitude quantization indices and the sign quantization indices as a speech packet,

wherein the frequency coefficients for each of the subframes are combined into a plurality of groups according to a time-varying property in energy of the speech signal and a two-dimensional transform is performed on each group according to the time-varying property of the speech signal, the frequency coefficients being combined into groups based on a uniformity of the subframes in relation to neighboring subframes.

1. A speech signal compression apparatus, including at least one processing device comprising:

a transform unit, using the at least one processing device, to transform a speech signal including a plurality of subframes into a frequency domain and obtain frequency coefficients;

a magnitude quantization unit to transform magnitudes of the frequency coefficients for each of the subframes of the speech signal, quantize the transformed magnitudes and obtain magnitude quantization indices;

a sign quantization unit to quantize each sign of each of the frequency coefficients and obtain sign quantization indices; and

a packetizing unit to generate the magnitude quantization indices and the sign quantization indices as a speech packet,

36. A speech signal decompression method comprising:

inversely packetizing a compressed speech packet to obtain sign quantization indices and magnitude quantization indices;

dequantizing the sign quantization indices that were obtained by quantizing each sign of each frequency coefficient obtained from a speech signal and coefficient signs;

dequantizing the magnitude quantization indices to obtain first coefficient magnitudes;

two-dimensionally arranging the first coefficient magnitudes to obtain second coefficient magnitudes;

inversely transforming the second coefficient magnitudes to obtain third coefficient magnitudes;

inserting signs into the third coefficient magnitudes to obtain frequency coefficients;

dividing the frequency coefficients into a plurality of subframes; and

inversely transforming the frequency coefficients to obtain a time domain signal for each of the subframes,

wherein the speech signal includes a plurality of subframes and the frequency coefficients for each of the subframes are combined into a plurality of groups according to a time-varying property in energy of the speech signal and a two-dimensional transform is performed on each group according to the time-varying property of the speech signal, the frequency coefficients being combined into groups based on a uniformity of the subframes in relation to neighboring subframes.

39. A computer-readable non-transitory medium encoded with instructions capable of being executed on a computer and implementing a speech signal decompression method, comprising: