The performance of a voice conference using a packet-based conference bridge can be improved with a number of modifications. In one modification, the conference bridge receives speech indication signals from the individual packet-based terminals within the voice conference, these speech indication signals then being used by the conference bridge to select the talkers within the voice conference. This removes the need for speech detection techniques within the conference bridge, hence decreasing the required processing power and the latency within the conference bridge. In another modification, the conference bridge sends addressing control signals to the individual packet-based terminals selected as talkers, these addressing control signals directing the terminals selected as talkers to directly transmit their voice data packets to the other terminals within the voice conference. This direct transmission of voice data packets can reduce transcoding and latency within the network. These two modifications could further be combined, resulting in a conference bridge that receives speech indication signals, selects the talkers for the voice conference and outputs addressing control signal to the talkers. In this case, the advantages of the two modifications are gained as well as additional capacity advantages resulting from no voice signals actually traversing the conference bridge.
|
10. A conference bridge arranged to be coupled to a packet-based network that includes at least two sources of media signals forming a media conference, the conference bridge comprising:
a talker selection unit that operates to:
receive speech indication signals from at least one of the sources within the media conference and to process the speech indication signals including selecting a set of the sources within the media conference as talkers; and
output addressing control signals to the sources within the media conference selected as talkers, the addressing control signals comprising instructions for the sources within the media conference selected as talkers to output their media signals directly to other sources within the media conference.
1. A conference bridge comprising:
an input unit that operates to receive media data packets from at least two sources forming a media conference, each media data packet defining a media signal;
an energy detection and talker selection unit, coupled to the input unit, that operates to:
determine at least one speech parameter corresponding to each of the media signals;
select a set of the sources within the media conference as talkers based on the determined speech parameters; and
output addressing control signals to only the sources within the media conference selected as talkers, the addressing control signals comprising instructions for the sources within the media conference selected as talkers to output their media signals directly to other sources within the media conference.
35. A method for a packet-based apparatus to operate within a media conference controlled by a conference bridge, the method comprising:
receiving media signal from at least one participant within the media conference;
processing the received media signal in order to generate a speech indication signal based upon the received media signal; and
outputting the received media signal and the speech indication signal to the conference bridge,
wherein processing the received media signal in order to generate a speech indication signal comprises:
determining if the received media signal contains speech;
if the received media signal contains speech, including a talking indication within the speech indication signal; and
if the received media signal does not contain speech, including a listening indication within the speech indication signal.
25. A packet-based apparatus arranged to be coupled to a conference bridge via a packet-based network, the packet-based apparatus comprising:
an output unit that operates to receive signal from at least one participant within a media conference and output the received media signal to the conference bridge via the packet-based network; and
a speech detection unit, coupled to the output unit, that operates to process the received media signal, generate a speech indication signal based upon the received media signal, and output the speech indication signal to the conference bridge,
wherein to generate a speech indication signal based upon the received media signal, the speech detection unit operates to:
determine if the received media signal contains speech;
if the received media signal contains speech, include a talking indication within the speech indication signal; and
if the received media signal does not contain speech, include a listening indication within the speech indication signal.
31. A packet-based apparatus arranged to be coupled to a conference bridge via a packet-based network, the packet-based apparatus comprising:
an addressing control unit that operates to receive at least one addressing control signal from the conference bridge; and
an output unit that operates to receive at least one media signal from at least one participant within a media conference and output the received media signal, via the packet-based network, to at least one other participant within the media conference based upon the at least one addressing control signal,
further comprising a speech detection unit, coupled to the output unit, that operates to process the received media signal, generate a speech indication signal based upon the received media signal, and output the speech indication signal to the conference bridge, wherein to generate a speech indication signal based upon the received media signal, the speech detection unit operates to:
determine if the received media signal contains speech;
if the received media signal contains speech, include a talking indication within the speech indication signal; and
if the received media signal does not contain speech, include a listening indication within the speech indication signal.
2. A conference bridge according to
3. A conference bridge according to
wherein the speech parameter corresponding to each of the media signals is a number of bytes within each of the compressed audio signals.
4. A conference bridge according to
5. A conference bridge according to
6. A conference bridge according to
7. A conference bridge according to
8. A network incorporating a conference bridge according to
wherein each of the sources within the media conference operates to output the at least one media signal to the conference bridge, receive the addressing control signal from the conference bridge, and output their media signals to the other sources within the media conference based upon the received addressing control signal.
9. A conference bridge according to
wherein the conference bridge further comprises a mixing block and an output unit, the mixing block coupled to the talker selection unit and the output unit coupled to the mixing block; and
wherein the mixing block operates to receive media signals corresponding to sources within the media conference selected as talkers from the input unit, mix these received media signals, and output the mixed result to the output unit.
11. A conference bridge according to
12. A conference bridge according to
monitor the speech indication signals for talking indications; and
select sources within the media conference as talkers based upon the order in which any talking indications are received at the talker selection unit from the sources within the media conference.
13. A conference bridge according to
14. A conference bridge according to
determine which sources within the media conference are sending media signals containing speech with the use of the speech parameters within the speech indication signals; and
select sources within the media conference as talkers based upon the order in which sources within the media conference are determined to send media signals containing speech.
15. A conference bridge according to
16. A conference bridge according to
determine which sources within the media conference are sending media signals containing speech with the use of the energy levels within the speech indication signals; and
select sources within the media conference as talkers based upon the comparative energy levels of the sources within the media conference determined to be sending media signals containing speech.
17. A conference bridge according to
18. A conference bridge according to
19. A conference bridge according to
wherein the conference bridge further comprises a mixing block and an output unit, the mixing block coupled to the talker selection unit and the output unit coupled to the mixing block; and
wherein the mixing block operates to receive media signals corresponding to sources within the media conference selected as talkers from the input unit, mix these received media signals, and output the mixed result to the output unit.
20. A conference bridge according to
21. A conference bridge according to
22. A conference bridge according to
23. A conference bridge according to
24. A network incorporating a conference bridge according to
wherein each of the sources within the media conference operates to output a speech indication signal to the conference bridge, receive the addressing control signal from the conference bridge, and output their media signals to the other sources within the media conference based upon the received addressing control signal.
26. A packet-based apparatus according to
27. A packet-based network interface arranged to be coupled between a packet-based network and a non-packet-based network, the network interface comprising a packet-based apparatus according to
28. A packet-based apparatus according to
29. A packet-based apparatus according to
30. A packet-based apparatus according to
wherein to determine if the received media signal contains speech, the speech detection unit operates to determine if the number of bytes of the compressed media signal indicates that the received media signal contains speech.
32. A packet-based apparatus according to
33. A packet-based apparatus according to
34. A packet-based apparatus according to
wherein to determine if the received media signal contains speech, the speech detection unit operates to determine if the number of bytes of the compressed media signal indicates that the received media signal contains speech.
|
This invention relates generally to packet-based media communications and more specifically to media conferencing within a packet-based communication network.
Prior to the use of packet-based voice communications, telephone conferences were a service option available within standard non-packet-based telephone networks such as Pulse Code Modulation (PCM) telephone networks. As depicted in
One such algorithm used to control a conference session, referred to as a “party line” approach, comprises the steps of mixing the voice communications received from each telephone terminal 16 within the conference session and further distributing the result to each of the telephone terminals 16 for broadcasting. A problem with this algorithm is the amount of noise that is combined during the mixing step, this noise comprising a background noise source corresponding to each of the telephone terminals 16 within the conference session.
An improved algorithm for controlling a conference session is disclosed within U.S. patent application Ser. No. 08/987,216 entitled “Method of Providing Conferencing in Telephony” by Dal Farra et al, filed on Dec. 9, 1997, assigned to the assignee of the present invention, and herein incorporated by reference. This algorithm comprises the steps of selecting primary and secondary talkers, mixing the voice communications from these two talkers and forwarding the result of the mixing to all the participants within the conference session except for the primary and secondary talkers. The primary and secondary talkers receive the voice communications corresponding to the secondary and primary talkers respectively. The selection and mixing of only two talkers at any one time can reduce the background noise level within the conference session when compared to the “party line” approach described above.
In a standard PCM telephone network as is depicted in
Currently, packet-based voice communications are being utilized more frequently as Voice-over-Internet Protocol (VoIP) becomes increasingly popular. In these standard VoIP communications, voice data in PCM form is being encapsulated with a header and footer to form voice data packets; the header in these packets has, among other things, a Real Time Protocol (RTP) header that contains a time stamp corresponding to when the packet was generated. One area that requires considerable improvement is the use of packet-based voice communications to perform telephone conferencing capabilities.
As depicted within
The inputting apparatus 30 performs a number of functions on the packets that are received at the conference bridge 28 from the terminals within a voice conference. These functions include protocol stack, jitter buffer and decompression operations. During the protocol stack operation, the inputting apparatus 30 receives packets comprising compressed voice signals, hereinafter referred to as voice data packets, and strips off the packet overhead required for transmitting the voice data packets through the packet-based network 20. During the jitter buffer operation, the inputting apparatus 30 receives the compressed voice signals, ensures that the compressed voice signals are within the proper sequence (i.e. time ordering signals), buffers the compressed voice signals to ensure smooth playback and ideally implements packet loss concealment. During the decompression operation, the inputting apparatus 30 receives the buffered compressed voice signals, converts them into standard PCM format and outputs the resulting voice signals (that are in Pulse Code Modulation) to the energy detection, talker selection and mixing block 32.
The energy detection, talker selection and mixing block 32 performs almost identical functionality to the conference bridge 17 within FIG. 1A. The key to the design of a conference bridge 28 as depicted in
The outputting apparatus 34 performs a number of functions on the outputs from the block 32, these functions including compression and transmission operations. During the compression operation, the outputting apparatus 34 receives and compresses respective ones of the three outputs from the energy detection, talker selection and mixing block 32. During the transmission operation, the outputting apparatus 34 performs a protocol stack operation on the compressed voice signals, encapsulates the compressed voice signals within the packet-based format required for transmission on the packet-based network 20 and transmits voice data packets comprising the compressed voice signals to the appropriate terminals 22,24,26 within the conference session. It is noted that, in the case of the talker selection algorithm described above, the mixed voice signal is forwarded to all the terminals with the exception of the primary and secondary talkers while the primary and secondary talkers are sent the appropriate unmixed voice signals.
One problem with the setup depicted within
Hence, a new design within a packet-based voice communication network is required to implement voice conferencing functionality. In this new design, a reduction in transcoding, latency and/or required signal processing power within the conferencing network is needed.
The present invention is directed to methods and apparatus that can be utilized within a packet-based media communication system for media conferences. In one embodiment of the present invention, a packet-based conference bridge receives speech indication signals from the individual packet-based terminals within a voice conference, these speech indication signals being used to select the talkers within the voice conference. The speech indication signals could be a talking/listening indication, an energy level indication or another parameter that a talker selection algorithm could use to select packet-based terminals as talkers. In another embodiment of the present invention, the packet-based conference bridge sends addressing control signals to the individual packet-based terminals selected as talkers. These addressing control signals indicate the packet-based network addresses for all the packet-based terminals that the talker should directly transmit its voice data packets to. A yet other embodiment of the present invention combines the use of both of the above embodiments such that the packet-based conference bridge essentially comprises a talker selection block that receives speech indication signals from packet-based terminals within a voice conference and transmits addressing control signals to the terminals that are selected as talkers in order to direct the voice data packets from the talker(s) to the appropriate other packet-based terminals within the voice conference.
There are numerous advantages of the embodiments of the present invention compared to well-known voice conferencing techniques. For one, all of the embodiments of the present invention reduce the amount of processing power required within the conference bridges. This is done by removing the need for an energy detection block and/or an outputting apparatus within the conference bridge. This, in turn, can reduce the latency for the voice data packets. Another advantage of some embodiments of the present invention is a reduced transcoding that must be done. This reduction could be caused by the reduced need to decompress the compressed voice signals within the conference bridge due to the independently received speech detection signals. Further, by transmitting voice data packets in some embodiments directly between the source of the voice data packets to the destination of the voice data packets, a significant reduction in transcoding can be achieved. Yet another advantage of embodiments of the present invention is the reduced concentration of traffic that results from the implementation of the combined embodiments. In this case, the conference bridge does not receive or transmit high bandwidth voice data packets, but rather receives and transmits control signals to manage the voice conference. This also reduces any strain that might occur on the limited input/output capacity for the conference bridge.
The present invention, according to a first broad aspect, is a conference bridge including an input unit, a talker selection unit and an output unit. The input unit operates to receive at least one media data packet from at least two sources forming a media conference, each media data packet defining a media signal. The talker selection unit operates to receive speech indication signals from at least one of the sources within the media conference and to process the speech indication signals including selecting a set of the sources within the media conference as talkers. The output unit operates to output the media signals that correspond to the set of sources within the media conference selected as talkers.
The present invention, according to a second broad aspect, is a conference bridge including an input unit, an energy detection and talker selection unit and an output unit. The input unit operates to receive at least one media data packet from at least two sources forming a media conference, each media data packet defining a media signal. The energy detection and talker selection unit operates to determine at least one speech parameter corresponding to each of the media signals and select a set of the sources within the media conference as talkers based on the determined speech parameters. The output unit operates to output addressing control signals to the sources within the media conference selected as talkers. The addressing control signals comprise instructions for the sources within the media conference selected as talkers to output their media signals directly to other sources within the media conference.
The present invention, according to a third broad aspect, is a conference bridge arranged to be coupled to a packet-based network that includes at least two sources of media signals forming a media conference. In this aspect, the conference bridge includes a talker selection unit similar to that of the first broad aspect and an output unit similar to the second broad aspect.
According to a fourth broad aspect, the present invention is a packet-based apparatus arranged to be coupled to a conference bridge via a packet-based network. The packet-based apparatus including an output unit and a speech detection unit. The output unit operates to receive at least one media signal from at least one participant within a media conference and output the received media signal to the conference bridge via the packet-based network. The speech detection unit operates to process the received media signal, generate a speech indication signal based upon the received media signal and output the speech indication signal to the conference bridge.
According to a fifth broad aspect, the present invention is a packet-based apparatus arranged to be coupled to a conference bridge via a packet-based network, the apparatus including an addressing control unit and an output unit. The addressing control unit operates to receive at least one addressing control signal from the conference bridge. The output unit operates to receive at least one media signal from at least one participant within a media conference and output the received media signal, via the packet-based network, to at least one other participant within the media conference based upon the addressing control signal. In another embodiment of the fifth broad aspect, the apparatus further includes a speech detection unit similar to that of the fourth broad aspect.
In yet further aspects, the present invention is a method for controlling a media conference, a method for a packet-based apparatus to operate within a media conference controlled by a conference bridge and a network incorporating a conference bridge according to one of the first three broad aspects.
Other aspects and features of the present invention will become apparent to those ordinarily skilled in the art upon review of the following description of specific embodiments of the invention in conjunction with the accompanying figures.
Embodiments of the present invention are described with reference to the following figures, in which:
The present invention is directed to a number of different methods and apparatus that can be utilized within a packet-based voice communication system. Primarily, the embodiments of the present invention are directed to methods and apparatus used for voice conferences within packet-based communication networks, but this is not meant to limit the scope of the present invention.
One skilled in the art would understand that there are two essential sectors for the operations of a telephone session. These sectors include a control plane that performs administrative functions such as access approval and build-up/tear-down of telephone sessions and/or conference sessions and a media plane which performs the signal processing required on media (voice or video) streams such as format conversions and mixing operations. As described below, the present invention is applicable to modifications within the media plane which could be implemented with a variety of different control planes while remaining within the scope of the present invention.
Embodiments of the present invention described herein below are directed to packet-based conference bridges and packet-based apparatus coupled within a packet-based network that enable media conferences between numerous sources of media signals. These sources of media signals can be any device in which a person can output media data for transmission within the packet-based network. In some embodiments, the packet-based apparatus are packet-based terminals coupled together with the packet-based conference bridge within a packet-based network, each of the packet-based terminals being a source for media signals for the other packet-based apparatus.
In other embodiments, one or more of the packet-based apparatus are packet-based network interfaces which couple standard non-packet-based terminals, such as PCM or analog telephone terminals, to a packet-based network, each of the non-packet-based terminals being a source for media signals for the media conference. This situation is illustrated within
In the following description, it should be understood that despite referring to the sources of media signals as packet-based terminals within the packet-based network throughout this document, such references could alternatively be directed to another form of media signal source. Further, although the packet-based apparatus described below are the packet-based terminals that also serve as the source for media signals, it should be understood that, alternatively, the packet-based apparatus could be packet-based network interfaces. Yet further, although the following description of the present invention is specific to voice data packets that contain compressed voice signals and generally to voice conferencing, this should not limit the scope of the present invention as is described in further detail herein below.
A first embodiment of the present invention, in which reduced processing is required within the packet-based conference bridge compared to well-known conference bridge designs, is now described with reference to
In operation, the talker selection block 44 receives the speech indication signals from the packet-based terminals within the voice conference, via the packet-based network 20, and performs a predefined talker selection algorithm. This talker selection algorithm could be similar to that disclosed within U.S. patent application Ser. No. 08/987,216, as incorporated by reference herein above, in which primary and secondary talkers are selected, though the present invention should not be limited to this implementation. During the selection of talkers by the talker selection block 44, the technique used depends upon the particular design. For instance, in one implementation, talkers are selected based upon the order in which participants in the voice conference begin to speak. In this case, the talkers are selected as the first terminals which send speech indication signals to the talker selection block 44 indicating that a participant local to the particular packet-based terminal has begun to speak. In other designs, the energy level of the voice signals, as indicated within the speech indication signals received from the packet-based terminals, is used by the talker selection block 44 to select the talkers. In yet other designs, some of the talkers could be pre-selected while the talker selection block 44 uses the speech indication signals simply to select the other talker(s) within the voice conference. This could be applicable in cases that a monitor or prearranged speaker for the voice conference is always selected as a talker.
Within the implementation of
It should be noted that a procedure for de-selecting talkers is another operation within the talker selection block 44. In one embodiment, the de-selection of a packet-based terminal as a talker occurs if a speech indication signal received from the particular terminal indicates that a participant local to the terminal has stopped speaking. In another embodiment, the de-selection of a packet-based terminal as a talker occurs if speech indication signals received from the particular terminal indicate the speech from a participant local to the terminal has decreased in energy. In yet another embodiment, the de-selection of a terminal as a talker is performed if a predetermined time interval is passed since the receipt of a speech indication signal that indicates that the particular terminal has a participant local to the terminal speaking.
There are numerous alternative implementations for the packet-based conference bridge according to the first embodiment of the present invention. For one, modifications within the conference bridge could be made similar to those described within U.S. patent application Ser. No. 09/475,047 entitled “APPARATUS AND METHOD FOR PACKET-BASED MEDIA COMMUNICATIONS” by Simard et al, filed on Dec. 29, 1999 and incorporated herein by reference. As indicated within U.S. patent application Ser. No. 09/475,047, there are numerous implementations for the inputting apparatus 30, talker selection and mixing block 42 and the outputting apparatus 34 possible. For instance, the jitter buffer operation could be removed from the inputting apparatus 30 in some implementations. Further, in some implementations, the inputting apparatus 30 does not need to perform a decompression operation and the outputting apparatus 34 does not need to perform a compression operation on any voice signals corresponding to talkers which do not require a mixing operation. This reduced transcoding can result in higher quality voice signals being broadcast to the participants of the voice conference as well as reduce the latency of the voice data packets through the conference bridge 28.
In yet further alternatives, the talker selection block 44 is coupled to the inputting apparatus 30 so as to prevent the unnecessary processing of voice data packets that are received from packet-based terminals that are not selected as talkers. This can be accomplished with the present invention since the selection of the talkers within the voice conference is independent of the processing of the received voice data packets.
It should be noted that although the blocks 30,34,44,46 within
In operation, the inputting apparatus 50 receives the voice data packets output from the packet-based conference bridge 28 and, along with the decompression unit 52, performs similar operations as described above for the inputting apparatus 30 within
The microphone 58 operates to receive sound waves local to the microphone 58 and generate analog voice signals corresponding to the sound waves, these analog voice signals being input to the A/D converter 60. The A/D converter 60 converts the analog voice signals to a digital format and forwards these voice signals to the compression unit 62. The compression unit 62 combined with the outputting apparatus 64 perform similar operations to those described above for the outputting apparatus 34 within
Both of the above described operations within the packet-based terminal of
There are numerous alternative implementations for the speech detector 66. For instance, in one implementation, the speech detector 66 sends the talking signal to the packet-based conference bridge 28 when it first detects the energy level of the received voice signals have exceeded the predetermined energy threshold for a first predetermined time interval and sends the listening signal to the packet-based conference bridge 28 when it detects the energy level of the received voice signals are below the predetermined energy threshold for a second predetermined time interval.
In other embodiments, the speech indication signals are not talking and listening signals respectively. Instead, the speech indication signals correspond to specific parameters extracted from the received voice signals. For instance, the speech indication signals in one implementation correspond to energy levels for the voice signals. In one example, these speech indication signals could be nil energy (0), a low energy level (E1) or a high energy level (E2). For this example, multiple energy thresholds could be used for comparison in order to classify the energy level of talking at the specific packet-based terminal. In another implementation, the extracted parameters from the voice signals could be the pitch of the voice signals. In this case, the pitch could either be directly forwarded to the talker selection block 44 or, alternatively, a determination could take place within the speech detector 66 on whether the pitch indicates that there is speech or not. In the alternative case, a talking or listening signal as described above could be sent after processing the pitch values.
It should be noted that, although not illustrated within
Although the speech detector 66 is illustrated in
In other implementations, the speech detector 66 receives the compressed voice signals from the compression unit 62 and/or the voice data packets from the outputting apparatus 64. In these cases, speech detection operations as disclosed within U.S. patent application Ser. No. 09/475,047, previously incorporated by reference, could be utilized. In one implementation, as disclosed within U.S. patent application Ser. No. 09/475,047, a voice Activity Detection (VAD) operation is enabled at the packet-based terminal. In this embodiment, packets (and therefore compressed voice signals) that contain speech can be distinguished from packets that do not by the number of bytes contained within the packet. In other words, the size of the compressed voice signal can determine whether it contains speech. For example, in the case that the G.723.1 VoIP standard is utilized, voice data packets containing voice would contain a compressed voice signal of 24 bytes while voice data packets containing essentially silence would contain a compressed voice signal of 4 bytes. In another implementation as disclosed within U.S. patent application Ser. No. 09/475,047, the speech detector 66 could determine if there is speech within a compressed voice signal by monitoring a pitch-related sector within the corresponding voice data packet. For example, within the G.723.1 VoIP standard, the pitch sector is an 18-bit field that contains pitch lag information for all subframes. In this particular implementation, the speech detector 66 could use the pitch sector to generate a pitch value for each subframe. If the pitch value is within a particular predetermined range, the corresponding compressed voice signal is said to contain speech. If not, the compressed voice signal is said to not contain speech. This predetermined range can be determined by experimentation or alternatively calculated mathematically. It is noted that many current VoIP standard codecs include pitch information as part of the transmitted packet and a similar comparison of pitch values with a predetermined range can be used with these standards.
Although the blocks within
There are a number of advantages of the packet-based network according to the first embodiment of the present invention. For one, there is a decrease in required processing power within the conference bridge 28 compared to well-known designs due to the removal of the energy detection operation from the conference bridge. This removal of the energy detection operation further, as described above, could lead to reduced need for decoding, decompression and transcoding operations and thus to increased quality voice signals with significantly reduced latency.
As depicted within
Next within the signalling diagram of
Subsequently, terminal A 22 sends a talking signal 78 to the conference bridge 28, this talking signal 78 indicating that a participant within the voice conference local to terminal A 78 has begun to speak. In this case, since primary and secondary talkers are already selected and in this particular example only two talkers are to be selected at a time, no change occurs within the conference bridge 28 due to the receipt of talking signal 78. Essentially, the participant at the terminal A 22 is being muted within the voice conference.
Next as depicted in
It should be noted that the above descriptions of sample signalling diagrams within a network according to the first embodiment of the present invention, should not be used to limit the scope of the present invention. This signalling diagrams are included to illustrate two possible implementations of the present invention.
A second embodiment of the present invention, in which the transmission of voice data packets is routed directly between packet-based terminals according to instructions from a packet-based conference bridge, is now described with reference to
In operation, the energy detection and talker selection block 100 receives the voice signals corresponding to participants within a voice conference from the inputting apparatus 30, performs an energy detection operation on the received voice signals to determine which packet-based terminals within the voice conference have participants local to the terminals speaking, and selects the talker(s) within the voice conference based upon the results of the energy detection operation. Further, the block 100 within
The energy detection operation performed within the energy detection and talker selection block 100 could be implemented in a number of different manners. For instance, it could include one of the speech detection algorithms described above for speech detector 66. As described previously, the operation of energy detection/speech detection algorithms are disclosed within U.S. patent application Ser. No. 09/475,047 as incorporated by reference previously. The talker selection operation performed within the block 100 could also be implemented in numerous different manners. Essentially, all of the possible implementations previously described for the talker selection block 44 of
As described above, the selection of the talkers within block 100 determines which packet-based terminals within the voice conference receive the addressing control signals, the addressing control signals giving the talkers permission to transmit their voice data packets to the other terminals within the voice conference. As well, the addressing control signals preferably forward the packet-based network addresses corresponding to the other packet-based terminals that is needed to transmit the voice data packets directly. In alternative implementations, the talker(s) do not require the packet-based network addresses since they have them stored internally. In this case, the addressing control signals are simply permission signals to allow the talkers to transmit to the other packet-based terminals within the voice conference.
As an option to the conference bridge according to the second embodiment of the present invention depicted in
There are numerous alternative implementations for the packet-based conference bridge according to the second embodiment of the present invention. For one, similar to the first embodiment of the present invention, modifications within the conference bridge could be made similar to those described within U.S. patent application Ser. No. 09/475,047, previously incorporated by reference. As indicated within U.S. patent application Ser. No. 09/475,047, there are numerous implementations for the inputting apparatus 30 and energy detection and talker selection block 100 possible.
It should be noted that although the blocks 30,100,46,34 within
In the operation of the packet-based terminal of
It should be recognized that modifications are required within the inputting apparatus 50 within the packet-based terminal for the second embodiment of the present invention if more than one talker is allowed to be selected at a time. This is because, according to the second embodiment of the present invention, this would result in more than one set of voice data packets arriving at the inputting apparatus 50. In the case of primary and secondary talkers being selected by the block 100, it is possible that a particular terminal will receive voice data packets from two different talkers. In this situation, the packet-based terminal mix the primary and secondary voice signals to generate mixed voice signals.
Although depicted as separate components within
Although the blocks within
There are a number of advantages of the packet-based network according to the second embodiment of the present invention. With the direct transmission of voice data packets from one packet-based terminal to other packet-based terminals, there is a significantly lighter load on the conference bridge which translates into higher capacity. Further, the conferencing configuration of the second embodiment reduces the concentration effect in which conference bridges are traditionally significant sources and sinks of traffic within the network and redistributes the traffic more evenly within the packet-based network. Yet further, the direct transmission of the voice data packets can reduce the need for transcoding and also decrease the overall latency.
As depicted within
Next, within
As depicted in
A third embodiment of the present invention, in which the first and second embodiments of the present invention are combined, is now described with reference to
In this third embodiment of the present invention, the packet-based conference bridge 28 is reduced to simply a talker selection block 150 as illustrated in FIG. 11. The talker selection block 150 operates in similar fashion to talker selection block 44 in terms of selecting talkers based upon the received speech indication signals while the block 150 operates in similar fashion to block 100 in terms of sending addressing control signals based upon the selection of the talker(s). The talker selection block 150 could be implemented in numerous manners similar to the blocks 44,100 described above with reference to
As depicted within
Next within
As depicted in
The packet-based terminals for embodiments as described herein above is not specific to any one packet-based voice communications standard (such as VoIP G.711, G.729, G.723, etc), as it can be modified such that it can be used for numerous different standards. In one alternative embodiment, the packet-based terminal is a multi-mode terminal that allows for voice conferences of a number of different standards to utilize the single packet-based terminal.
It should be noted that, although the network described above for embodiments of the present invention was specific to networks used for voice conferencing, this should not limit the scope of the present invention. For instance, the network of packet-based terminals could be used for point-to-point communications as well as voice conferencing. In the case of a point-to-point voice communication, both terminals would select the other participant as a lone talker. This allows a point-to-point conversation to be expanded to a larger voice conference with no major configuration modifications.
In general, although the operation of the present invention was described herein above with use of the terms voice data packets and voice signals, these packets and signals can be referred to broadly as media data packets and media signals respectively. In this case, media data packets are any data packets that are transmitted via the media plane, these media data packets preferably being either audio or audio/video data packets. It is noted that use of the term voice data packets above is specific to the described embodiments in which the audio signals are voice. Further, it should be understood that video data packets may incorporate audio data packets.
Although the present invention herein above described has a single voice conference being established with the use of a network of packet-based apparatus and a conference bridge, it should be understood that in some embodiments the conference bridge it could be possible and/or one or more of the packet-based apparatus could be capable of handling a plurality of voice conferences simultaneously.
Persons skilled in the art will appreciate that there are yet more alternative implementations and modifications possible for implementing the present invention, and that the above implementation is only an illustration of this embodiment of the invention. The scope of the invention, therefore, is only to be limited by the claims appended hereto.
Simard, Frederic F., Edholm, Philip K., Cuddy, David R.
Patent | Priority | Assignee | Title |
7145884, | Apr 17 2002 | Texas Instruments Incorporated | Speaker tracking on a single core in a packet based conferencing system |
7266127, | Feb 08 2002 | Lucent Technologies Inc. | Method and system to compensate for the effects of packet delays on speech quality in a Voice-over IP system |
7292543, | Apr 17 2002 | Texas Instruments Incorporated | Speaker tracking on a multi-core in a packet based conferencing system |
7428223, | Sep 26 2001 | UNIFY PATENTE GMBH & CO KG | Method for background noise reduction and performance improvement in voice conferencing over packetized networks |
7460656, | Dec 18 2003 | Intel Corporation | Distributed processing in conference call systems |
7483400, | Apr 07 2002 | THINKLOGIX, LLC | Managing a packet switched conference call |
7486629, | Aug 09 2002 | UNIFY GMBH & CO KG | System for controlling conference circuit in packet-oriented communication network |
7599834, | Nov 29 2005 | DILITHIUM ASSIGNMENT FOR THE BENEFIT OF CREDITORS , LLC; Onmobile Global Limited | Method and apparatus of voice mixing for conferencing amongst diverse networks |
7602769, | Nov 01 2001 | LG-ERICSSON CO , LTD | Audio packet switching system |
7619995, | Jul 18 2003 | RPX CLEARINGHOUSE LLC | Transcoders and mixers for voice-over-IP conferencing |
7680047, | Nov 22 2005 | Cisco Technology, Inc. | Maximum transmission unit tuning mechanism for a real-time transport protocol stream |
7693190, | Nov 22 2006 | Cisco Technology, Inc. | Lip synchronization for audio/video transmissions over a network |
7694002, | Apr 07 2006 | STA GROUP LLC | System and method for dynamically upgrading / downgrading a conference session |
7778206, | Jan 06 2005 | Cisco Technology, Inc. | Method and system for providing a conference service using speaker selection |
7787605, | Dec 31 2001 | Polycom, Inc | Conference bridge which decodes and responds to control information embedded in audio information |
7847815, | Oct 11 2006 | Cisco Technology, Inc. | Interaction based on facial recognition of conference participants |
7864786, | Sep 02 2004 | Samsung Electronics Co., Ltd. | Repeater apparatus for supporting a plurality of protocols, and a method for controlling protocol conversion in the repeater apparatus |
7870590, | Oct 20 2004 | Cisco Technology, Inc. | System and method for fast start-up of live multicast streams transmitted over a packet network |
7983200, | Dec 29 2000 | RPX CLEARINGHOUSE LLC | Apparatus and method for packet-based media communications |
8077636, | Jul 18 2003 | RPX CLEARINGHOUSE LLC | Transcoders and mixers for voice-over-IP conferencing |
8120637, | Sep 20 2006 | Cisco Technology, Inc.; Cisco Technology, Inc | Virtual theater system for the home |
8121277, | Dec 12 2006 | Cisco Technology, Inc.; Cisco Technology, Inc | Catch-up playback in a conferencing system |
8144854, | May 10 2001 | Polycom, Inc | Conference bridge which detects control information embedded in audio information to prioritize operations |
8149261, | Jan 10 2007 | Cisco Technology, Inc.; Cisco Technology, Inc | Integration of audio conference bridge with video multipoint control unit |
8169937, | Jul 04 2002 | THINKLOGIX, LLC | Managing a packet switched conference call |
8208003, | Mar 23 2007 | Cisco Technology, Inc. | Minimizing fast video update requests in a video conferencing system |
8218654, | Mar 08 2006 | SYNAMEDIA LIMITED | Method for reducing channel change startup delays for multicast digital video streams |
8255574, | May 20 2009 | Empire Technology Development LLC | System for locating computing devices |
8289362, | Sep 26 2007 | Cisco Technology, Inc | Audio directionality control for a multi-display switched video conferencing system |
8326927, | May 23 2006 | Cisco Technology, Inc. | Method and apparatus for inviting non-rich media endpoints to join a conference sidebar session |
8358763, | Aug 21 2006 | Cisco Technology, Inc. | Camping on a conference or telephony port |
8462847, | Feb 27 2006 | SYNAMEDIA LIMITED | Method and apparatus for immediate display of multicast IPTV over a bandwidth constrained network |
8495688, | Oct 20 2004 | Cisco Technology, Inc. | System and method for fast start-up of live multicast streams transmitted over a packet network |
8526336, | Aug 09 2006 | Cisco Technology, Inc. | Conference resource allocation and dynamic reallocation |
8588077, | Sep 11 2006 | Cisco Technology, Inc. | Retransmission-based stream repair and stream join |
8711854, | Apr 16 2007 | Cisco Technology, Inc. | Monitoring and correcting upstream packet loss |
8769591, | Feb 12 2007 | Cisco Technology, Inc.; Cisco Technology, Inc | Fast channel change on a bandwidth constrained network |
8787153, | Feb 10 2008 | Cisco Technology, Inc. | Forward error correction based data recovery with path diversity |
8989058, | Sep 28 2011 | Marvell World Trade Ltd | Conference mixing using turbo-VAD |
9015555, | Nov 18 2011 | Cisco Technology, Inc.; Cisco Technology, Inc | System and method for multicast error recovery using sampled feedback |
9083585, | Sep 11 2006 | Cisco Technology, Inc. | Retransmission-based stream repair and stream join |
9246962, | Sep 28 2011 | MARVELL INTERNATIONAL LTD; CAVIUM INTERNATIONAL; MARVELL ASIA PTE, LTD | Conference mixing using turbo-VAD |
9337898, | Apr 14 2009 | CLEAR-COM LLC | Digital intercom network over DC-powered microphone cable |
9467569, | Mar 05 2015 | Raytheon Company | Methods and apparatus for reducing audio conference noise using voice quality measures |
9639906, | Mar 12 2013 | HM ELECTRONICS, INC | System and method for wideband audio communication with a quick service restaurant drive-through intercom |
Patent | Priority | Assignee | Title |
3912874, | |||
3937898, | Jul 18 1974 | ITT Corporation | Digital conference bridge |
4507781, | Mar 14 1980 | IBM Corporation | Time domain multiple access broadcasting, multipoint, and conferencing communication apparatus and method |
4685425, | Feb 14 1985 | AOS Holding Company | Submersible chamber water heater |
4920565, | Jul 18 1988 | Nortel Networks Limited | Method for connection of secure conference calls |
5020098, | Nov 03 1989 | AT&T Bell Laboratories | Telephone conferencing arrangement |
5317567, | Sep 12 1991 | UNITED STATES OF AMERICA, THE, AS REPRESENTED BY THE SECRETARY OF THE AIR FORCE | Multi-speaker conferencing over narrowband channels |
5818836, | Aug 09 1995 | CLICK-TO-CALL TECHNOLOGIES LP | Method and apparatus for anonymous voice communication using an online data service |
5991385, | Jul 16 1997 | International Business Machines Corporation | Enhanced audio teleconferencing with sound field effect |
6081513, | Feb 10 1997 | AT&T Corp.; AT&T Corp | Providing multimedia conferencing services over a wide area network interconnecting nonguaranteed quality of services LANs |
6141597, | Sep 08 1997 | Polycom, Inc | Audio processor |
6157635, | Feb 13 1998 | Hewlett Packard Enterprise Development LP | Integrated remote data access and audio/visual conference gateway |
6463414, | Apr 12 1999 | WIAV Solutions LLC | Conference bridge processing of speech in a packet network environment |
6466550, | Nov 11 1998 | Cisco Technology, Inc. | Distributed conferencing system utilizing data networks |
6522633, | Dec 22 1998 | RPX CLEARINGHOUSE LLC | Conferencing arrangement for use with wireless terminals |
6577622, | Sep 27 1999 | Hewlett Packard Enterprise Development LP | System and method for using a portable information device to establish a conference call on a telephony network |
6606305, | Nov 25 1998 | WSOU Investments, LLC | Apparatus, method and system for automatic telecommunication conferencing and broadcasting |
6654455, | Jun 09 1999 | Oki Electric Industry Co., Ltd. | IP conference telephone system compatible with IP-PBX systems |
6810116, | Sep 12 2000 | International Business Machines Corporation | Multi-channel telephone data collection, collaboration and conferencing system and method of using the same |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Mar 22 1994 | CUDDY, DAVID R | Northern Telecom Limited | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 026312 | /0493 | |
Apr 29 1999 | Northern Telecom Limited | Nortel Networks Corporation | CHANGE OF NAME SEE DOCUMENT FOR DETAILS | 026389 | /0174 | |
May 01 2000 | Nortel Networks Corporation | Nortel Networks Limited | CHANGE OF NAME SEE DOCUMENT FOR DETAILS | 026389 | /0228 | |
Dec 29 2000 | Nortel Networks Limited | (assignment on the face of the patent) | / | |||
Jan 17 2001 | SIMARD, FREDERIC F | Nortel Networks Limited | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 011831 | /0349 | |
Apr 04 2001 | EDHOLM, PHILIP K | Nortel Networks Limited | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 011831 | /0349 | |
Jul 29 2011 | Nortel Networks Limited | Rockstar Bidco, LP | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 027164 | /0356 | |
May 09 2012 | Rockstar Bidco, LP | Rockstar Consortium US LP | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 032167 | /0270 | |
Nov 13 2013 | Rockstar Consortium US LP | Bockstar Technologies LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 032399 | /0116 | |
Jan 28 2015 | ROCKSTAR CONSORTIUM LLC | RPX CLEARINGHOUSE LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034924 | /0779 | |
Jan 28 2015 | Constellation Technologies LLC | RPX CLEARINGHOUSE LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034924 | /0779 | |
Jan 28 2015 | NETSTAR TECHNOLOGIES LLC | RPX CLEARINGHOUSE LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034924 | /0779 | |
Jan 28 2015 | MOBILESTAR TECHNOLOGIES LLC | RPX CLEARINGHOUSE LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034924 | /0779 | |
Jan 28 2015 | Rockstar Consortium US LP | RPX CLEARINGHOUSE LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034924 | /0779 | |
Jan 28 2015 | Bockstar Technologies LLC | RPX CLEARINGHOUSE LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034924 | /0779 | |
Feb 26 2016 | RPX CLEARINGHOUSE LLC | JPMORGAN CHASE BANK, N A , AS COLLATERAL AGENT | SECURITY AGREEMENT | 038041 | /0001 | |
Feb 26 2016 | RPX Corporation | JPMORGAN CHASE BANK, N A , AS COLLATERAL AGENT | SECURITY AGREEMENT | 038041 | /0001 | |
Dec 22 2017 | JPMORGAN CHASE BANK, N A | RPX Corporation | RELEASE REEL 038041 FRAME 0001 | 044970 | /0030 | |
Dec 22 2017 | JPMORGAN CHASE BANK, N A | RPX CLEARINGHOUSE LLC | RELEASE REEL 038041 FRAME 0001 | 044970 | /0030 |
Date | Maintenance Fee Events |
Aug 18 2005 | ASPN: Payor Number Assigned. |
Mar 20 2009 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity. |
Mar 18 2013 | M1552: Payment of Maintenance Fee, 8th Year, Large Entity. |
May 26 2017 | REM: Maintenance Fee Reminder Mailed. |
Nov 13 2017 | EXP: Patent Expired for Failure to Pay Maintenance Fees. |
Date | Maintenance Schedule |
Oct 18 2008 | 4 years fee payment window open |
Apr 18 2009 | 6 months grace period start (w surcharge) |
Oct 18 2009 | patent expiry (for year 4) |
Oct 18 2011 | 2 years to revive unintentionally abandoned end. (for year 4) |
Oct 18 2012 | 8 years fee payment window open |
Apr 18 2013 | 6 months grace period start (w surcharge) |
Oct 18 2013 | patent expiry (for year 8) |
Oct 18 2015 | 2 years to revive unintentionally abandoned end. (for year 8) |
Oct 18 2016 | 12 years fee payment window open |
Apr 18 2017 | 6 months grace period start (w surcharge) |
Oct 18 2017 | patent expiry (for year 12) |
Oct 18 2019 | 2 years to revive unintentionally abandoned end. (for year 12) |