Established multimodal conversations are enabled to be parked within an enhanced communication system such that a subscriber of the system can be notified through a variety of means and enabled to retrieve selected or all modalities for continuing the conversation. Different modalities may be parked together or separately. While waiting for the subscriber to retrieve the conversation, a participant may receive audio, video, presentation, or other forms of content as playback.
|
10. A communication system for implementing multimodal conversation park and retrieval, the system comprising:
a communication server configured to facilitate multimodal communications between endpoints of the system;
a park management server configured to:
receive a request for parking an established multimodal conversation from an endpoint of the system;
park the conversation in one of: a centralized manner at the park management server and a distributed manner at a plurality of servers, wherein distinct modalities of the conversation are associated together through a conversation identifier;
provide a location identifier to the requesting endpoint such that another endpoint can be notified about the parked conversation; and
enable the endpoint to retrieve at least one of the distinct modalities of the parked conversation;
a participant of the conversation configured to:
employ at least two endpoints to participate in the conversation and enable the endpoints to negotiate park operations among themselves.
1. A method to be executed at least in part in a computing device for facilitating multimodal conversation park and retrieval, the method comprising:
receiving a request for parking a multimodal conversation at a communication server;
parking the conversation, wherein distinct modalities of the conversation are identified with a conversation identifier;
notifying at least one subscriber about the parked conversation;
enabling the at least one subscriber to retrieve at least one of the distinct modalities of the parked conversation;
upon receiving an indication of selected modalities of the parked conversation from the at least one subscriber, enabling the subscriber to continue the conversation in the selected modalities; and
providing a subscriber requesting to park the conversation with a parking location identifier of the conversation such that the notified at least one subscriber is enabled to retrieve the at least one of the distinct modalities of the parked conversation from a location identified by the location identifier.
14. A tangible computer-readable memory device with instructions stored thereon for managing multimodal conversations with park and retrieval capability, the instructions comprising:
facilitating a multimodal conversation among subscribers of a unified communication system, wherein each subscriber employs at least one endpoint to participate in the conversation;
receiving a request for parking the multimodal conversation from a participating endpoint;
parking the conversation, wherein distinct modalities of the conversation are associated together through a conversation identifier;
providing a location identifier to the requesting endpoint such that another participating endpoint can be notified about the parked conversation;
providing multimodal content to participating endpoints of the conversation while the conversation is parked;
enabling a notified endpoint to retrieve at least one of the distinct modalities of the parked conversation; and
providing an endpoint requesting to park the conversation with a parking location identifier of the conversation such that the notified other endpoint is enabled to retrieve the at least one of the distinct modalities of the parked conversation from a location identified by the location identifier.
2. The method of
providing participants of the conversation multimodal content while the conversation is parked.
3. The method of
4. The method of
5. The method of
6. The method of
7. The method of
8. The method of
9. The method of
11. The system of
12. The system of
13. The system of
15. The tangible computer-readable memory device medium of
16. The tangible computer-readable memory device of
17. The tangible computer-readable memory device of
18. The tangible computer-readable memory device of
|
Call parking and retrieval are an integral part of conventional communication technologies such as in PBXs. A parked call is typically a time-extended call transfer. Call parking is restricted to just audio calls and a single modality in the phone communication systems.
Modern communication systems have a large number of capabilities including integration of various communication modalities with different services. For example, instant messaging, voice/video communications, data/application sharing, white-boarding, and other forms of communication may be combined with presence and availability information of subscribers. Such systems may provide subscribers with the enhanced capabilities such as providing instructions to callers for various status categories, alternate contacts, calendar information, and comparable features.
With the advent of modern communication systems such as unified communications and the prevalent use of desktop and soft-phone based telephony, the above mentioned modalities and others are commonly utilized in two-party or multi-party communications. While these modalities provide an enriched experience to the users, they also provide different challenges and opportunities for handling communications at the system level.
This summary is provided to introduce a selection of concepts in a simplified form that are further described below in the Detailed Description. This summary is not intended to exclusively identify key features or essential features of the claimed subject matter, nor is it intended as an aid in determining the scope of the claimed subject matter.
Embodiments are directed to enabling a subscriber of an enhanced communication system to park an established multimodal conversation within the enhanced communication system and notify another subscriber through a variety of means. The other subscriber may retrieve selected or all modalities for continuing the conversation. Different modalities may be parked together or separately on servers and/or endpoints. While waiting for the other subscriber to retrieve the conversation, a participant may receive audio, video, presentation, or other forms of content as playback.
These and other features and advantages will be apparent from a reading of the following detailed description and a review of the associated drawings. It is to be understood that both the foregoing general description and the following detailed description are explanatory and do not restrict aspects as claimed.
As briefly described above, individual or all modalities of multimodal conversations may be parked and retrieved in an enhanced communication system while the parked participant is played back various content including, but not limited to, audio, video, presentations (e.g. slide presentation), file displays, and comparable ones. In the following detailed description, references are made to the accompanying drawings that form a part hereof, and in which are shown by way of illustrations specific embodiments or examples. These aspects may be combined, other aspects may be utilized, and structural changes may be made without departing from the spirit or scope of the present disclosure. The following detailed description is therefore not to be taken in a limiting sense, and the scope of the present invention is defined by the appended claims and their equivalents.
While the embodiments will be described in the general context of program modules that execute in conjunction with an application program that runs on an operating system on a personal computer, those skilled in the art will recognize that aspects may also be implemented in combination with other program modules.
Generally, program modules include routines, programs, components, data structures, and other types of structures that perform particular tasks or implement particular abstract data types. Moreover, those skilled in the art will appreciate that embodiments may be practiced with other computer system configurations, including hand-held devices, multiprocessor systems, microprocessor-based or programmable consumer electronics, minicomputers, mainframe computers, and comparable computing devices. Embodiments may also be practiced in distributed computing environments where tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote memory storage devices.
Embodiments may be implemented as a computer-implemented process (method), a computing system, or as an article of manufacture, such as a computer program product or computer readable media. The computer program product may be a computer storage medium readable by a computer system and encoding a computer program that comprises instructions for causing a computer or computing system to perform example process(es). The computer-readable storage medium can for example be implemented via one or more of a volatile computer memory, a non-volatile memory, a hard drive, a flash drive, a floppy disk, or a compact disk, and comparable media.
Throughout this specification, the term “server” generally refers to a computing device executing one or more software programs typically in a networked environment. However, a server may also be implemented as a virtual server (software programs) executed on one or more computing devices viewed as a server on the network. More detail on these technologies and example operations is provided below.
Referring to
In a unified communication (“UC”) system such as the one shown in diagram 100, users may communicate via a variety of end devices (102, 104), which are client devices of the UC system. Each client device may be capable of executing one or more communication applications for voice communication, video communication, instant messaging, application sharing, data sharing, and the like. In addition to their advanced functionality, the end devices may also facilitate traditional phone calls through an external connection such as through PBX 124 to a Public Switched Telephone Network (“PSTN”). End devices may include any type of smart phone, cellular phone, any computing device executing a communication application, a smart automobile console, and advanced phone devices with additional functionality. Moreover, a subscriber of the UC system may use more than one end device and/or communication application for facilitating various modes of communication with other subscribers. End devices may also include various peripherals coupled to the end devices through wired or wireless means (e.g. USB connection, Bluetooth® connection, etc.) to facilitate different aspects of the communication.
UC Network(s) 110 includes a number of servers performing different tasks. For example, UC servers 114 provide registration, presence, and routing functionalities. Routing functionality enables the system to route calls to a user to anyone of the client devices assigned to the user based on default and/or user set policies. For example, if the user is not available through a regular phone, the call may be forwarded to the user's cellular phone, and if that is not answering a number of voicemail options may be utilized. Since the end devices can handle additional communication modes, UC servers 114 may provide access to these additional communication modes (e.g. instant messaging, video communication, etc.) through access server 112. Access server 112 resides in a perimeter network and enables connectivity through UC network(s) 110 with other users in one of the additional communication modes. UC servers 114 may include servers that perform combinations of the above described functionalities or specialized servers that only provide a particular functionality. For example, home servers providing presence functionality, routing servers providing routing functionality, rights management servers, and so on. Similarly, access server 112 may provide multiple functionalities such as firewall protection and connectivity, or only specific functionalities.
Audio/Video (A/V) conferencing server 118 provides audio and/or video conferencing capabilities by facilitating those over an internal or external network. Mediation server 116 mediates signaling and media to and from other types of networks such as a PSTN or a cellular network (e.g. calls through PBX 124 or from cellular phone 122). Mediation server 116 may also act as a Session Initiation Protocol (SIP) user agent.
In a UC system, users may have one or more identities, which is not necessarily limited to a phone number. The identity may take any form depending on the integrated networks, such as a telephone number, a Session Initiation Protocol (SIP) Uniform Resource Identifier (URI), or any other identifier. While any protocol may be used in a UC system, SIP is a commonly used method.
SIP is an application-layer control (signaling) protocol for creating, modifying, and terminating sessions with one or more participants. It can be used to create two-party, multiparty, or multicast sessions that include Internet telephone calls, multimedia distribution, and multimedia conferences. SIP is designed to be independent of the underlying transport layer.
SIP clients may use Transport Control Protocol (“TCP”) to connect to SIP servers and other SIP endpoints. SIP is primarily used in setting up and tearing down voice or video calls. However, it can be used in any application where session initiation is a requirement. These include event subscription and notification, terminal mobility, and so on. Voice and/or video communications are typically done over separate session protocols, typically Real-time Transport Protocol (“RTP”).
A conversation as used herein refers to a multimodal communication session, where subscribers may communicate over a plurality of devices, applications, and communication modes simultaneously or sequentially. For example, two subscribers may initiate a conversation by exchanging instant messages through their desktop computers. Later, the communication may be elevated to audio and instant message with one subscriber utilizing their desktop for both modes, while the other uses the desktop computer for instant messaging and a smart phone device for the audio mode. Other subscribers may join or leave the conversation other modes and devices may be added or removed. The commonality between these communications is preserved by designating all these communications as belonging to the same conversation. Conversations may be assigned a unique identifier, which enables subscribers to view, record, modify, share, and generally manage aspects of the conversation including documents and other data associated with the conversation (e.g. documents exchanged as attachments in one mode of the conversation or recordings of other modes of the conversation).
While the example system in
In an enhanced communication system such as a unified communication system, subscribers (e.g. 236, 244) may facilitate multimodal communications 240 employing one or more end devices (e.g. 238, 242) and associated peripherals. Multimodal communication 240 may include audio, video, file sharing, desktop sharing, instant messaging, electronic mail, whiteboard sharing, and similar forms of communication. The conversation may be established and managed by one or more servers in a distributed fashion (e.g. server 234).
In this new world of unified communications, different modalities of the same conversation may be parked together as a single multimodal parked conversation and retrieved together or separately. For example, a customer may call in to a sales department of a company using audio only. The responding sales person may elevate the conversation to audio and desktop sharing. At some point during the conversation, the sale person may realize he/she needs to bring in (or transfer to) a technical expert. The sales person may park the conversation and notify a technical expert about the parked conversation. The technical expert may then retrieve the conversation using both modalities or just one and continue serving the customer.
There are several aspects of parking and retrieving multimodal conversations as illustrated in the above described example. The modalities (audio and instant messaging) may be parked together at a dedicated server (park server), at distinct dedicated servers (one park server for each modality), at multipurpose server(s) (e.g. a routing server), or even at individual endpoints of the system. The sales person may notify the technical expert through various means such as an electronic mail, an instant message, a SIP notification, a notification application of the communication system, or even voice based notification (a voice mail or audio call for example). The notification may include elements such as links to individual parked modalities such that the technical expert can select and retrieve individual modalities or a link to the entire conversation. Moreover, the notification may be directed to identified person(s) or to a group (e.g. a group instant message to the entire technical assistance group such that any available technical expert can retrieve the parked conversation).
While the conversation is parked, content in various modalities may be played back to the customer. For example, audio, video, or other forms of presentations may be provided (e.g. a slide presentation if the conversation includes video or application sharing modalities). While waiting for the technical expert, the customer may be educated on different products, on aspects of products, provided forms and other information on the offered services, and so on.
Participants in a multimodal conversation such as the one shown in diagram 200 may be part of the same network (e.g. an enterprise network), connected through different networks (e.g. in a federated environment), or communicate via a combination of secure and unsecure networks such as the Internet. Appropriate security measures such as personal identification numbers, passwords, and comparable ones may be employed to ensure privacy and security of the conversation.
Information about parked conversation(s) may be sent to or shared with email distribution lists or persistent chat sessions as well. The information may include links in form of SIP URI or URLs. While the conversation is parked, content may be played back to user 2 (354) in various modalities. A media server may be employed to provide such content. The content may include audio playback, video playback, presentation displays, data displays, and comparable ones.
An end recipient of such a parked conversation may not only be within an enterprise, but outside the enterprise such as in a federated environment, or even behind a SIP trunk. The end recipient may be able to authenticate himself/herself to retrieve the parked conversation using, for example, a shared corporate identifier that authenticates the user against a directory service.
Following the invites from the endpoints of user 2, new sessions (audio and instant message) are established between the park server 466 and the endpoints of user 2 (464, 465) preserving the conversation identifier. In this mode, user 2 may be provided playback content as discussed previously. In the meantime, park server 466 provides location identifier for the parked conversation (e.g. as a SIP URI or Uniform Resource Locator ‘URL’) to user 1 and user 2. User 1 sends a notification message to user 3 (468) with the received SIP URI for the parked conversation.
User 3 (468) selects a modality (audio call in this example) by activating a link for the audio modality in the notification message. Subsequently, an invite is sent to the endpoint of user 2 (464) associated with the selected modality and the conversation continues in the selected modality between user 2 and user 3.
The above discussed scenarios, example systems, conversation modalities, and configurations are for illustration purposes. Embodiments are not restricted to those examples. Other forms of notifications, configurations, communication modes, and scenarios may be used in implementing multimodal conversations with parking and retrieval capability in a similar manner using the principles described herein.
User interface 500 is an example parked conversation invite. It includes graphic representations of current modalities in the parked conversation (572) and graphic/textual options to select acceptance of rejection by the invited user (574). The acceptance may also be accomplished by selecting one or more of the graphic representations of the available communication modes.
UI element 576 displays such selected communication modes for individual selection. Further information may be displayed by the user interface such as who parked the call (578) and conversation participant information 582 (name, address, any other pertinent information).
A user interface for notifying a subscriber about a parked call may include additional or fewer textual and graphical elements, and may employ various graphical, color, and other configuration schemes to display different functionalities. Other notification methods such as those described above may also be employed with additional or fewer elements as discussed herein.
As discussed above, modern communication technologies such as UC services enable subscribers to utilize a wide range of computing device and application capabilities in conjunction with communication services. This means, a subscriber may use one or more devices (e.g. a regular phone, a smart phone, a computer, a smart automobile console, etc.) to facilitate communications. Depending on the capabilities of each device and applications available on each device, additional services and communication modes may be enabled.
Client devices 611-613 are used to facilitate communications through a variety of modes between subscribers of the communication system. One or more of the servers 618 may be used to park (and subsequently retrieve) all or some of the modalities of an established conversation. Information associated with subscribers and facilitating multimodal conversations, as well as multimodal content for playback, may be stored in one or more data stores (e.g. data store 616), which may be managed by any one of the servers 618 or by database server 614.
Network(s) 610 may comprise any topology of servers, clients, Internet service providers, and communication media. A system according to embodiments may have a static or dynamic topology. Network(s) 610 may include a secure network such as an enterprise network, an unsecure network such as a wireless open network, or the Internet. Network(s) 610 may also coordinate communication over other networks such as PSTN or cellular networks. Network(s) 610 provides communication between the nodes described herein. By way of example, and not limitation, network(s) 610 may include wireless media such as acoustic, RF, infrared and other wireless media.
Many other configurations of computing devices, applications, data sources, and data distribution systems may be employed to implement a communication system with multimodal conversation parking and retrieval. Furthermore, the networked environments discussed in
Communication application 722 may be part of a service that facilitates communication through various modalities between client applications, servers, and other devices. Park management module 724 may enable client applications to park some or all of the modalities of established conversations, notify other client applications about the parked conversation, and enable other subscribers to retrieve one or more modalities of the parked conversation. As discussed previously, park management module may coordinate the notification with other applications such as an electronic mail application, an instant message application, and similar ones. According to some embodiments, park management module 724 may also facilitate play back of content to participant(s) of the parked conversation in various modalities while the conversation is parked. This basic configuration is illustrated in
Computing device 700 may have additional features or functionality. For example, the computing device 700 may also include additional data storage devices (removable and/or non-removable) such as, for example, magnetic disks, optical disks, or tape. Such additional storage is illustrated in
Computing device 700 may also contain communication connections 716 that allow the device to communicate with other devices 718, such as over a wireless network in a distributed computing environment, a satellite link, a cellular link, and comparable mechanisms. Other devices 718 may include computer device(s) that execute communication applications, other directory or policy servers, and comparable devices. Communication connection(s) 716 is one example of communication media. Communication media can include therein computer readable instructions, data structures, program modules, or other data in a modulated data signal, such as a carrier wave or other transport mechanism, and includes any information delivery media. The term “modulated data signal” means a signal that has one or more of its characteristics set or changed in such a manner as to encode information in the signal. By way of example, and not limitation, communication media includes wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, RF, infrared and other wireless media.
Example embodiments also include methods. These methods can be implemented in any number of ways, including the structures described in this document. One such way is by machine operations, of devices of the type described in this document.
Another optional way is for one or more of the individual operations of the methods to be performed in conjunction with one or more human operators performing some. These human operators need not be collocated with each other, but each can be only with a machine that performs a portion of the program.
Process 800 begins with operation 810, where a multimodal conversation is facilitated. As discussed previously, the multimodal conversation may include a number of modalities such as voice, video, electronic mail, instant messaging, application sharing, data sharing, whiteboard sharing, and so on. The conversation may include two or more participants and be initiated by any one of the participants.
At operation 820, one of the participants parks the conversation such that another party can be enabled to join the conversation. Different modalities of the conversation may be parked together or individually (or in groups) at dedicated servers, multi-purpose servers, or even endpoints of the enhanced communication system. The modalities may be identified as belonging together by the conversation identifier (which may be a numeric value, an alphanumeric value, or other symbol).
While the conversation is parked, different modalities of content may be played back to the parked participants at optional operation 830. Such content may include audio, video, presentations, or other forms of displayed data. According to other embodiments, one or more modalities of the conversation may continue, while remaining modalities are parked. For example, in a conversation containing audio, data sharing, and instant messaging, only the audio and data sharing modalities may be parked and the instant messaging modality continue to be facilitated while the parking participant notifies another subscriber of the system to join the conversation.
At operation 840, the parking participant notifies one or more subscribers of the enhanced communication system to join (or take over) the conversation. The notification may be in the form of an electronic mail, an instant message, a SIP notification, or other forms. The notification(s) may include links to the parked modalities of the conversation. The links may enable the other subscriber(s) to join (or take over) the conversation by activating all parked modalities or by activating only selected modalities of the conversation (e.g. based on the capabilities of the notified subscriber, a preference of the notified subscriber, etc.) at operation 850. Once the other subscriber joins the conversation by providing an indication of selected modalities, it may continue to be facilitated employing the selected modalities.
The operations included in process 800 are for illustration purposes. A communication service with multimodal conversation parking and retrieval capability may be implemented by similar processes with fewer or additional steps, as well as in different order of operations using the principles described herein.
The above specification, examples and data provide a complete description of the manufacture and use of the composition of the embodiments. Although the subject matter has been described in language specific to structural features and/or methodological acts, it is to be understood that the subject matter defined in the appended claims is not necessarily limited to the specific features or acts described above. Rather, the specific features and acts described above are disclosed as example forms of implementing the claims and embodiments.
Ramanathan, Rajesh, Stucker, Brian
Patent | Priority | Assignee | Title |
Patent | Priority | Assignee | Title |
6275701, | Feb 25 1997 | Unwired Planet, LLC | Method and device in a mobile telecommunications system |
6650748, | Apr 13 1998 | AVAYA Inc | Multiple call handling in a call center |
8121282, | Apr 19 2007 | Cisco Technology, Inc. | Call park/retrieve using SIP |
20040006475, | |||
20060217133, | |||
20070033249, | |||
20070153770, | |||
20070165554, | |||
20070297581, | |||
20080096553, | |||
20080130848, | |||
20080282261, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
May 13 2009 | Microsoft Corporation | (assignment on the face of the patent) | / |
Date | Maintenance Fee Events |
Date | Maintenance Schedule |
Oct 30 2015 | 4 years fee payment window open |
Apr 30 2016 | 6 months grace period start (w surcharge) |
Oct 30 2016 | patent expiry (for year 4) |
Oct 30 2018 | 2 years to revive unintentionally abandoned end. (for year 4) |
Oct 30 2019 | 8 years fee payment window open |
Apr 30 2020 | 6 months grace period start (w surcharge) |
Oct 30 2020 | patent expiry (for year 8) |
Oct 30 2022 | 2 years to revive unintentionally abandoned end. (for year 8) |
Oct 30 2023 | 12 years fee payment window open |
Apr 30 2024 | 6 months grace period start (w surcharge) |
Oct 30 2024 | patent expiry (for year 12) |
Oct 30 2026 | 2 years to revive unintentionally abandoned end. (for year 12) |