One embodiment of the present invention provides a system for calculating prominence of an email with regard to a user. During operation, the system determines an importance value associated with an email participant in the user's conversations, wherein the email participant is an email sender and/or recipient other than the user. Next, the system calculates a prominence value associated with a received email based upon at least the importance values associated with the email participants in the received email.
1. A computer-implemented method for calculating prominence of an email with regard to a user, the method comprising:
computing, by a computer, a decay factor $e^{-t\gamma}$ associated with an email participant ep, wherein t is the amount of time since a previous email was sent to or received by the email participant, γ is a decay constant, and the email participant is an email sender and/or recipient other than the user;
computing a conversation weight cw(ep) associated with the email participant;
computing an importance value I(ep) associated with the email participant by determining a decayed conversation weight for the email participant according to an equation $I(ep) = e^{-t\gamma}\,cw(ep)$; and
subsequent to computing the importance value, calculating a prominence value associated with a received email based upon the computed importance value associated with the email participant in the received email.
10. A non-transitory computer-readable storage medium storing instructions that when executed by a computer cause the computer to perform a method for calculating prominence of an email with regard to a user, the method comprising:
computing a decay factor $e^{-t\gamma}$ associated with an email participant ep, wherein t is the amount of time since a previous email was sent to or received by the email participant, γ is a decay constant, and the email participant is an email sender and/or recipient other than the user;
computing a conversation weight cw(ep) associated with the email participant;
computing an importance value I(ep) associated with the email participant by determining a decayed conversation weight for the email participant according to an equation $I(ep) = e^{-t\gamma}\,cw(ep)$; and
subsequent to computing the importance value, calculating a prominence value associated with a received email based upon the computed importance value associated with the email participant in the received email.
18. A system for calculating prominence of an email with regard to a user, the system comprising:
a decay factor computing mechanism configured to compute a decay factor $e^{-t\gamma}$ associated with an email participant ep, wherein t is the amount of time since a previous email was sent to or received by the email participant, γ is a decay constant, and the email participant is an email sender and/or recipient other than the user;
a weight-computing mechanism configured to compute a conversation weight cw(ep) associated with the email participant;
an importance-value computing mechanism configured to compute an importance value I(ep) associated with the email participant by determining a decayed conversation weight for the email participant according to an equation $I(ep) = e^{-t\gamma}\,cw(ep)$; and
a calculation mechanism configured to, subsequent to computing the importance value, calculate a prominence value associated with a received email based upon the computed importance value associated with the email participant in the received email.
2. The method of
cw(ep)=max[init(ep),F(ep)]; wherein:
init(ep) is assigned based on the email participant's domain, rw(c) is a recipient weight of a conversation c, cw(c) is a contribution weight of the conversation c, tw(c) is a temporal weight of the conversation c, and conv(ep) is a conversation in which both the email participant ep and the user are the recipient of at least one message and both the email participant ep and the user send at least one message.
3. The method of
4. The method of
5. The method of
a recipient weight indicating a number of recipients in the conversation other than the user and the email participant;
a contribution weight indicating the number of emails sent by the email participant in the conversation; and
a temporal weight indicating an average duration between messages in the conversation.
6. The method of
an importance value associated with the sender;
an average of the importance values associated with the recipients other than the user;
a timestamp of the email;
a domain name of the email sender; and
additional email features including direct address, request, attachment, and scheduling information contained in the received email.
7. The method of
allowing the user to inspect and modify the prominence value associated with a received email.
8. The method of
applying machine-learning techniques by examples to compute the importance value associated with the email participant and calculate the prominence value associated with the received email.
9. The method of
11. The computer-readable storage medium of
12. The computer-readable storage medium of
13. The computer-readable storage medium of
a recipient weight indicating a number of recipients in the conversation other than the user and the email participant;
a contribution weight indicating the number of emails sent by the email participant in the conversation; and
a temporal weight indicating an average duration between messages in the conversation.
14. The computer-readable storage medium of
an importance value associated with the sender;
an average of the importance values associated with the recipients other than the user;
a timestamp of the email;
a domain name of the email sender; and
additional email features including direct address, request, attachment, and scheduling information contained in the received email.
15. The computer-readable storage medium of
allowing the user to inspect and modify the prominence value associated with a received email.
16. The computer-readable storage medium of
applying machine-learning techniques by examples to compute the importance value associated with the email participant and calculate the prominence value associated with the received email.
17. The computer-readable storage medium of
19. The system of
1. Field
This disclosure is generally related to email processing. More specifically, this disclosure is related to calculating prominence values of emails and email participants.
2. Related Art
Email has become an indispensable part of today's information economy. Employees often spend a substantial part of their workday plodding through mountains of email messages whose subject matter can range from the utterly trivial to the extremely important. A fair amount of research has investigated how people perceive the importance of email and email senders/receivers.
One technique to evaluate email importance is based on user surveys and feedback collected from users on their actions taken on the emails, such as response and attachment. This technique is derived from the finding that perceived email importance and reply probability are related to each other. Early results give a good indication of correlations between specific factors and perceived importance. Although a linear regression model for showing correlations can be used for the prediction, the input factors (e.g., “Action request”) are hand-labeled and their derivation is not automatic.
Some recent work has proposed approaches for email prioritization based on automatically derived social network information. For example, an email message from a sender may be assigned a high importance if the recipient frequently receives emails from the sender. However, this technique, which is based on social network features, requires a large number of emails and substantial computational resources to derive higher-level social network features.
One embodiment of the present invention provides a system for calculating prominence of an email with regard to a user. During operation, the system determines an importance value associated with an email participant in the user's conversations, wherein the email participant is an email sender and/or recipient other than the user. Next, the system calculates a prominence value associated with a received email based upon at least the importance values associated with the email participants in the received email.
In a variation on this embodiment, the importance value associated with the email participant and the prominence value associated with a received email are between zero and one, wherein zero indicates the lowest importance and one indicates the highest importance.
In a variation on this embodiment, the system determines the importance value associated with the email participant by determining a conversation weight for the email participant subject to a decay corresponding to an amount of time since a previous email was sent to or received by the email participant.
In a further variation, the system determines the conversation weight for the email participant by determining a number of conversations in which both the email participant and the user have participated.
In a further variation, the system determines the conversation weight for the email participant in each conversation by determining one or more of: a recipient weight indicating a number of recipients in the conversation other than the user and the email participant, a contribution weight indicating the number of emails sent by the email participant in the conversation, and a temporal weight indicating an average duration between messages in the conversation.
In a variation on this embodiment, the system calculates the prominence value associated with the received email by determining one or more of: an importance value associated with the sender, an average of the importance values associated with the recipients other than the user, a timestamp of the email, a domain name of the email sender, and additional email features including direct address, request, attachment, and scheduling information contained in the received email.
In a variation on this embodiment, the system allows the user to inspect and modify the prominence value associated with a received email.
In a variation on this embodiment, the system applies machine-learning techniques by examples to determine the importance value associated with the email participant and calculate the prominence value associated with the received email.
In a variation on this embodiment, the system allows the user to customize the calculation of the prominence value by providing feedback and/or by inspecting one or more factors used for calculating the prominence value.
In the figures, like reference numerals refer to the same figure elements.
The following description is presented to enable any person skilled in the art to make and use the embodiments, and is provided in the context of a particular application and its requirements. Various modifications to the disclosed embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be applied to other embodiments and applications without departing from the spirit and scope of the present disclosure. Thus, the present invention is not limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features disclosed herein.
Overview
Embodiments of the present invention provide a solution for calculating prominence or importance values associated with emails and email participants with regard to a user. In one embodiment of the present invention, the email-prominence calculation system first determines an importance value associated with an email participant in the user's conversations, wherein the email participant is an email sender and/or recipient other than the user. Next, the system calculates a prominence value associated with a received email based upon at least the importance values associated with the email participants in the received email.
Conventional methods to determine email prominence have relied on user surveys and feedback on the actions taken on the emails, such as response and attachment. This technique is derived from the finding that perceived email importance and reply probability are related to each other. However, it is often hard to derive operational formulas to calculate email prominence from such approaches. Some recent work has proposed approaches for email prioritization based on automatically derived social network information, which require a large number of emails and substantial computational resources to derive higher-level social network features. To overcome these issues, an email-prominence calculation method is proposed in embodiments of the present invention to provide an operational formula that incorporates both email-specific features and simple social network cues. In this disclosure, the terms “prominence” and “importance” are used interchangeably.
The email-prominence calculation system also provides a graphical user interface that allows the user to inspect and modify the prominence values associated with email participants as well as the received emails from the participants.
Importance of Email Participants
In embodiments of the present invention, the system calculates the importance value of an email participant based on the number of conversations in which the participant has been involved and the contributions the participant has made. The email participants could be chosen from the email senders and recipients associated with a user's email account. It is assumed that the user's email has been organized into conversations, with each conversation consisting of email communications under a separate topic. To derive a higher-level communicative model and simplify the processing, a participant's conversation weight is calculated based on his/her activity in the conversations. The importance value associated with an email participant is then defined as the participant's decayed conversation weight.
The formula for calculating the importance value relies on the following definitions and parameters:
Based on the above definitions and parameters, a conversation weight cw for an email participant ep can be defined as:
$$cw(ep) = \max[\,init(ep),\ F(ep)\,],$$
where
$$init(ep) = I_0\, e^{-m\delta}$$
$$init(ep) = I_1\, e^{-m\delta}$$
$$init(ep) = I_2\, e^{-m\delta}$$
m is the number of emails received by the email participant ep. The initial conversation weight init(ep) is assigned based on the participant's domain to bootstrap a new email participant with a reasonable importance value. In the definition of the conversation weight, a maximum operation between init(ep) and F(ep) ensures that if the conversation count for an email participant increases to 1, the conversation weight equals F(ep) so that more frequent email exchanges do not adversely affect the importance of the email participant.
Finally, the importance value I(ep) associated with the email participant ep is defined as the decayed conversation weight of the email participant:
$$I(ep) = e^{-t\gamma}\, cw(ep)$$
where t is the amount of time since a previous email was sent to or received by the email participant, and γ is the decay constant. Exemplary constant parameters could be set to α = 0.01, β = 0.01, γ = 0.00223143, $I_0 = 0.5$, $I_1 = 0.4$, $I_2 = 0.3$, and $\delta_1 = \delta_2 = \delta_3 = 0.13862944$.
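The importance calculation above can be illustrated with a minimal Python sketch. It uses the exemplary constants quoted in the text; the function names, the `domain_class` index used to pick $I_0$, $I_1$, or $I_2$, and the choice to pass F(ep) in as a precomputed number are assumptions made for illustration only. The time unit of t is not stated in the text and is left to the caller.

```python
import math

# Exemplary constants quoted in the text.
GAMMA = 0.00223143                 # decay constant (gamma)
DELTA = 0.13862944                 # delta (the text sets delta_1 = delta_2 = delta_3 to this value)
INITIAL_WEIGHTS = (0.5, 0.4, 0.3)  # I_0, I_1, I_2


def init_weight(domain_class: int, emails_received: int, delta: float = DELTA) -> float:
    """init(ep) = I_k * exp(-m * delta).

    domain_class (0, 1 or 2) is a hypothetical index selecting I_0, I_1 or I_2;
    how a participant's domain maps to that index is not given in the text.
    """
    return INITIAL_WEIGHTS[domain_class] * math.exp(-emails_received * delta)


def conversation_weight(init_value: float, f_value: float) -> float:
    """cw(ep) = max[init(ep), F(ep)]; F(ep) is supplied by the caller."""
    return max(init_value, f_value)


def importance(cw: float, elapsed: float, gamma: float = GAMMA) -> float:
    """I(ep) = exp(-t * gamma) * cw(ep), with t = elapsed."""
    return math.exp(-elapsed * gamma) * cw
```

With these constants, the decay factor falls to roughly 0.8 after 100 time units, and the initial weight roughly halves after five received emails.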
Prominence of Emails
In embodiments of the present invention, the system calculates a prominence value associated with a received email based on the importance values associated with the email participants and extracted email features. The importance values and extracted email features include: the importance value associated with the email sender, the average of the importance values associated with the email recipients other than the user, the timestamp of the email, the domain name of the email sender, and additional email features, such as direct address, request, attachment, and scheduling information contained in the received email. The formula for calculating the prominence value P(m) associated with email m is defined as:
$$P(m) = s(m) \cdot r(m) \cdot rc(m) \cdot c(m) \cdot d(m) \cdot sd(m),$$
where the parameters used in the formula are:
where
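The product form of P(m) can be sketched in Python as follows. The sketch treats the six factors s, r, rc, c, d, and sd as already-computed numbers; how each factor is derived from an email is not reproduced here, and the `prominence` function name and the dictionary-based interface are assumptions for illustration.

```python
from math import prod
from typing import Mapping

# Factor names as they appear in the formula for P(m).
FACTOR_NAMES = ("s", "r", "rc", "c", "d", "sd")


def prominence(factors: Mapping[str, float]) -> float:
    """Return P(m) as the product of the six per-email factors.

    `factors` maps each factor name to its precomputed value for one email.
    """
    missing = [name for name in FACTOR_NAMES if name not in factors]
    if missing:
        raise KeyError(f"missing factors: {missing}")
    return prod(factors[name] for name in FACTOR_NAMES)
```

Because the formula is a pure product, any single factor close to zero pulls the whole prominence value toward zero, regardless of the other factors.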
Optimization and Customization
In one embodiment, the system can apply supervised machine learning to calculate the importance values associated with the email participants and the prominence values associated with the received emails. Supervised learning is the task of inferring algorithm parameters from supervised training data consisting of a set of training examples. In order to improve the prominence calculation, the system collects user feedback which indicates whether non-important emails or email participants are falsely included, and/or whether important emails or email participants are mistakenly omitted. The user feedback provides training data for the supervised machine learning, so that the supervised machine-learning algorithm may analyze the user feedback and infer a better set of parameters for calculation. The inferred classification rules can be used in calculating prominence values for future emails and email participants.
A supervised learning algorithm analyzes the training data to extract features or properties of the data and to improve the existing formula. More details on supervised machine learning are available in the published literature, such as “Introduction to Machine Learning,” by Ethem Alpaydin, 2nd Ed., The MIT Press, 2010, the disclosure of which is incorporated by reference in its entirety herein.
Customization in calculating prominence values is also feasible using the user's feedback. User contextual information such as user location, social context from emails, time information, and user tasks can also be applied to further customize the calculation. For example, a graphical user interface can be provided for the user to inspect the factors introduced to the formula and to specify customized weights for each factor. In contrast to the training-by-example method, a white-box GUI-based refinement to the initial formula allows the user to quickly customize and improve prominence calculation results, which can accelerate the adoption process.
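One possible way to apply user-specified weights to a multiplicative formula is sketched below in Python. The text does not say how customized weights enter the calculation; applying each weight as an exponent on its factor is purely an assumption chosen for illustration, as are the function name and the default weight of 1.

```python
from typing import Mapping


def customized_prominence(
    factors: Mapping[str, float],
    weights: Mapping[str, float],
) -> float:
    """Combine factors as a product of factor ** weight.

    Assumption (not from the text): a weight of 1 leaves a factor unchanged,
    a weight of 0 removes its influence, and a weight above 1 strengthens it.
    Factors without an explicit weight default to a weight of 1.
    """
    result = 1.0
    for name, value in factors.items():
        result *= value ** weights.get(name, 1.0)
    return result
```

With an empty weight mapping the result equals the plain product of the factors, so a user who changes nothing would see the unmodified prominence values.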
Exemplary Computer System
The data structures and code described in this detailed description are typically stored on a computer-readable storage medium, which may be any device or medium that can store code and/or data for use by a computer system. The computer-readable storage medium includes, but is not limited to, volatile memory, non-volatile memory, magnetic and optical storage devices such as disk drives, magnetic tape, CDs (compact discs), DVDs (digital versatile discs or digital video discs), or other media capable of storing computer-readable media now known or later developed.
The methods and processes described in the detailed description section can be embodied as code and/or data, which can be stored in a computer-readable storage medium as described above. When a computer system reads and executes the code and/or data stored on the computer-readable storage medium, the computer system performs the methods and processes embodied as data structures and code and stored within the computer-readable storage medium.
Furthermore, methods and processes described herein can be included in hardware modules or apparatus. These modules or apparatus may include, but are not limited to, an application-specific integrated circuit (ASIC) chip, a field-programmable gate array (FPGA), a dedicated or shared processor that executes a particular software module or a piece of code at a particular time, and/or other programmable-logic devices now known or later developed. When the hardware modules or apparatus are activated, they perform the methods and processes included within them.
The foregoing descriptions of various embodiments have been presented only for purposes of illustration and description. They are not intended to be exhaustive or to limit the present invention to the forms disclosed. Accordingly, many modifications and variations will be apparent to practitioners skilled in the art. Additionally, the above disclosure is not intended to limit the present invention.
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Sep 30 2011 | BRDICZKA, OLIVER | Palo Alto Research Center Incorporated | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 027028 | /0382 | |
Oct 05 2011 | Palo Alto Research Center Incorporated | (assignment on the face of the patent) | / | |||
Apr 16 2023 | Palo Alto Research Center Incorporated | Xerox Corporation | CORRECTIVE ASSIGNMENT TO CORRECT THE REMOVAL OF US PATENTS 9356603, 10026651, 10626048 AND INCLUSION OF US PATENT 7167871 PREVIOUSLY RECORDED ON REEL 064038 FRAME 0001 ASSIGNOR S HEREBY CONFIRMS THE ASSIGNMENT | 064161 | /0001 | |
Apr 16 2023 | Palo Alto Research Center Incorporated | Xerox Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 064038 | /0001 | |
Jun 21 2023 | Xerox Corporation | CITIBANK, N A , AS COLLATERAL AGENT | SECURITY INTEREST SEE DOCUMENT FOR DETAILS | 064760 | /0389 | |
Nov 17 2023 | Xerox Corporation | JEFFERIES FINANCE LLC, AS COLLATERAL AGENT | SECURITY INTEREST SEE DOCUMENT FOR DETAILS | 065628 | /0019 | |
Feb 06 2024 | Xerox Corporation | CITIBANK, N A , AS COLLATERAL AGENT | SECURITY INTEREST SEE DOCUMENT FOR DETAILS | 066741 | /0001 | |
Feb 06 2024 | CITIBANK, N A , AS COLLATERAL AGENT | Xerox Corporation | TERMINATION AND RELEASE OF SECURITY INTEREST IN PATENTS RECORDED AT RF 064760 0389 | 068261 | /0001 |