Techniques are described for analyzing user-supplied information, including to predict future aspects of additional information that will be supplied by users. The analyzed information may include distributed group discussions involving numerous users and occurring via user comments and other content items supplied to social networking site(s) and/or other computer-accessible sites. The techniques further include displaying information to users about analyzed and/or predicted information for one or more categories and/or topics or other attributes, to reflect corresponding constituent content items for those categories/topics/attributes for one or more time periods—the information display may further occur in a real-time or near-real-time manner with respect to corresponding events or activities to which the information corresponds (such as within seconds, minutes or hours), and may display information about total quantity of constituent content items, rate of change in total quantity, aggregate sentiment, etc.
| 
 | 33.  A non-transitory computer-readable medium having stored contents that, when executed, configure a computing system to perform a method, the method comprising:
 obtaining, by the configured computing system, information about a distributed group discussion having a plurality of content items submitted by a plurality of users related to multiple topics; determining, by the configured computing system, multiple visual aspects to use to display information to one or more users about the multiple topics, including at least one sizes of displayed items or vertical locations of displayed items; generating, by the configured computing system, information for display to the one or more users, the generated information including, for each of the multiple topics, at least one summarization that is based on a subset of the obtained information corresponding to the topic and that is represented using at least one of the multiple visual aspects; and providing, by the configured computing system, the generated information for display to the one or more users. 38.  A configured system comprising:
 one or more hardware processors of one or more computing systems; and one or more modules that are configured to, when executed by at least one of the one or more hardware processors, initiate display of summarized information about a distributed group discussion having a plurality of content items submitted by a plurality of users, by:
 obtaining information about one or more predictions regarding future content items that will be submitted for the distributed group discussion; determining multiple visual aspects to use to display information to one or more users about multiple topics indicated in the plurality of content items; generating information for display to the one or more users that is based at least in part on the one or more predictions, the generated information including, for each of the multiple topics, at least one summarization that is based on content items of the distributed group discussion corresponding to the topic and that is represented using at least one of the multiple visual aspects; and providing the generated information for display to the one or more users. 6.  A computer-implemented method comprising:
 obtaining, by one or more configured computing systems of a group discussion prediction service, information about a distributed group discussion that involves a plurality of users submitting a plurality of content items for the distributed group discussion, the obtained information including information about one or more predictions by the group discussion prediction service regarding future content items that will be submitted for the distributed group discussion; selecting, by the one or more configured computing systems, multiple factors to use in summarizing information about the distributed group discussion; determining, by the one or more configured computing systems, multiple visual aspects to use to display information about the multiple selected factors, wherein the multiple visual aspects include sizes of displayed items and vertical locations of displayed items, and wherein each of the multiple selected factors is associated with at least one of the multiple visual aspects; generating, by the one or more configured computing systems, information for display to one or more users that is based at least in part on the one or more predictions, the generated information including, for each of the multiple selected factors, a summarization that is represented using the at least one visual aspect associated with the selected factor and that is based on at least some of the obtained information corresponding to the selected factor; and providing, by the one or more configured computing systems, the generated information for display to the one or more users. 1.  A computer-implemented method comprising:
 obtaining, by one or more configured computing systems of a group discussion prediction service, information about a distributed group discussion involving a plurality of users, wherein the obtained information includes first information about a plurality of content items already submitted by the plurality of users during one or more time periods and includes second information about one or more predictions by the group discussion prediction service regarding future content items that will be submitted by users for the distributed group discussion during one or more future time periods; selecting, by the one or more configured computing systems, multiple factors to use in summarizing the obtained information for each of multiple time periods that include the one or more time periods and the one or more future time periods, wherein the multiple selected factors include a total quantity of content items supplied during each of the multiple time periods and include a rate of change in total quantity of content items supplied between two or more of the multiple time periods; determining, by the one or more configured computing systems, multiple visual aspects to use to display information about the multiple selected factors to one or more users, wherein each of the multiple selected factors is associated with one of the multiple visual aspects, and wherein the multiple visual aspects include sizes of displayed items, vertical locations or movements of displayed items, halos or shadows associated with displayed items, and colors of displayed item; generating, by the one or more configured computing systems, information for display that is based on at least some of the first information and on at least some of the second information, the generated information including, for each of the multiple selected factors, a summarization that is represented using the associated one visual aspect for the selected factor and that is based on a subset of the obtained information that corresponds to the selected factor; and initiating, by the one or more configured computing systems and before the one or more future time periods, display of the generated information to the one or more users. 2.  The method of  3.  The method of  4.  The method of  5.  The method of  7.  The method of  8.  The method of  9.  The method of  10.  The method of  11.  The method of  12.  The method of  13.  The method of  14.  The method of  15.  The method of  16.  The method of  17.  The method of  18.  The method of  19.  The method of  20.  The method of  21.  The method of  22.  The method of  23.  The method of  24.  The method of  25.  The method of  26.  The method of  27.  The method of  28.  The method of  29.  The method of  30.  The method of  31.  The method of  32.  The method of  34.  The non-transitory computer-readable medium of  35.  The non-transitory computer-readable medium of  36.  The non-transitory computer-readable medium of  37.  The non-transitory computer-readable medium of  39.  The system of  40.  The system of  41.  The system of  42.  The system of  | |||||||||||||||||||||||||||
This application claims the benefit of U.S. Provisional Patent Application No. 61/775,379, filed Mar. 8, 2013 and entitled “Displaying Information About Distributed Group Discussions,” which is hereby incorporated by reference in its entirety.
The following disclosure relates generally to techniques for displaying information corresponding to user-supplied information, such as to display information about assessments of distributed group discussions and/or about predicted future aspects of the discussions.
Discussions of current topics of interest by various distributed users are increasingly occurring via social networking sites and other computer-accessible sites that are available to the users, whether by users replying to and commenting on information supplied by other users, or by various users independently submitting information that in aggregate reflects topics of interest. In some situations, a particular distributed group discussion may occur between a particular selected subset of users (e.g., users that are members of a common group, club, service, etc.), while in other situations a distributed group discussion may include any user with access to one or more sites via which that discussion occurs.
Information about discussions that have occurred and/or are occurring may have value to various types of entities. However, various problems exist with obtaining and disseminating such information in a timely manner, including due to the distributed nature of the discussions.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
Techniques are described for analyzing user-supplied information in various ways, including in some embodiments to predict future aspects of additional related information that is expected to be supplied by users, such as during one or more future time periods. In at least some embodiments, the user-supplied information that is analyzed corresponds to distributed group discussions that involve numerous users and occur via user comments made to one or more computer-accessible sites, such as one or more social networking services. The analysis of user-supplied information may, in at least some embodiments, include determining particular topics that are being discussed for a specified category during one or more periods of time, quantifying an amount of user interest in particular topics and the category during the period of time from the discussion, and predicting future amounts of user interest in the particular topics and the category as part of an expected future discussion during one or more future period of times. As one illustrative example, millions of user Twitter tweets, user posts to social networking sites, and other user-supplied comments or other content items may be analyzed over a prior window of time (e.g., the last 72 hours) in a real-time or near-real-time manner, and changes in information corresponding to particular topics and categories of interest may be automatically predicted for a future span of time (e.g., the next 24 hours), with various types of further actions then being taken based on the predicted information. Additional details related to analyzing user-supplied information in particular manners are described below, and some or all of the described techniques are performed in at least some embodiments by automated operations of a Group Discussion Prediction (“GDP”) system.
The user comments and other user-supplied information that are obtained and analyzed may have various forms in various embodiments, such as to reflect information that is supplied by human users and made available to other users. For example, the user-supplied information may, in at least some embodiments, include posts and other user submissions to sources of such user-supplied information (generally referred to as “comment data sources” herein) that include one or more social networking services or sites, such as tweets to the Twitter service, and/or posts to Facebook, MySpace, Google+, LinkedIn, etc. In addition, in at least some embodiments and situations, the user-supplied information may include comments or other content items submitted to various other types of computer-accessible sites that may act as comment data sources, such as Instagram, Pinterest, Flickr, Picasa, YouTube, 43 Things, etc. In other embodiments and situations, any type of user-supplied information that is made available to one or more other users may be analyzed and used, including in text form, audio form and/or video form, such as user emails, SMS (“Short Message Service”) messages, Internet chat messages, telephone voice messages and other audio recordings, video recordings, search engine searches, news articles and other news releases, commerce-related submissions (e.g., user wish lists or gift registries, user shopping carts, etc.), Wiki-based submissions (e.g., to Wikipedia), etc., and is generally referred to herein as a “user-supplied comment” or “user comment.”
By gathering such user-supplied information during one or more time periods of interest, information may be assessed about distributed group discussions that occur during those time periods, including for discussions in which users actively review and respond to comments of others, and more generally for discussions that occur via various user comments that each reflects independent thoughts or opinions of a user that is not in response to any other particular user comment of another user. Such assessed information may correspond to one or more selected factors about one or more distributed group discussions, including one or more of the following factors for user comments or other content items being supplied with respect to one or more topics of interest: a quantity of user comments or other content items for the topic(s) that are supplied during a time period (or another measure of distributed user interest in those topics rather than content item quantity); subsets of such an overall quantity (or other measure of distributed user interest) supplied during a time period for the topic(s) that are associated with particular geographic locations or geographic areas, and/or with particular author users, and/or with particular content item data sources; changes in assessed values for one or more such factors between two or more time periods; etc. In addition, such assessed information corresponding to one or more selected factors for the topic(s) may be used to predict future information about user comments or other content items that will be supplied for the topic(s), such as predicted future values for a particular one or more of the selected factors for each of one or more future time periods, or instead for other types of predicted future information. Additional details related to assessing values for factors of interest and using such information to make corresponding predictions are included below.
In some embodiments, a variety of types of information may be displayed or otherwise provided to users or other entities about analyzed information and/or predicted information for one or more categories and/or topics or other attributes of interest, such as to reflect constituent content items that are associated with those categories, topics and/or attributes for one or more time periods—in at least some such embodiments, the information display or other providing may occur in a real-time or near-real-time manner with respect to corresponding events or activities to which the information corresponds (such as within seconds, minutes or hours of such events or activities). A display of information may, for example, indicate multiple categories, and visually indicate information about relative or absolute values for respective categories based on each of multiple selected factors, such as based on use of one or more of the following information for a displayed representation of a category: size, horizontal location, vertical location, color, an associated halo (or shadow), horizontal movement, vertical movement, displayed tags or other visual indicators, etc. —as one example, such information may be provided for a current time period and/or next future time period and for selected factors that include total quantity of constituent content items, rate of change in total quantity, and aggregate sentiment. A display of information may, for example, indicate multiple comment groups that each represents an associated topic or other attribute of constituent content items in the comment group (whether instead of or in addition to one or more category indicators), and visually indicate information about relative or absolute values for respective comment groups based on each of multiple selected factors, such as based on use of one or more of the following information for a displayed representation of a category: size, horizontal location, vertical location, color, an associated halo, horizontal movement, vertical movement, displayed tags or other visual indicators, etc. —as one example, such information may be provided for a current time period and/or next future time period and for selected factors that include total quantity of constituent content items, rate of change in total quantity, and aggregate sentiment. Additional details related to such display of information about analyzed aspects of distributed group discussions and/or about further predicted aspects of the discussions is included herein, including with respect to 
As part of the operation of the GDP service 150, it may further analyze the user-supplied content items to identify particular topics and other attributes of the content items, and to store corresponding information 165. The content item information 160 and topics information 165 may further be used to create corresponding comment groups, and to store corresponding information 170. The GDP service 150 may also use the comment group information 170 to determine particular topics that are associated with a particular category during one or more periods of time, such as based on the created comment groups associated with those topics, and then store corresponding category information 175. The GDP service 150 may further quantify the user-supplied content items included in a particular comment group and/or associated with a particular category, and use such information for one or more time periods to predict expected additional content items that will be received over one or more later time periods, with corresponding information being stored with the comment group information 170 and/or the category information 175—in some embodiments, the quantified information for one or more of the comment groups and/or categories about constituent content items for a time period may include encoded summary information for that time period, such as with respect to one or more encoding schemes specified in the comment group encoding information 180. As part of performing the prediction, the GDP service 150 may use one or more defined prediction templates from information 185, and may further generate and store such prediction templates for later use based on actual content items that are received—in some embodiments, the prediction template information may include an aggregation of encoded summary information for constituent content items that are part of a comment group and/or category for each of multiple time periods (e.g., consecutive time periods during a time window of interest), such as to be matched to aggregated encoded summary information from the comment group information 170 and/or the category information 175. In addition, the GDP service 150 may further take a variety of types of automated actions based on analyzed and/or predicted information, such as in accordance with client instructions or other client-specified criteria. Additional details related to operation of the GDP service in particular embodiments are included herein.
The one or more networks 100 may, for example, be a publicly accessible network of linked networks, possibly operated by various distinct parties, such as the Internet. In other embodiments, at least one of the network(s) 100 may be a private network, such as, for example, a corporate or university network that is wholly or partially inaccessible to non-privileged users. In still other embodiments, at least one of the network(s) 100 may include one or more private networks with access to and/or from the Internet. In the illustrated embodiment, the GDP service 150 may include one or more modules that perform particular operations, as discussed in greater detail with respect to 
Thus, user-supplied information of interest may be obtained in various manners in various embodiments, including by the GDP service or system accessing a particular comment data source to retrieve information from that comment data source (e.g., periodically, when the data is needed for analysis, etc.). For example, a GDP service may obtain information from a particular comment data source in various manners, such as via an API (“application programming interface”) provided by that data source, by retrieving information stored in a database or other storage mechanism, by scraping one or more Web pages of a Web site provided by the comment data source, etc. The GDP service may further obtain some or all of the available data from a particular comment data source, such as for one or more particular users and/or time periods, for all users and/or time periods, for users and/or information that meet specified criteria (e.g., publicly available information that does not have privacy restrictions), etc. In other embodiments and situations, at least some comment data sources may perform actions to supply at least some user-supplied information to the GDP service, such as in response to one or more prior requests of the GDP service (e.g., via an RSS, or Really Simple Syndication, feed, or other subscription request of the GDP service). In addition, in some embodiments and situations, at least some user-supplied comments may be received directly from a user that supplied them to one or more comment data sources, such as if a mobile application executing on a client device of the user also sends the comments to the GDP service. Furthermore, in some embodiments and situations, information that was initially supplied to one or more comment data sources may be available to the GDP service from another source, such as from an aggregator service that combines information from multiple comment data sources. In other embodiments, other types of user-supplied information may be obtained and/or other types of comment data sources may be used, and additional details are included below about obtaining user-supplied information of interest to analyze.
Once user-supplied information has been obtained for a particular time period, the user-supplied information may be analyzed in various manners. In some embodiments, the analysis includes analyzing user comments or other user-supplied content items to create corresponding comment groups for the time period. For example, each user-supplied content item may be analyzed to identify any topics of interest in the content, such as terms or phrases, hashtags, links to other data, or other n-grams, and optionally for topics that are determined to be of sufficient importance or relevance to the content item (e.g., by using a TF-IDF, or term frequency-inverse document frequency, analysis, or other content analysis technique). In addition, each user-supplied content item may be analyzed to identify any other types of attributes of interest that are associated with the content item (e.g., in the contents of the content item, in metadata associated with the content item, etc.), such as a location of where the content item was submitted from, an author user who generated or otherwise supplied the content item, a comment data source from which the content item was obtained, any n-grams from metadata associated with the content item, etc. After topics or other attributes of interest are identified, a comment group is created for each such topic or other attribute (or for a selected subset, such as to reflect a percentage or fixed quantity of the most relevant topics or other attributes) for a time period, with the created comment group including any analyzed content items from the time period that include the associated attribute for the comment group. In addition, in at least some embodiments, if additional user-supplied content items are available that were not used in the initial analysis (e.g., from one or more additional or supplemental comment data sources), additional such user-supplied content items that were supplied during the time period are retrieved, and additional content items that include the topic or other associated attribute for each created comment group are added to that comment group. Thus, each created comment group may be considered to represent its associated topic or other associated attribute for that time period. In other embodiments, information about content items may be analyzed and grouped in other manners, and additional details are included below about analyzing user comments or other user-supplied content items to create corresponding comment groups for a time period.
The analysis of user-supplied information for a time period may also include determining current topics that correspond to a particular content category for the time period. For example, a definition for a category of interest may be supplied or otherwise determined, such as to include one or more terms or other attributes corresponding to the category. User-supplied content items for the time period that include one or more of the definition terms or other definition attributes for the category are then identified (e.g., content items having one of the definition attributes, or all of the definition attributes, or a minimum defined quantity or percentage of the definition attributes), and the created comment groups to which those identified content items belong are then determined, with those determined comment groups being candidates to be associated with the content category for the time period. At least some of the determined comment groups are then excluded in some embodiments and situations, such as to remove comment groups that are under-inclusive or over-inclusive with respect to the subject matter relevant to the content category. The topics or other attributes associated with the remaining determined comment groups may then be identified as being the current topics for the content category during the time period. In at least some embodiments, the determination of candidate comment groups to exclude includes determining, for each candidate comment group, the frequency that its included user-supplied content items are in the identified content items for the content category. An average frequency and a standard deviation may then be determined across the candidate comment groups and used to establish lower and/or upper boundaries for the content category, such as to define a minimum exclusion threshold that is the average frequency minus half the standard deviation, and/or to define a maximum exclusion threshold that is the average frequency plus half the standard deviation. In other embodiments, comment groups to include for a content category may be determined in manners other than based on frequency of included user-supplied content items, and additional details are included below about determining current topics that correspond to a particular content category for a time period.
The analysis of user-supplied information for a time period may further include predicting information about expected additional content items that will be supplied by users for a comment group and/or content category, such as during each of one or more future time periods of interest. For example, the analysis may include quantifying information about the user-supplied content items included in the comment group and/or associated with the content category for the time period, such as based on a quantity of such user-supplied content items, or more generally for each of one or more selected factors of interest. In some situations, the quantification is further performed for particular subsets of the user-supplied content items included in the comment group and/or associated with the content category for the time period, such as to correspond to an intersection of that comment group and/or content category with a particular geographical location (for those content items supplied from each of one or more geographical locations), to correspond to an intersection of that comment group and/or content category with a particular author user (for those content items supplied by each of one or more author users), to correspond to an intersection of that comment group and/or content category with a particular comment data source (for those content items supplied to each of one or more comment data sources), etc. Such quantification information for the constituent content items supplied during the time period may then be combined with other corresponding quantification information for other related time periods, such as for all of the time periods during a sliding time window of interest (e.g., for every 30-minute time period during a 72-hour time window), and used to generate a histogram or other aggregation of that combined quantification information. In some embodiments, the quantification of information for a time period includes summarizing and encoding information about multiple selected factors for the constituent content items supplied during the time period, and the aggregation of that combined quantification information that is generated for multiple time periods during a time window of interest includes a succession of the encoded summary information for the selected factors for each of multiple successive time periods during the time window. The aggregated quantification information may then be used to generate predicted information about expected additional content items that will be supplied by users for the comment group or content category (or particular intersection) corresponding to the aggregated quantification information, such as with respect to one or more of the selected factors that are quantified and optionally summarized and encoded.
For example, a non-exclusive list of types of information that may be predicted with respect to a corresponding selected factor for a comment group and/or a category include the following: a predicted amount (or change) in total quantity of the comments included in the comment group and/or associated with the category (or particular intersection) in one or more future time periods; a predicted direction of increase or decrease in the quantity of the comments included in the comment group and/or associated with the category (or particular intersection) in one or more future time periods; a predicted change in which topics or other attributes will be associated with the category in one or more future time periods; a predicted amount (or change) of influence of one or more author users who supply content items included in the comment group and/or associated with the category (or particular intersection) in one or more future time periods; a predicted amount (or change) of one or more types of sentiment in the content items included in the comment group and/or associated with the category (or particular intersection) in one or more future time periods; a predicted amount (or change) of comments from one or more geographical locations that are included in the comment group and/or associated with the category (or particular intersection) in one or more future time periods; a predicted direction of increase or decrease in the quantity of the comments from one or more geographical locations that are included in the comment group and/or associated with the category (or particular intersection) in one or more future time periods; a predicted amount (or change) of comments from one or more comment data sources that are included in the comment group and/or associated with the category (or particular intersection) in one or more future time periods; a predicted direction of increase or decrease in the quantity of the comments from one or more data sources that are included in the comment group and/or associated with the category (or particular intersection) in one or more future time periods; etc.
In some embodiments and situations, the generating of the prediction information includes performing trend prediction based on the aggregated quantification information for a comment group and/or category (or particular intersection), such as by doing a 2nd degree polynomial least squares fit to the quantification information in the generated histogram or other aggregation (e.g., using weighting to reflect time and/or one or more other factors for the quantification information in the generated histogram or other aggregation), and using the resulting trend line to determine the predicted information. In other embodiments and situations, the generating of the prediction information includes matching a generated histogram or aggregation of encoded summary information or other quantification aggregation to a portion of a defined prediction template (e.g., to an initial subset of an additional histogram that is based on quantified information for prior actual user-supplied content items that have been received), and using an additional portion of the defined prediction template (e.g., a later subset of the additional histogram, a later portion of the aggregation of encoded summary information, etc.) to indicate the predicted information. In other embodiments, other types of information may be quantified and used to generate prediction information for a comment group and/or content category (or particular intersection), and additional details are included below about predicting information about expected additional content items that will be supplied by users for a comment group and/or content category (or particular intersection).
The analysis of user-supplied information for a time period may further include generating a prediction template that reflects actual user-supplied content items that are received over time for a comment group and/or content category, such as for later use in predicting information about expected additional content items that will be supplied by users. For example, a histogram or aggregation of encoded summary information or other aggregation of quantification information that is generated to reflect user-supplied content items received for a comment group and/or a content category (or particular intersection) may be analyzed to determine if the histogram or aggregation of encoded summary information or other quantification aggregation is successful in predicting other expected content items that will be received in other situations. In addition, or alternatively, trend prediction information that is generated based on a generated histogram or aggregation of encoded summary information or other quantification aggregation for a comment group and/or a content category may be analyzed to determine if the corresponding predicted information reflects later user-supplied content items that are actually received for the comment group or content category. When a generated histogram or aggregation of encoded summary information or other quantification aggregation and/or corresponding trend prediction information is found to be successful in predicting other user-supplied content items, it may be put into use as a prediction template in predicting additional future user-supplied content items that will be received, such as to further evaluate its performance. In other embodiments, prediction templates may be generated in other manners, and additional details are included below about generating a prediction template for later use in predicting information about expected additional content items that will be supplied by users.
After the analysis of user-supplied information for one or more time periods is performed, various types of resulting automated actions may be taken, such as based on analyzed information about the user-supplied content items that have already been received and/or based on predictions about additional user-supplied content items that are expected to be received. For example, information from the analysis may be provided to one or more clients of the GDP service, such as users or other entities who pay fees to receive information about specified content categories of interest. In addition, in some embodiments and situations, information from the analysis is used by the GDP service to take automated actions to supply additional content items that correspond to a comment group or category of interest (e.g., additional comments that are generated by the GDP service and made available to users), or to otherwise supply information corresponding to the comment group or category (e.g., initiating or changing related Internet-based advertising or other advertising), optionally in accordance with instructions or other criteria specified by one or more clients of the GDP service—such additional information may, in some situations, alter or otherwise affect future direction of a particular distributed group discussion, such as by including additional supplemental information to further support particular topics and conversations, by including additional information to refute or otherwise alter a discussion about particular topics, etc. In other embodiments, various other types of actions may be taken by the GDP service in appropriate circumstances, and additional details are included below about taking automated actions based on information generated from the analysis of user-supplied information for one or more time periods.
In addition, other types of analysis of user-supplied information for one or more time periods may be performed in at least some embodiments, such as to generate additional types of information related to distributed group discussions. As a first example, information about users who supply content items in one or more comment groups and/or associated with one or more categories during one or more time periods may be analyzed, such as to determine particular author users who have a high or low amount of influence within the comment groups or categories (e.g., relative to other author users for the same or other comment groups and/or categories), such as to be able to direct how a discussion evolves or changes. As a second example, information about content items in one or more comment groups and/or associated with one or more categories during one or more time periods may be analyzed, such as to perform a sentiment analysis with respect to contents of the content items (e.g., by using term/symbol list matching; topic decomposition and subspace projection; Bayesian classification or other classification techniques, such as using training data from human-annotated sentiment assignments of particular user comments; etc.). Such additional types of discussion-related information for one or more comment groups and/or one or more categories over one or more time periods may be used in various manners, including to encode summary information about one or more such additional types of discussion-related information for one or more corresponding selected factors, to take some or all of the same types of automated actions as discussed above, or instead to take other types of automated actions. In other embodiments, various other types of analyses may be performed by the GDP service to generate various other types of additional discussion-related information, and additional details are included below about performing analyses with respect to author user influence and/or sentiment analysis.
For illustrative purposes, some embodiments are described below in which specific types of user-supplied information is analyzed to provide particular types of resulting information related to distributed group discussions in specific ways, including to predict various types of information about expected future user-supplied information that will be received. However, it will be understood that such information related to distributed group discussions may be generated in other manners and using other types of input data in other embodiments, that the described techniques may be used in a wide variety of other situations for other types of data, that other types of information related to distributed group discussions may similarly be generated and used in various ways, and that the invention is thus not limited to the exemplary details provided.
In particular, 
In particular, in this example, a first user comment 205a is shown, which includes various metadata 210a and other content (which in this example includes a title and accompanying textual content). The metadata 210a that is available to the GDP service in this example includes an author user name 211a, an author user source-specific identifier 212a, information 213a about a geographical location from which the comment was supplied, and information 214a about the comment data source to which the user comment was supplied—at least some such information may not be displayed to other users, however, such as the geographical location information 213a and/or the comment data source information 214a. In this example, an analysis of the user comment may identify a variety of topics or other attributes that may be used for subsequent analysis, including in some embodiments and situations to treat each word or phrase in the contents as a topic, and to include each of the indicated types of metadata information 211a, 212a, 213a and 214a as a comment attribute. Examples of topics that may be used from the comment's contents include words in the title (e.g., “republican” 222a), words in the content body (e.g., “president” 220a, “Obama” 221a, “are” 224a, “the” 225a, etc.), and phrases (e.g., “debt limit” 223a). In other embodiments, a subset of the words/phrases and other attributes may be selected using one or more of a variety of types of information analysis techniques, such as to eliminate words that are too common to be useful in identifying the subject matter of the comment (e.g., “the”, “of”, etc.), and to ignore differences in capitalization, hyphenation, stemming, etc.
The user comments 205b and 205c similarly include metadata 210 and other contents, and include various types of information that may be used as topics or other attributes for those comments. For example, user comment 205b includes the term “Obama” 222b in a manner similar to term 221a of comment 205a, as well as various other terms (e.g., “eagle” 221b and “10860” 228b). Comment 205b is also illustrated as including two in-line metadata hashtags that may be used as topics for the comment, which in this example are “#OvalOffice” 226b and “#Obama” 227b. In some embodiments and situations, terms such as “Obama” and “#Obama” may be grouped together and treated as the same topic, while in the illustrated embodiment the terms are handled separately. User comment 205c similarly includes various terms, which in this example includes a user-selectable link 229c (e.g., an HTML hyperlink)—in some embodiments, such links and/or associated metadata for the links (e.g., terms that are included as part of the link, tags or other metadata associated with the link, etc.) may be used as topics, and in some embodiments contents of such a link may be retrieved and analyzed for use as contents and/or metadata of user comment 205c in a manner similar to other inline contents, while in other embodiments one or more both such types of information may not be used. In some embodiments, a user comment may further have additional information supplied by other users that may similarly be used as a topic or attribute for the comment (e.g., hashtags, such as #president or #funny; other types of tags, such as “funny” or “like” or “useful” or “©”; etc.), although such types of additional information are not illustrated in this example. It will be appreciated that the identification of topics or other attributes for one or more user comments may be performed in a variety of manners in various embodiments. For example, in some embodiments the analysis is performed in a language-neutral manner (e.g., when extracting n-grams from the contents of a user comment), and thus the user comments may include comments in multiple languages—in some such embodiments, translation capabilities may further be employed to translate at least some user comments into one or more target languages before the identification of topics or other attributes for those user comments. As one non-exclusive example of a particular set of techniques for identifying at least some topics or other attributes for at least some user comments, the following steps may be employed: n-grams are extracted from comment title and body text by first converting the text to a collection of n-grams (e.g., bi-grams or tri-grams); the n-grams containing predefined common words in at least some positions are removed from consideration; a most significant set of X n-grams are then selected based on scoring (e.g., via a pointwise mutual information, or PMI, algorithm), with X being a configurable predefined number or otherwise being determined based on the available data (e.g., a specified percentage); and changes in significance of the n-grams are tracked as more comments containing them are identified.
After the topics and other attributes are identified for a variety of user comments being analyzed, comments groups may be created for each of some or all of the topics and other attributes. A few example comment groups 230 are illustrated, along with information about particular example comments included in each comment group. For example, the “President” comment group 230a is based on the term “president”, and includes at least comments 205a and 205c, since those comments include terms 220a and 222c, respectively, that match the term (but does not include comment 205b in this example, despite its contents being related to the president of the United States, since the term “president” is not included in comment 205b). Comment groups 230b and 230c correspond to the terms “Obama” and “#Obama”, respectively, and the comment group based on the term “the” reflects that most or all comments may include such common terms. Comment groups 230e and 230f provide examples of comment groups corresponding to comment attributes that may be from metadata rather than contents of at least some user comments, such as for comment group 230e corresponding to a particular geographical location (in this example, the city of “Philadelphia”) and including any comments supplied from that geographical location (with other comment groups, not shown, corresponding to other geographical locations), and for comment group 230f corresponding to a particular author user (in this example, user “Andy Jones”) and including any comments supplied by that author user (with other comment groups, not shown, corresponding to other author users). It will be appreciated that geographical locations and users may be represented in a variety of manners in various embodiments, including at differing degrees of specificity (e.g., to represent geographical locations by individual addresses, GPS coordinates, neighborhoods, cities, regions, counties, states, countries, etc.; and to represent users via legal names, source-specific login names or screen names or other identifiers, numeric or other unique identifiers, etc.). While not illustrated here, comment groups may also be created for each of multiple distinct comment data sources, to include comments supplied to those comment data sources.
In the example of 
In this example, the additional user comment 245a from comment data source YYY includes an additional comment from author user “Andy Jones” 241a who also supplied comment 205a of 
Additional user comment 245b from comment data source ZZZ includes a photo and related supplementary text, and is added to the “Obama” comment group 230b (based on the term being included in the textual commentary) and the “Philadelphia” geographical location comment group 230e (based on geographical location information 243b) for the comment 245b. While not illustrated here, in some embodiments an analysis of data in a user comment may be performed to identify additional attributes that may be used to associate the user comment with one or more appropriate comment groups. For example, with respect to the example user comment 245b, an image recognition process and/or other analysis of the photo may be performed to identify additional metadata attributes (e.g., based on identified objects, people, locations, etc.) for the user comment—illustrative examples include the following: to recognize President Obama within the photo, and thus associate the user comment with comment group 230b even if the provided commentary was absent; to recognize Washington D.C. or the Capitol building from the photo, and associate the user comment with corresponding comment groups; to examine location and time metadata embedded in the photo or otherwise associated with the photo, and use the location data to associate the user comment with a corresponding comment group; etc. In some embodiments, a comment group 230e will include both comments that are supplied from the geographical location of Philadelphia and comments that include the term “Philadelphia” in the contents or other attributes of the comment, while in other embodiments the comment group 230e will only include comments that are supplied from the geographical location of Philadelphia. The modified comment group information 235 further illustrates an example of a data source comment group 230g in this example, which corresponds to comment data source XXX, although such a comment group 230g would not be modified in this supplemental operation in this example since the additional user comments are not from the comment data source XXX. In addition, if new topics or attributes are included in the additional user comments that were not present in the initially analyzed user comments (e.g., comments from author user “Jenny Smith” 241b), new comment groups may be created for such new topics or attributes in some embodiments, while in other embodiments such new comment groups will not be created.
After the matching user comments are associated with a category in section 250, all of the created comment groups to which those matching user comments belong are then identified, as shown in part in section 255 for the President Obama category. It will be appreciated that each user comment may be associated with multiple comment groups, and thus a large number of comment groups may be initially identified for a category. In the example of section 255, the identified comment groups are then analyzed to determine their relevance to the associated user comments shown in section 250. In particular, in this example, the frequency is determined for each identified comment group that the associated user comments in section 250 are included within that comment group. As an example, information 260 illustrates that 10,000 user comments have been identified as being associated with the “President Obama” category in section 250 (although only a small fraction are shown in section 250). Using the “The” comment group discussed in 
Additional processing is then performed to select a subset of the identified comment groups as being associated with the “President Obama” category for this time period. In particular, as shown in information 260, an average frequency is determined for the identified comment groups, and frequency-based minimum and maximum inclusion thresholds are determined based in part on the average frequency. As one example, the standard deviation (not shown) may be further determined for the frequency values, and the minimum and maximum inclusion thresholds may be determined based in part on the standard deviation. In this example, the lines 255a and 255b in section 255 denote the lines for the maximum and minimum inclusion thresholds, respectively. Accordingly, a term such as “the” is excluded in this example as being too ubiquitous or general for the category, and a term such as “Chip Kelly” is excluded as being too rare or specific (or irrelevant) for the category. The topics or other attributes of the remaining identified comment groups may then be identified as topics that are relevant to the “President Obama” category for this time period, including topics such as “Obama,” “President,” “democrat,” “inauguration,” “debt limit,” and “republican,” as well as geographical location attributes “Washington D.C.” and “Chicago”—while no author user attributes or comment data source attributes are illustrated in this example, it will be appreciated that such attributes may similarly be identified and included. It will further be appreciated that some of the identified topics for this category may be fairly constant over different time periods (e.g., “President” and “democrat”), while other identified topics may change over time to reflect subject matter that is or is not of sufficient current interest at a given time (e.g., such that “inauguration” and/or “debt limit” may not be included in this category during some time periods).
While such quantified information may be of use to reflect recent events and status, such quantified information may further be used to predict information about additional user comments that are expected to be received, including changes with respect to topics that are part of a category, and/or changes with respect to how many, when, where and by whom that expected future user comments will be received for a particular comment group or content category. 
Based on the match to portion 265a of prediction template 265, some or all of the additional portion 265b of the prediction template 265 may be used to predict additional expected future comments that will be received for the comment group or category represented by histogram graph 260, such as after any appropriate scaling is performed. In particular, in this example, the additional portion 265b of the prediction template 265 may be used to predict that the quantity of user comments will continue to fall for five more time periods, and to then level off for about eight additional time periods, after which the quantity will rise again somewhat before gradually decreasing. It will be appreciated that a variety of types of shapes may be represented by prediction templates in a variety of manners, such as to reflect a variety of types of distributed group behavior related to an underlying discussion that is occurring.
By generating such quantified information for each determined comment group for a content category, and by predicting additional corresponding information for each such comment group, a variety of types of information may be determined for the content category. As one example, if the quantity of comments that are predicted to occur for a first comment group decreases sufficiently, while the comments that are predicted to occur for a second comment group increases, the increasing influence of the topic for the second comment group to the content category may be determined. In addition, the time at which a particular topic will stop and/or start being a sufficiently relevant topic for a content category may be predicted. Based on such information, the GDP service or a client that receives such information may be able to take various types of corresponding actions, such as to reduce or stop activities related to topics that are losing relevance, to increase or start activities to try to increase interest in a topic of importance that is otherwise losing relevance, to target additional activities related to topics that are increasing in relevance, etc. In addition, when such quantified information is generated for geographical location attributes, comment data source attributes, and/or author user attributes that are determined topics for a content category, and when additional corresponding expected future information is predicted for such attribute-based comment groups, a variety of additional types of information may be determined for the content category. For example, not only can relative differences be determined in the relevance of different topics to a category, but such differences can similarly be tracked and predicted across different geographical areas (e.g., this topic is increasing in relevance in location A, but remaining constant in location B), different comment data sources, and/or different author users. Differences across other types of attributes may similarly be tracked and predicted, and such information about attribute-based comment groups may similarly be used by the GDP service or a client that receives such information to take various types of corresponding actions.
In addition to using a trend line prediction to generate prediction information for future time periods, the same or similar types of trend line prediction information may be used to assist in generating new defined prediction templates. For example, considering the histogram graph 270, various portions of the histogram graph may be selected for additional analysis, such as the last 24 hours (the portion labeled “72” that includes histogram data), the last 48 hours (the portions labeled “48” and “72” that include histogram data), the first 24 hours (the portion labeled “24” that includes histogram data), the first 48 hours (the portions labeled “24” and “48” that include histogram data), etc. For each such portion of the histogram, a prediction trend line may be generated based on the data in that portion, and the subsequent portions of data about actual user comments may be used to determine if the prediction trend line accurately predicted information about those actual user comments. If the prediction trend line for a portion of the histogram does accurately predict such information, that portion of the histogram may be selected to be used or evaluated as a prediction template for performing future predictions, and/or that portion of the histogram plus the later actual matching predicted data may be selected for use as such a prediction template. In the example of 
In particular, 
A comparison or other analysis of the quantified information in the table 1000 provides various information about the changes of the quantified information over time, including the following non-exclusive examples: a change in total quantity of content items over successive time periods, such as may be measured in an absolute number or percentage change (e.g., to determine that the total quantity increases in amount by approximately the same quantity between each of the first four time periods, which are time period N through time period N+3 as shown in rows 1001a-1001d, but the increase slows in time period N+4); changes between different geographical locations over time (e.g., to reflect that location 1 initially has much larger quantities that location 2 but that the quantities for location 1 begin to decrease in time period N+2, while the quantities continue to grow for location 2 until time period N+3 and surpass the quantities for location 1 in time period N+2); changes between different data sources over time (e.g., to reflect that data source 1 initially has much larger quantities that data source 2 but that the quantities for data source 1 begin to decrease in time period N+2, while the quantities continue to grow for data source 2 until time period N+3 and surpass the quantities for data source 1 in time period N+3); changes between different author users over time (e.g., to reflect that author user 1 initially supplies larger quantities that author user 2 but that the quantities for author user 1 begin to decrease in time period N+2, while the quantities continue to grow for author user 2 until time period N+3 and surpass the quantities for author user 1 in time period N+3); changes in aggregate user sentiment, such as to reflect that user sentiment changes from positive (with value ‘3’) to neutral (with value ‘2’) in time period N+3 and to negative (with value ‘1’) in time period N+5; changes in the most influential author user, such as to reflect that author user 1 continues to be the most influential in time period N+3 despite not supplying the most content items during that time period, and that author user 2 surpasses author user 1 as the most influential in time period N+4; etc. It will be appreciated that various other types of information may be determined in some embodiments from analyses of quantified information, whether instead of or in addition to the discussed types of information.
Furthermore, prediction template 3 (reference number 1303) illustrates that different prediction templates may include different amounts of data for the additional portions, such as if prediction templates 1 and 2 are used to predict future information for up to 6 future time periods, and prediction template 3 is used to predict future information for an additional number of future time periods (e.g., up to 24, up to 48, up to 72, up to 144, etc.)—in other embodiments, all of the prediction templates may include the same amounts of data for the additional portions, but different subsets of the additional portion data may be selected and used (e.g., if the additional portion includes data for 144 time periods, but at times only the next six time periods may be selected and used). In addition, prediction template 4 (reference number 1304) illustrates that different prediction templates may include summary data that is encoded using different encoding schemes, such as to include an initial portion and additional portion whose summary data is encoded using encoding scheme 2.
The initial portions of one or more of the prediction templates may be matched to the aggregation 1200 in various manners in various embodiments—in this example, matching information is illustrated for prediction templates 1 and 2, using a bitwise comparison with a binary yes or no determination of a match. Thus, in comparing the first encoded summary value “23313” of the aggregation 1200 to the first encoded summary value “23276” of the initial portion 1301a of prediction template 1, a matching value of “YY---” results, with ‘Y’ representing a binary yes and ‘-’ representing a binary no, based on the initial ‘2’ of each of the values matching, the next ‘3’ of each of the values matching, the next ‘3’ of the aggregation not matching the next ‘2’ of the prediction template, the next ‘1’ of the aggregation not matching the next ‘7’ of the prediction template, and the next ‘3’ of the aggregation not matching the next ‘6’ of the prediction template. Similar matching values are shown for the remaining part of the initial portion of prediction template 1, as well as for the initial portion of prediction template 2. In other embodiments, a degree of match between two pieces of encoded summary information may be made in manners other than a binary yes or no (e.g., for a selected factor whose values are enumerated with multiple numerical values, to determine a numerical difference between two values).
Given multiple candidate prediction templates such as prediction templates 1 and 2, a best match to the aggregation 1200 may be determined in various manners in various embodiments. In some embodiments, the overall degree of match between the entire aggregation and the entire initial portion may be used, which in this example results in prediction template 2 being a better match to aggregation 1200 than prediction template 1—in particular, prediction template 2 has an 80% degree of overall match (24 matches out of 30 possible values), while prediction template 1 has only a 30% degree of overall match (9 matches out of 30 possible values). If prediction template 2 is selected (e.g., based on being the best match; based on having a degree of match above a defined threshold, such as 75%; etc.), some or all of the additional portion of prediction template 2 may be used to predict future information for one or more of the selected factors represented by encoding scheme 1, as discussed in greater detail with respect to 
In other embodiments, the degree of match between a subset of the aggregation and a subset of a prediction template's initial portion may instead be assessed. For example, encoding scheme 1 includes information about 5 selected factors, but only 1 factor may currently be of interest, such as factor a) discussed in 
Based on prediction template 2 of 
The information for column 1403a corresponding to factor a), and for rows 1501g-1501i, illustrate alternative predicted information that may be generated with respect to the next six time periods for the total quantity of content items in the example comment group. In particular, if prediction template 1 of 
It will be appreciated that the details discussed with respect to the examples of 
In particular, 
While various GUI controls may be displayed in some embodiments, in the current example such GUI controls are not displayed, although some or all of the displayed indications of the classifications 1605 and/or categories 1610 may themselves be user-selectable controls, such as to enable a user to select a particular classification or category to obtain additional information about it. For example, the user may in some embodiments be able to click on, hover over or otherwise select the “Obama” category 1610b in the “Politics” classification, such as to obtain information about the corresponding category (e.g., in a manner similar to that of the “President Obama” category discussed with respect to 
In the example of 
As discussed in greater detail elsewhere, various of the types of information that are displayed may be personalized to the user and/or configurable to the user in various manners. In addition, in this example, various additional types of information and user-selectable controls 1902, 1920, 1925, and 1930 are displayed, such as to allow the user to obtain additional information of one or more types corresponding to the current comment group. For example, if the user selects control 1902, additional details about particular constituent content items of one or all types (e.g., photos) may be displayed in an updated GUI. Similarly, areas 1920, 1925, and 1930 may include information of various types to the user (although particular details are not currently shown in areas 1920 and 1925), such as details about some or all of the constituent content items, about particular author users (e.g., users who have high or low influence), about particular data sources from which the constituent content items are provided, about particular geographical areas with which the constituent content items are associated, etc. —in at least some embodiments, selection of one or more such user-selectable controls and information areas will provide expanded information to the user of that type. The GUI 1900 in this example further includes an additional user-selectable control 1904 to allow the user to modify the current display, such as to return to a previous page (e.g., the page of information displayed in the GUI 1800 of 
As one example of a visual aspect that may be conveyed in a display similar to that of 
Returning to 
In addition, in this example, the Y axis is associated with a selected factor that corresponds to a rate of change (e.g., a percentage increase or decrease) in the quantity of constituent content items with respect to a prior time period, with the vertical center of the overall item 2010 representing a constant rate of change in quantity in this example—in other embodiments, the Y axis may instead correspond to an absolute amount of change in quantity of constituent content items with respect to a prior time period. In this example, a sub-item 2020 that is higher in the display than the x-axis has a progressively higher positive rate of change in quantity, while a sub-item 2020 that is lower in the display than the x-axis has a progressively higher rate of a negative change in quantity (i.e., a reduction in rate of change in quantity). Thus, a visual inspection can readily determine that sub-item 2020f has a higher growth rate in quantity of constituent content items than does, for example, sub-item 2020b (which has a zero rate of quantity change, or a constant quantity), and with sub-item 2020b having a higher growth rate in quantity than the negative growth rates for sub-items 2020c and 2020d.
In addition to the displayed vertical location, different colors are also mapped to different vertical locations in this example to emphasize different rates (or alternatively, amounts) of quantity change. In particular, the blue color of sub-item 2020b represents a mostly constant quantity (or zero rate, or amount, of quantity change) for the constituent content items, the purple color of sub-item 2020f represents a relatively high value for the rate (or amount) of quantity change (and with varying degrees of color between blue and purple indicating increasing positive values), and with the black of sub-item 2020c indicating a relatively large negative value for the rate (or amount) of quantity change (with corresponding degrees of darker blue and gray between the blue of sub-item 2020b and the black of sub-item 2020c indicating increasing negative values). By using both color and vertical location to illustrate a single selected factor in this example, corresponding information can be easily determined by a user in multiple visual manners, although in other embodiments color and vertical location may represent different types of information for different selected factors.
As with the size-related visual aspect, the visual aspects corresponding to vertical location, horizontal location and/or color may in some embodiments have associated legends, keys, and/or scales that are displayed to the user, and one or more of the vertical location, horizontal location and color visual aspects may be further configurable by the user in various manners in various embodiments (e.g., to allow the user to specify a different selected factor with respect to one or more of the visual aspects, etc., to change a scale with respect to a particular selected factor, etc.).
In addition to size, vertical location, horizontal location, and color, the example GUI 2000 could further use movement in the vertical and/or horizontal directions that is represented within a single time period (e.g., via animation) to represent further selected factors of interest in at least some embodiments, although such movement-based visual aspects within a single time period are not used in this example GUI (although changes in location for different time periods do reflect changes in corresponding information over those time periods). For example, a horizontal movement visual aspect within a single time period could be used to illustrate a rate of change in sentiment or other selected factor assigned to the X axis (e.g., for actual information over one or more prior time periods to the current time period, for predicted information from the current time period through one or more future time periods, etc.), or instead could be used to illustrate information for a selected factor other than the one assigned to the X axis. As an alternative to animation, visual arrow indicators 2004 could be displayed to reflect information about vertical movement, such as with the length of the arrow indicating an amount of movement. Similar types of horizontal movement visual aspect may also be displayed, whether in addition to or instead of the vertical movement information.
In addition to size, horizontal location, vertical location, and color, the illustrated example GUI 2000 further includes information not only about the current real-time information for one or more time periods of interest, but also predicted future information for one or more future time periods of interest. In particular, in this example, some of the sub-items 2020 have an associated halo 2030 (or alternatively, a shadow that does not fully encompass the sub-item) that corresponds to predicted future information for that sub-item, with the size and/or color of the halo providing additional information about the corresponding predicted future information. For example, with respect to sub-item 2020f, if the size of the sub-item corresponds to the quantity of constituent content items for that sub-item, then the size of the prediction halo 2030a for the sub-item may indicate that the quantity of constituent content items for that sub-item is predicted to increase over the one or more future time periods of interest, with the size of the prediction halo 2030a indicating the predicted quantity. As can be visually determined, the predicted future quantity for the sub-item 2020f is thus roughly the same amount as the current quantity for sub-item 2020b, as both are a similar size. In a similar manner, the prediction halo 2030b for sub-item 2020d indicates that the predicted quantity of constituent content items for the sub-item 2020d is expected to decrease over the one or more future time periods of interest, with the size of the prediction halo 2030b indicating that predicted smaller future quantity. The colors of the prediction halos may also convey various types of information, such as to be selected to be different from the color of the corresponding sub-items (e.g., to enable easy identification of the prediction halo, such as to use the color orange of the prediction halo 2030a for all prediction halos, or for prediction halos whose predicted future quantity is larger than the current actual quantity of the corresponding sub-item), or to further use different colors for a given prediction halo to display additional information about the predicted future quantities (e.g., to use different colors for a prediction halo to indicate a predicted future rate of change). As with other aspects discussed, additional legend, key, and scale information may be displayed corresponding to the prediction halos, and some or all of the aspects of the prediction halo display may be configurable by the user in at least some embodiments.
Thus, given a variety of different visual aspects, a visual inspection of the example GUI 2000 of 
While various example visual aspects and corresponding selected factors and information are illustrated with respect to 
In addition, in some embodiments, different shapes may be used rather than circles to convey additional types of information for a corresponding sub-item based on the corresponding shape. As one example, the shapes used may not be symmetrical in both the X and Y axes, such as to use the different X and Y vales to convey additional information. For example, a circle may be replaced with an oval, such as an oval that is longer along the X axis than the Y axis, or that is instead longer along the Y axis than the X axis. If so, the length along the X axis could illustrate additional information about the selected factor for the X axis (e.g., a rate of change in sentiment), while the position along the X axis continues to indicate a current sentiment value, and while the distance along the Y axis could continue to be used to indicate a quantity size for constituent content items. In a similar manner, other types of shapes could be used that differ in the X and Y dimensions, such as diamonds, triangles, etc. Furthermore, a different shape that is not symmetrical could be used to optionally illustrate information over time. For example, a teardrop shape could be used to represent changes over time of a particular factor of interest. Consider, for example, a teardrop shape whose central axis runs left to right, such that the shape at the left end (whether the teardrop point or the teardrop bulb) corresponds to a past period of time and the shape at the right end of the teardrop corresponds to a current or future time, such that the length of the teardrop along the x-axis represents a length of time, and with the width of the teardrop in the y-axis direction at a given point indicating a value in sentiment or some other selected factor at a corresponding time—in this manner, the overall teardrop shape may be used to indicate changes in the sentiment value over the length of time corresponding to the teardrop length. It will be appreciated that other types of shapes can similarly be used in other manners and other embodiments.
In addition to shapes of individual sub-items, the information that is displayed within particular sub-items may further provide an additional visual aspect that conveys additional information, with the examples of 
As one example, sub-item 2130 illustrates displaying pie chart information within the sub-item to illustrate additional information about different subsets of the constituent content items for the sub-item. Thus, for example, the pie chart could correspond to an additional selected factor, with the different pie slices 2130a-2130c indicating the relative size of different subsets of the constituent content items corresponding to different values for that selected factor (e.g., with different pie slices representing different geographical locations associated with the constituent content items, different author users associated with the different constituent content items, different data sources associated with the different constituent content items, etc.). As with other visual aspects discussed, additional legend, key, and/or scale information could further be illustrated for such a visual aspect.
Sub-item 2110 illustrates an alternative type of information that could be displayed within the sub-item, and in particular illustrates timing-related information to show information for the constituent content items over time. As one example, the upper right quadrant 2110a could represent information about a current time for a selected factor, while the additional slices 2110b-2110d could illustrate the same type of information but for successive predicted future periods of time, or instead the different quadrants (or other divisions of the circle) could correspond to other combinations of two or more past and/or future time periods. Accordingly, each quadrant or other slice could indicate a particular value for a corresponding selected factor for a corresponding time period, such as to indicate an identifier of a highest data source or highest geographical location or highest author user during that time period, to indicate an actual or predicted rate of change for the corresponding time period, etc.
Sub-item 2140 illustrates yet another alternative type of information that could be displayed within a sub-item, and in particular illustrates a different type of timing-related information to show information for the constituent content items at a particular indicated time. For example, the illustrated information may be used to represent a clock and to indicate a particular time that a corresponding value of interest has occurred or is predicted to occur for a selected factor, such as a time at which a maximum or minimum value occurs for the selected factor, a time at which a change is expected to occur between two indicated values, etc. It will be appreciated that a variety of other types of information could similarly be displayed within a sub-item, including bar charts, line charts, textual information, etc.
Sub-items 2150 and 2160 illustrate other alternative types of information that could be displayed within a sub-item, and in particular illustrate two types of map-related information that may be used to show information for the constituent content items at a particular indicated time. For example, location-related information may be displayed within the sub-item corresponding to the constituent content items for the sub-item, such as to use coloring, shading, location indicators, etc. within the map to indicate different geographical locations that correspond to the constituent content items (e.g., to show relative quantities of constituent items in different geographical locations, or to instead show other types of location-related information).
While the illustrated GUI 2000 of 
It will be appreciated that the details discussed with respect to the examples of 
The user computing devices 350 are similarly illustrated as each having one or more CPU processors 351, one or more I/O components 352, memory 357, and storage 354, although particular I/O components and stored information is not illustrated. The other computing systems 360, 370 and 390 may similarly include some or all of the same types of components as the server computing system 300, but such components are not illustrated in this example for the sake of brevity. The server computing system 300, the GDP system 340 and the system 340 modules may also communicate with such other computing devices and systems in various manners, including via one or more networks 395 (e.g., the Internet, one or more cellular telephone networks, etc.).
In the illustrated embodiment, the GDP system 340 is executing in memory 330, and in this example includes several modules, including a Comment Group Creation Manager module 342, a Category Analysis Manager module 344, a Comment Prediction Manager module 346, a Prediction Template Generation Manager module 348, a Results Information Provider Manager module 347, and optionally one or more other modules 349. The system 340 and/or the system modules may in some embodiments include software instructions that, when executed, program or otherwise configure the processor(s) 305 and the server computing system(s) 300 to perform automated operations that implement at least some of the described techniques.
The GDP system 340 and its modules may obtain and use various information as part of its automated operations, such as to obtain user-supplied textual comments or other user-supplied content items from information 369 on one or more comment source computing systems 360 (e.g., computing systems that support one or more social networking sites or other available sites with user-supplied information) and/or from other computing systems (e.g., directly from computing devices 350 of users who supply the information, from other external computing systems 390, etc.), and may store such obtained information in a comment information database 321 on storage 320. The content items supplied to the computing systems 360 and/or 390 may originate from, for example, human users interacting with their user client computing devices 350, such as via a Web browser 358 executing in memory 357 of the client device, or via other software applications (not shown) executing on the client device.
In addition, the GDP system 340 may optionally obtain various types of client-related information from users or other entities that act as clients of the GDP system 340, such as by interacting with corresponding client computing systems 370 (e.g., via a corresponding optional module 349 that enables clients to register with the system 340 and/or to provide other types of client-specific information), and may store some or all such information in optional client information database 329 on storage 320. In at least some embodiments, some or all of the category definition information in database 328 may similarly be received from clients, such as to analyze information about a particular specified content category on behalf of a client who specifies a definition and/or other information about the content category. When such clients exist, the GDP system 340 may further provide various types of information to the clients (e.g., by sending the information to the client computing systems 370), and/or take various other types of automated actions on behalf of such clients, such as in accordance with specified client instructions or other specified criteria. While not illustrated here, in other embodiments some or all of the GDP system 340 may execute on behalf of a single client or a single entity (e.g., an organization with multiple employees or other members).
The Comment Group Creation Manager module 342 may perform automated operations to analyze various user-supplied textual comments or other user-supplied content items that have been supplied for one or more time periods (e.g., as may be stored in database 321 on storage 320 or in one or more other locations), such as to group content items based on common topics in their contents or based on other comment attributes that they share, and to optionally store corresponding comment topic information in database 322 on storage 320 and comment group information in database 323 on storage 320. In some embodiments, quantified comment group information that is stored in database 323 may including, for one or more of the comment groups, encoded summary information about constituent content items in the comment group for each of one or more time periods, such as with respect to one or more encodings specified in comment group encoding information 325 on storage 320.
The Category Analysis Manager module 344 may perform automated operations to determine topics or other attributes that are associated with a specified content category for one or more time periods, such as by identifying particular comment groups that are relevant for the specified content category for one or more time periods based on user-supplied comments or other content items included in those comment groups. In doing so, the module 344 may use information stored in comment information database 321, comment group database 323, comment topic database 322, and category definition database 328 stored on storage 320, and may store corresponding determined category information in category information database 324 stored on storage 320. In some embodiments, quantified category information that is stored in database 324 may including, for one or more of the categories, encoded summary information about constituent content items in the category (e.g., for all content items in the category, for content items in an intersection of the category and a particular constituent comment group, etc.) for each of one or more time periods, such as with respect to one or more encodings specified in comment group encoding information 325 on storage 320.
The Comment Prediction Manager module 346 may perform automated operations to quantify user-supplied comments or other content items that are associated with particular comment groups and categories, and to use such quantified information for a category or comment group to predict information about additional user-supplied comments or other content items that are expected to be supplied for the category and/or comment group. In some situations, the predictions are performed by generating prediction trends, and in other situations the predictions are performed using previously generated prediction templates. In doing so, the module 346 may use information stored in comment group database 323, category information database 324 and/or prediction template database 326 stored on storage 320, and may store quantified information about content items associated with particular comment groups and categories in comment group database 323 and/or category information database 324, and may store corresponding generated prediction information in comment group database 323 and category information database 324 (or instead in one or more other prediction information databases, not shown, in other embodiments). In some embodiments, the prediction template information may include encoded summary information for constituent content items that are part of a comment group and/or category for each of multiple time periods (e.g., consecutive time periods during a time window of interest), such as to be matched to encoded summary information from the comment group information in database 323 and/or the category information in database 324.
The Prediction Template Generation Manager module 348 may use quantified information about supplied comments or other content items associated with a category and/or with comment groups, and/or may use predicted information about additional user-supplied comments or other content items that are expected to be supplied for the category and/or comment groups, such as to identify or generate a histogram or other template of information that accurately reflects comments or other content items actually supplied for a category and/or comment groups, and that may be used as a template to predict information about additional user-supplied comments or other content items that are expected to be supplied for the category and/or comment groups. In doing so, the module 348 may use information stored in comments database 321, comment group database 323, and/or category information database 324, stored on storage 320, and may store corresponding new prediction templates in prediction template database 326.
The Results Information Provider Manager module 347 may retrieve analyzed information and/or predicted information for one or more categories and/or comment groups and/or classifications, such as to reflect constituent content items that are associated with those categories, comment groups and/or classifications for one or more time periods, and may then initiate the display of such information or otherwise provide such information to one or more entities. For example, such entities may include one or more of the following: a client (e.g., for display to a user representative of the client), such as via a Web browser or other application (e.g., a client-side app or module provided by the GDP system 340, such as one of the other modules 349), not shown, that is executing on a corresponding client computing system or device 370; other end-users of the GDP system, such as to publicly provide at least a subset of the analyzed information and/or predicted information to users via computing devices 350 and/or other computing systems 390; other affiliated or third-party systems (not shown), such as for further analysis or manipulation by those other systems and/or for those other systems to display or otherwise provide to the other systems' users at least a subset that is made available by the GDP system to the other systems of the analyzed information and/or predicted information; etc. In at least some such embodiments, the information display or other providing by the GDP system may occur in a real-time or near-real-time manner with respect to corresponding events or activities to which the information corresponds, such as the supplying of corresponding constituent content items (e.g., within seconds, minutes or hours of such events or activities). In addition, in at least some embodiments, the module 347 may display or otherwise provide information to an entity in a manner that is personalized to the entity (e.g., such as based on explicit or implicit preferences previously specified or determined for the entity), and may further optionally allow at least some such entities to configure various types of information to be displayed or otherwise provided, as discussed in greater detail elsewhere, including with respect to 
It will be appreciated that the illustrated computing systems are merely illustrative and are not intended to limit the scope of the present invention. For example, computing system 300 may be connected to other devices that are not illustrated, including through one or more networks such as the Internet or via the Web. More generally, a “client” or “server” computing system or device may comprise any combination of hardware that can interact and perform the described types of functionality, such as when programmed or otherwise configured with software, including without limitation desktop computers, laptop computers, slate computers, tablet computers or other computers, smart phone computing devices and other cell phones, Internet appliances, PDAs and other electronic organizers, database servers, network storage devices and other network devices, wireless phones, pagers, television-based systems (e.g., using set-top boxes and/or personal/digital video recorders and/or game consoles and/or media servers), and various other consumer products that include appropriate inter-communication capabilities. For example, the illustrated system 340 and/or its modules may include executable software instructions and/or data structures in at least some embodiments, which when loaded on and/or executed by particular computing systems or devices may be used to program or otherwise configure those systems or devices, such as to configure processors of those systems or devices. Alternatively, in other embodiments, some or all of the software modules and/or systems may execute in memory on another device and communicate with the illustrated computing system/device via inter-computer communication. In addition, while various items are illustrated as being stored in memory or on storage at various times (e.g., while being used), these items or portions of them can be transferred between memory and storage and/or between storage devices (e.g., at different locations) for purposes of memory management and/or data integrity. In addition, the functionality provided by the illustrated system modules may in some embodiments be combined in fewer modules or distributed in additional modules. Similarly, in some embodiments the functionality of some of the illustrated modules may not be provided and/or other additional functionality may be available.
Thus, in at least some embodiments, the illustrated modules and/or systems are software-based modules/systems including software instructions that, when executed by the CPU(s) 305 or other processor means, program the processor(s) to automatically perform the described operations for that module/system. Furthermore, in some embodiments, some or all of the modules and/or systems may be implemented or provided in other manners, such as at least partially in firmware and/or hardware means, including, but not limited to, one or more application-specific integrated circuits (ASICs), standard integrated circuits, controllers (e.g., by executing appropriate instructions, and including microcontrollers and/or embedded controllers), field-programmable gate arrays (FPGAs), complex programmable logic devices (CPLDs), etc. Some or all of the systems, modules or data structures may also be stored (e.g., as software instructions contents or structured data contents) on a non-transitory computer-readable storage medium, such as a hard disk or flash drive or other non-volatile storage device, volatile or non-volatile memory (e.g., RAM), a network storage device, or a portable media article (e.g., a DVD disk, a CD disk, an optical disk, a flash memory device, etc.) to be read by an appropriate drive or via an appropriate connection. The systems, modules and data structures may also in some embodiments be transmitted as generated data signals (e.g., as part of a carrier wave or other analog or digital propagated signal) on a variety of computer-readable transmission mediums, including wireless-based and wired/cable-based mediums, and can take a variety of forms (e.g., as part of a single or multiplexed analog signal, or as multiple discrete digital packets or frames). Such computer program products may also take other forms in other embodiments. Accordingly, the present invention may be practiced with other computer system configurations.
The routine begins at block 405, where information or a request is received. The routine continues to block 410 to determine if information has been received about one or more user comments (or other user-supplied content items), such as for user comments that have recently or concurrently been supplied to one or more comment sources and sent to the routine 400 by the comment source(s) (e.g., in response to one or more previous requests from the routine 400, such as with respect to blocks 425 or 490). If so, the routine continues to block 415 to store some or all of the received information for later analysis. If it is instead determined in block 410 that the received information or request of block 405 is not user comment information, the routine continues instead to block 420 to determine to retrieve user comments (or other user-supplied content items) from one or more external comment sources, such as to do so on a periodic basis or when the information is needed for a corresponding analysis. If so, the routine continues to block 425 to initiate the retrieval of user comments (or other user-supplied content items) from one or more external comment sources, such as one or more social networking services or other publicly accessible sites at which users may supply information. In the illustrated embodiment, the retrieval of the user comments in block 425 is performed in an asynchronous manner, by initiating requests to those external comment sources and later receiving corresponding responses, although in other embodiments the routine may instead complete the retrieval of particular user comments from particular comment sources in block 425 before continuing.
After blocks 415 or 425, or if it is instead determined in block 420 that the information or request received in block 405 is not to retrieve user comments, the routine continues to block 430. In block 430, the routine determines whether the information or request received in block 405 is to analyze user comments (or other user-supplied content items) to create corresponding comment groups, such as for information just received with respect to block 415 (e.g., for a current time period), or instead for a prior time period of a specified length (e.g., 30 minutes). If so, the routine continues to block 435 to execute a Comment Group Creation Manager routine to perform the analysis and creation of the comment groups, with one example of such a routine being described in greater detail with respect to 
After block 435, or if it is instead determined in block 430 that the information or request received in block 405 is not to analyze comments, the routine continues to block 440 to determine whether the information or request received in block 405 is to analyze user comments (or other user-supplied content items) and comment groups to determine current topics for one or more categories and one or more time periods, optionally with respect to comment groups that were just created in block 435 and/or for user comments just received in block 415. If so, the routine continues to block 445 to execute a Category Analysis Manager routine to determine the current topics for the category(ies) for the one or more time periods, with one example of such a routine being described in greater detail with respect to 
After block 445, or if it is instead determined in block 440 that the information or request received in block 405 is not to determine current topics for one or more categories, the routine continues instead to block 450 to determine whether the information or request received in block 405 is to predict information about future user comments (or other user-supplied content items) that are expected to be received for one or more comment groups and/or categories, such as for one or more future time periods, and optionally based on comment groups that were just created in block 435 and/or for user comments just received in block 415 and/or for category information that was just determined in block 445. If so, the routine continues to block 455 to execute a Comment Prediction Manager routine to quantify one or more aspects of the user comments (or other user-supplied content items) for the comment groups and/or the category(ies), and to generate corresponding predictions based on such quantified information, with one example of such a routine being described in greater detail with respect to 
After block 455, or if is instead determined in block 450 that the information or request received in block 405 is not to predict information about future user comments, the routine continues instead to block 460 to determine whether the information or request received in block 405 is to generate one or more prediction templates based on prior distributed group discussions and corresponding analyses. Such generation of one or more prediction templates may be performed, for example, with respect to information about actual user comments (or other user-supplied content items), comment groups and categories determined in blocks 435 and/or 445, and/or with respect to information about future comments (or other user-supplied content items) that are predicted in block 455. If so, the routine continues to block 465 to execute a Prediction Template Generation Manager routine to generate one or more such prediction templates, with one example of such a routine being described in greater detail with respect to 
After block 465, or if it is instead determined in block 460 that the information or request received in block 405 is not to generate prediction templates, the routine continues to block 470 to determine whether the information or request received in block 405 is to display or otherwise provide information to a user or other entity. Such a request or information may include, for example, a request from the user or other entity for the information, an automated determination that information is available to provide in response to a prior request (e.g., a request to push information to the user or other entity periodically or upon other indicated criteria being satisfied), an automated determination that a predetermined time has been reached or that other indicated criteria are currently satisfied (e.g., based on receipt or determination of current real-time or near-real-time data of one or more types, based on current prediction of future information of one or more types, etc.), etc. If so, the routine continues to block 475 to execute a Results Information Provider Manager routine to display or otherwise provide such information, with one example of such a routine being described in greater detail with respect to 
After block 475, or if it is instead determined in block 470 that the information or request received in block 405 is not to display or otherwise provide information, the routine continues instead to block 485 to determine whether a request corresponding to one or more other operations has been received. If so, the routine continues to block 490 to perform one or more other indicated operations as appropriate. Such other indicated operations may include, for example, one or more of the following non-exclusive list: receiving and storing definitions and other information about categories of interest; receiving and storing information about particular comment sources of interest (e.g., how to obtain comments or other user-supplied content items from them, timing for obtaining comments or other user-supplied content items from them, types of comments or other user-supplied content items available from them, etc.); receiving and storing information about clients of the service (e.g., categories of interest to the client, criteria under which to notify the client of discussion predictions and/or current discussion information of interest, types of automated actions to take under specified circumstances with respect to one or more categories of interest, etc.); etc.
After block 490, or if it is instead determined in block 485 that the information or request received in block 405 is not to perform other indicated operations, the routine continues instead to block 495 to determine whether to continue, such as until an explicit indication to terminate is received. If it is determined to continue, the routine returns to block 405, and otherwise continues to block 499 and ends. In at least some embodiments, the routine may execute in a continuous or near-continuous manner, such as to gather and store information about comments (or other user-supplied content items) as they become available, and to analyze such user comment information during each time period of an indicated length (e.g., every 30 minutes).
The illustrated embodiment of the routine begins at block 505, where user comment information for an indicated time period is retrieved, such as for information received and stored with respect to block 415 of 
After block 510, the routine continues to block 515 to, for each topic or other attribute that is identified in any of the user comments, create a corresponding comment group for the indicated time period that includes user comments matching that topic or other attribute for that time period. The routine then continues to block 525 to, if additional user information sources are available, retrieve additional comments or other user-supplied information from the additional sources that correspond to the topic or other attribute for each created comment group, and to add any such additional retrieved user-supplied information to the corresponding created comment groups.
In some embodiments, the routine may further perform additional optional activities with respect to block 540, to determine additional information about some or all of the created comment groups, such as to analyze the user comments included in some or all comment groups to determine particular author users who have high or low influence for the comment group (e.g., relative to other author users for the same comment group and/or for other comment groups), to analyze the user comments included in some or all comment groups to enable performance of a sentiment analysis with respect to contents of the comments, etc. Information generated by such optional additional activities may be stored and/or used in various manners, including to provide some or all of the generated information to clients, to take additional automated actions based on the generated information, to use some or all of the generated information as part of other analyses and determinations performed by the group discussion prediction service, etc. Additional details related to such additional optional activities are discussed in greater detail elsewhere.
After block 540, the routine continues to block 550 to determine whether to encode summary quantified information for one or more of the created content groups. If so, the routine continues to block 555 to retrieve information about one or more encoding schemes that are available to use, and optionally to indicate particular comments groups that are associated with particular encoding schemes. After block 555, the routine in block 560 further, for each encoding scheme and any associated comments groups (or in some embodiments for each comment group), uses the encoding scheme to generate encoded summary information that quantifies information about constituent content items of the comment group for the time period.
After block 560, or if it was instead determined in block 550 not to generate any encoded summary information, the routine continues to block 590 to store information about the created comment groups for later use, along with any associated determined and/or generated information for such comment groups, such as for information generated in blocks 515 and 525, block 540 and block 560.
After block 590, the routine continues to block 595 to determine whether to continue, such as until an explicit indication to terminate is received. If it is determined to continue, the routine returns to block 505, such as to wait until the next time period of an indicated length is completed, or until corresponding user comment information for such a next time period is available. If it is determined in block 595 to not continue, the routine continues to block 599 and returns, such as to return to block 435 of 
The illustrated embodiment of the routine begins at block 605, where category definition information is retrieved for one or more categories of interest. In block 610, the routine then retrieves user comment information for an indicated time period. In block 615, the routine then, for each category of interest, identifies user comments that have topics or other attributes matching the definition for the category, optionally using a specified minimum certainty threshold. As one example, in some embodiments a category definition may include one or a small number (e.g., four) of definition terms, and a user comment is identified as satisfying the category definition if it includes at least one of those definition terms, while in other embodiments the user comment may need to include all of the indicated definition terms or a specified minimum number of the definition terms to satisfy the minimum certainty threshold with respect to the category.
After block 615, the routine continues to block 620 to, for each category of interest, determine all the comment groups that include any of the user comments that were identified for the category, and to then analyze those determined comment groups based on one or more metrics. After the analysis, the determined comment groups are ranked with respect to the one or more metrics. As one example, the metrics may include the frequency in which comments of each determined comment group appear in the identified user comments for the category, such as to assess a relevance of each comment group to the category. The analysis of the determined comment groups based on the metric(s) may further include, in at least some embodiments, determining an average and a standard deviation with respect to the frequencies for the determined comment groups.
After block 620, the routine continues to block 640 to determine information about a maximum inclusion threshold and/or a minimum inclusion threshold for each category, such as in some embodiments to be based on the information about the determined average frequency for determined comment groups of the category and standard deviation information (e.g., to have the maximum inclusion threshold be one-half of the standard deviation above the average, and/or to have the minimum inclusion threshold be one-half of the standard deviation below the average frequency). After the maximum inclusion threshold and/or minimum inclusion threshold are determined, the determined comment groups for each category that are above the maximum or below the minimum inclusion thresholds are removed from the category, with the remaining determined comment groups reflecting current topics and other attributes that are currently most relevant for the category. Information about the determined comment groups for the category is then stored for later use. In some embodiments, only one of a maximum inclusion threshold and a minimum inclusion threshold may be used.
In some embodiments, the routine may further perform additional optional activities with respect to block 690, to determine additional information about some or all of the categories, such as to analyze the user comments associated with the category to determine particular author users who have high or low influence for the category (e.g., relative to other author users for the same category and/or for other categories), to analyze the user comments associated with the category to enable performance of a sentiment analysis with respect to contents of the user comments, etc. Information generated by such optional additional activities may be stored and/or used in various manners, including to provide some or all of the generated information to clients, to take additional automated actions based on the generated information, to use some or all of the generated information as part of other analyses and determinations performed by the group discussion prediction service, etc. Additional details related to such additional optional activities are discussed in greater detail elsewhere.
After block 690, the routine continues to block 695 to determine whether to continue, such as until an explicit indication to terminate is received. If it is determined to continue, the routine returns to block 605, such as to wait until the next time period of an indicated length is completed, or until corresponding comment group information for such a next time period is available. If it is determined in block 695 to not continue, the routine instead continues to block 699 and returns, such as to return to block 445 of 
The illustrated embodiment of the routine begins at block 705, where information is retrieved about created comment groups for one or more indicated time periods. In block 710, information is then retrieved about the comment groups that have been determined to be relevant for one or more categories for the indicated time periods. After block 710, the routine continues to block 715 to select some or all of the determined comment groups for further analysis, including in some embodiments to select some or all intersections of the determined comment groups with corresponding categories, locations, author users, and/or data sources. For each selected comment group and/or intersection, the routine in block 715 then quantifies information about the associated user-supplied comments for the selected comment group and/or intersection, such as by determining a quantity of matching comments for each of the indicated time periods of interest.
After block 715, the routine continues to block 717 to determine whether to use encoded summary comment group information for the selected comment groups and/or intersections to predict future information for those selected comment groups and/or intersections, such as if such encoded summary information is available for those selected comment groups and/or intersections (e.g., with respect to block 560 of 
In particular, in block 720, the routine continues to optionally retrieve additional determined comment quantity information for the selected comment groups and/or intersections, such as for prior time periods within a sliding time window (e.g., the last 72 hours). In block 725, the routine then, for each selected comment group and intersection, generates a histogram for some or all of the sliding time window using the determined comment quantities for the indicated time periods and any corresponding retrieved determined comment quantities for prior time periods, such as to have a separate value in the histogram for each time period of an indicated length during the sliding time window (and optionally removing any information from a previous histogram for time periods that are no longer part of the sliding time window).
After block 725, the routine continues to block 730 to retrieve information about any defined prediction templates that are available for use in performing predictions. The routine then continues to block 735 to select the next generated histogram, beginning with the first. In block 737, the routine then attempts to match the selected generated histogram to one or more of the defined prediction templates. In block 740, it is determined if any of the prediction templates had a sufficient match for the selected histogram, and if so, the routine then continues to block 745 to select the best matching prediction template, and to use additional portions of the prediction template to predict future comment quantities for one or more future time periods corresponding to the selected histogram. If it is instead determined in block 740 that there were not any matching prediction templates (or no prediction templates were available to attempt to match), the routine continues instead to block 750 to perform a time-weighted trend prediction of future comment quantities for one or more future time periods based on the comment quantities in the generated histogram. It will be appreciated that the matching of a histogram to a prediction template, the use of an additional portion of a matching template to predict future quantities, and the time-weighted trend prediction may each be performed in various manners in various embodiments, as discussed in greater detail elsewhere.
After blocks 745 or 750, the routine continues to block 755 to store the predicted future comment quantity information for subsequent use. In block 760, it is then determined if any additional histograms generated in block 725 remain to be analyzed, and if so returns to block 735 to select a next such generated histogram. Otherwise, the routine continues to block 765 to optionally perform one or more additional activities. Such additional activities may, in at least some embodiments, include providing information about predicted comment quantities and/or about determined current quantities for a generated histogram to one or more corresponding clients or other information sources that previously requested such information, optionally in accordance with any specified criteria by such clients or other information sources. In addition, in at least some embodiments, the routine may, in block 765, further take one or more other automated actions for a generated histogram in accordance with defined criteria and client instructions based at least in part on predicted comment quantities and/or on determined current comment quantities, such as to initiate and add one or more comments to one or more comment sources that include information relevant to a discussion for a category or a comment group associated with the generated histogram (e.g., to provide additional information corresponding to the category or to the topic of the comment group, or to otherwise influence a direction in which the discussion may take place in the future).
If it is determined in block 717 to use encoded summary comment group information for the selected comment groups and/or intersections to predict future information for those selected comment groups and/or intersections, the routine continues to block 770. In block 770, the routine retrieves encoded summary information for the selected comment groups and/or intersections for prior time periods within a sliding time window (e.g., the last 72 hours), such as for information generated in block 560 of 
After block 770, the routine continues to block 772 to retrieve information about any defined prediction templates that are available for use in performing predictions and that include aggregated encoded summary information for multiple time periods corresponding to at least a portion of the time window. For example, if the summary information for a comment group and/or intersection is encoded to represent information about five types of attributes for the constituent content items of the comment group and/or intersection during each of multiple prior time periods, the aggregated group of summary information for the retrieved matching prediction template similarly encodes information about those five types of attributes in at least some embodiments. The routine then continues to block 774 to select the next aggregated group of encoded summary information for one of the selected comment groups and/or intersections, beginning with the first. In block 776, the routine then attempts to match the selected aggregated group of encoded summary information to one or more of the defined prediction templates. In block 778, it is determined if any of the prediction templates had a sufficient match for the selected aggregated group of encoded summary information, and if so, the routine then continues to block 780 to select the best matching prediction template, and to use additional encoded summary information from the prediction template to predict future information for one or more future time periods corresponding to the selected aggregated group of encoded summary information. For example, if the summary information for a comment group and/or intersection is encoded to represent information about five types of attributes for the constituent content items of the comment group and/or intersection during each of multiple prior time periods, and the aggregated group of summary information for a matching prediction template similarly encodes information about those five types of attributes, the additional information from the matching prediction template is used to predict future information about one or more of those five types of attributes. If it is instead determined in block 778 that there were not any matching prediction templates, the routine continues instead to block 720. It will be appreciated that the matching of an aggregated group of encoded summary information to a prediction template, the use of additional encoded summary information of a matching prediction template to predict future information may each be performed in various manners in various embodiments, as discussed in greater detail elsewhere, with the examples discussed with respect to 
After block 780, the routine continues to block 782 to store the predicted future information for subsequent use. In block 784, it is then determined if any additional aggregated groups of encoded summary information generated in block 770 remain to be analyzed, and if so returns to block 774 to select a next such generated aggregated group of encoded summary information. Otherwise, the routine continues to block 790 to optionally perform one or more additional activities, including to provide information about predicted future information to one or more clients or other information sources and/or to further take one or more other automated actions in accordance with defined criteria and client instructions, such as in a manner previously described with respect to block 765.
After blocks 765 or 790, the routine continues to block 795 to determine whether to continue, such as until an explicit indication to terminate is received. If it is determined to continue, the routine returns to block 705, such as to wait until the next time period of an indicated length is completed, or until corresponding to wait until additional information about created comment groups and/or about determined comment groups for categories for such a next time period is available. If it is instead determined in block 795 not to continue, the routine instead continues to block 799 and returns, such as to return to block 455 of 
In particular, 
In blocks 820 and 825, each of the histogram portions is analyzed to determine whether the information in that histogram portion is able to accurately predict additional user comments that will be received, such as if the user comment information represented in that histogram portion represents a typical or otherwise repeated pattern of user comments. In particular, in block 820, for all but the last of the histogram portions, a time-weighted trend prediction is performed using the information in the histogram portion, and additional information about actual user comments received (e.g., from other later portions of the generated histogram) is used to determine whether that trend prediction is consistent with that actual user comment information. In block 825, for the last of the histogram portions, a time-weighted trend prediction is performed using the information in the histogram portion, and additional information about actual user comments received (e.g., from one or more time periods after the sliding time window corresponding to the generated histogram) is used to determine whether that trend prediction is consistent with that actual user comment information—in some situations, such as if the selected histogram was just completed and represents the most current comment information that is available, the performance of the analysis of block 825 may be deferred until additional user comment information is received and analyzed. The time-weighted trend predictions performed in blocks 820 and 825 may, for example, be similar to or the same as that performed in block 750 of routine 700, and the determination of whether the trend prediction is consistent may be performed with respect to a specified degree of error.
After block 825, the routine continues to block 835 to, for each histogram portion having a prediction that is consistent with actual information, store information about the histogram portion (and optionally the subsequent additional information used to verify the prediction) as a candidate histogram template for later use in generating other predictions, such as with respect to block 745 of routine 700.
In some embodiments, the routine may further perform additional optional activities with respect to block 855, to assess whether one or more prior predictions made from use of a prediction template for the histogram were accurate, such as with respect to block 745 of routine 700, and to corresponding alter use of the prediction template. For example, if the prior use of a prediction template was accurate, an accuracy rate for that prediction template may be increased accordingly, and the prediction template may continue to be used, optionally with an increased confidence in its accuracy based on the increased accuracy rate. Conversely, if the prior use of a prediction template was not accurate, an accuracy rate for that prediction template may be decreased accordingly, optionally with a corresponding decreased confidence in its accuracy based on the increased accuracy rate, and the prediction template may be taken out of use if its accuracy rate is too low. The prior use of a prediction template that may be assessed may in some situations include waiting until later actual information is available about user comments that are received, for comparison to the prior prediction made for the histogram. After block 855, the routine continues to block 860 to determine whether more generated histograms are available, and if so returns to block 810 to select a next such histogram.
If it is instead determined in block 860 that no more histograms are available, the routine continues to block 895 to determine whether to continue, such as until an explicit indication to terminate is received. If it is determined to continue, the routine returns to block 805, such as to wait until additional information about generated histograms and/or corresponding actual comment information corresponding to predictions are available. If it is instead determined in block 895 not to continue, the routine instead continues to block 899 and returns, such as to return to block 465 of 
In block 915, the routine then separates the generated aggregation into one or more initial portions that each correspond to a time window having a time length of one or more indicated lengths, including to optionally have multiple initial overlapping portions, such as an initial 6 hours of the aggregation, an initial 24 hours of the aggregation, an initial 48 hours of the aggregation, and an initial 72 hours of the aggregation—thus, if the aggregation includes, for example, a sliding time window of at least 72 hours (or 3 days), the multiple overlapping portions may correspond to the first day, the first and second days, and all 3 days. In other embodiments and situations, other types of aggregation portions and time lengths may be used. The routine further identifies one or more additional subsequent portions of the aggregation for each of the initial portions, with each additional subsequent portion having a time length of one or more indicated prediction lengths, such as a next 6 hours of the aggregation, a next 24 hours of the aggregation, a next 48 hours of the aggregation, and a next 72 hours of the aggregation. Thus, as one example, the initial portion may be for 72 hours, and the additional subsequent portion may be for the subsequent 72 hours, such as if at least 144 hours of encoded summary information are available in the aggregation. Other non-exclusive examples include the following: the initial portion is for 72 hours, and the additional subsequent portion is for the subsequent 24 hours; the initial portion is for 24 hours, and the additional subsequent portion is for the subsequent 24 hours (thus, the additional subsequent portion in this example may be part of the initial portion in the prior example in which the initial portion is for 72 hours); the initial portion is for 24 hours, and the additional subsequent portion is for the subsequent 6 hours; the initial portion is for 24 hours, and the additional subsequent portion is for the subsequent 72 hours; etc.
In block 920, each of the initial portion and additional subsequent portion combinations is optionally analyzed to determine whether the information in that additional portion is able to accurately predict additional user content items that will be received after data matching the initial portion has occurred, such as by comparing that combination to other prior actual data having a matching initial portion to see if the additional portion of the combination matches subsequent actual data from that prior actual data—such a comparison may show, for example, whether the combination being considered represents a typical or otherwise repeated pattern of user comments or other content items. The degree of match between corresponding initial portions and/or corresponding additional portions of a current candidate combination and prior actual data may be performed in various manners, as discussed in greater detail elsewhere—if the comparison of block 920 is performed, a current candidate combination may be retained as a candidate only if it sufficiently matches a specified quantity (e.g., one, more than one, etc.) of prior sets of actual data, such as with respect to a specified degree of error.
After block 920, the routine continues to block 935 to, for each combination of an initial portion and an additional subsequent portion that is determined in block 920 to be sufficiently representative, or for each combination of an initial portion and an additional subsequent portion determined in block 915 if the comparison in block 920 is not performed, store information about the combination of aggregated encoded summary information for use as part of a candidate prediction template for later use in generating other predictions, such as with respect to block 780 of routine 700. After block 935, the routine continues to block 960 to determine whether more generated aggregations are available, and if so returns to block 910 to select a next such aggregation.
In some embodiments, the routine may further perform additional optional activities with respect to block 965 if no more aggregations are determined in block 960 to be available, in order to assess whether one or more prior predictions made from use of a prediction template using aggregated encoded summary information were accurate, such as with respect to block 780 of routine 700, and to corresponding alter use of that prediction template. For example, if the prior use of a prediction template was accurate, an accuracy rate for that prediction template may be increased accordingly, and the prediction template may continue to be used, optionally with an increased confidence in its accuracy based on the increased accuracy rate. Conversely, if the prior use of a prediction template was not accurate, an accuracy rate for that prediction template may be decreased accordingly, optionally with a corresponding decreased confidence in its accuracy based on the increased accuracy rate, and the prediction template may be taken out of use as a candidate if its accuracy rate is too low.
After block 965, the routine continues to block 995 to determine whether to continue, such as until an explicit indication to terminate is received. If it is determined to continue, the routine returns to block 905, such as to wait until additional information about generated aggregations and/or corresponding actual comment information corresponding to predictions are available. If it is instead determined in block 995 not to continue, the routine instead continues to block 999 and returns, such as to return to block 465 of 
The routine begins at block 9005, where information is obtained about a user or other entity to whom to display or otherwise provide information, optionally including one or more indications of interest. Such obtained information may, for example, be based on a request received from the user or other entity (such as to optionally indicate one or more time periods of interest, a type of information view of interest, one or more classifications and/or categories and/or comment groups of interest, etc.), or instead may be determined based on expiration of a time associated with providing such information, an automated determination to provide such information (e.g., a determination that information of one or more indicated types is available to push to the user or other entity, such as based on a prior request; a determination that one or more other types of indicated criteria are currently satisfied, such as based on receipt or determination of current real-time or near-real-time data of one or more types, based on current prediction of future information of one or more types; etc.), etc.
After block 9005, the routine continues to block 9010 to optionally retrieve information about any configurations previously specified by the user or other entity, and/or any preferences previously determined for the user or other entity (e.g., based on explicit indications of preferences; based on inferred indications of preferences from prior actions of the user or other entity; etc.). For example, a first user may specify that he/she is interested in beginning interactions with a classification-level view that includes information about one or more indicated classifications of interest, while a second user may specify that he/she is interested in beginning interactions with a category-level view that includes information about one or more indicated categories of interest, while a third user may specify that he/she is interested in beginning interactions with a comment group-level view that includes information about one or more indicated comment groups of interest, etc. User or entity configurations or preferences may further include particular selected factors whose information is to be displayed in particular manners, as well as other information about types of information to expose or hide at particular times (e.g., legends, keys, scale information, tags and other visual indicators, user-selectable controls, etc.), as discussed in greater detail elsewhere.
After block 9010, the routine continues to block 9015 to determine particular types of information to display or otherwise provide at the current time, including a current information view, one or more current, past and/or future time periods for which to display information, particular classifications and/or categories and/or comment groups, particular types of selected factors to use, etc. In some embodiments, some or all such information may be user-configurable or otherwise personalized to a particular user or other entity, and in some embodiments, some or all such information may be predefined (e.g., defaults that can be overridden by the user or other entity, fixed information that cannot be overridden by the user of other entity, etc.)—as discussed with respect to blocks 9005 and 9010, at least some of the determined types of information may be obtained by operations performed with respect to those blocks. In block 9020, the routine then retrieves analyzed and/or predicted information of the types determined in block 9015, whether by accessing previously generated and stored information, interacting with one or more other modules 342-348 to cause the modules to dynamically generate and/or provide such information, etc.
In block 9025, the routine then prepares the retrieved information for display or other providing, such as to determine and generate information for particular visual aspects to be included for display to the user, to determine and generate tags or other structured data to be included in information being programmatically provided to another automated system, etc. The routine then continues to block 9030 to determine user-selectable and user-manipulatable controls, if any, to include in a display of the current information view, such as in a manner based on previous configuration and/or preferences of the user or other entity, based on predefined defaults or fixed controls, based on the current information view, etc. In block 9035, the routine then displays or otherwise provides the information to the user or other entity, such as by sending the information to a client device of the user or other entity for display using a browser or other client-side application executing on the client device. As discussed in greater detail elsewhere, a variety of visual aspects and selected factors may be used as part of the display. As one illustrative non-limiting example, the information displayed may be a category-level view that includes information about a particular category and about particular comment groups within that category, by displaying information for each of the comment groups based on that comment group's constituent content items for a current time period and a next future time period, and with the visual aspects and the corresponding selected factors that are used as follows: 5 visual aspects that include size, horizontal location, vertical location, color, and an associated halo; and 5 selected factors for those 5 visual aspects of current total quantity of constituent content items, current aggregate sentiment, current rate of change in total quantity, current rate of change in total quantity, and predicted total quantity of constituent content items for the future time period, respectively (i.e., size represents current total quantity, horizontal location represents aggregate sentiment, vertical location and color each represent current rate of change in total quantity, and halo represents predicted total quantity). As another illustrative non-limiting example, the information displayed may be a classification-level view that includes information about one or more classifications and about particular categories within each such classification (and optionally about particular comments groups within one or more such categories), by displaying information for each of the categories (and possibly comment groups) based on that category's (and possibly comment group's) constituent content items for a current time period and a next future time period, and with the visual aspects and the corresponding selected factors that are used as follows: 5 visual aspects that include size, horizontal location, vertical location, color, and an associated halo; and 5 selected factors for those 5 visual aspects of current total quantity of constituent content items, current aggregate sentiment, current rate of change in total quantity, current rate of change in total quantity, and predicted total quantity of constituent content items for the future time period, respectively (i.e., size represents current total quantity, horizontal location represents aggregate sentiment, vertical location and color each represent current rate of change in total quantity, and halo represents predicted total quantity).
In the illustrated embodiment, the routine then waits in block 9040 until a user instruction or other indication is received, and then continues to block 9050 to determine whether the instruction or indication results in the display of new information, and if so returns to block 9015 to initiate that display. For example, a user instruction may be received that selects a particular information view, classification, category, comment group and/or content item for which additional information is to be retrieved or displayed, or alternatively a user instruction may be received that modifies information being currently displayed in one or more other manners (e.g., to indicate a new selected factor for which visual aspects are to be provided, to indicate to advance to a previous or next page, etc.). Alternatively, the indication received in block 9040 may be initiated in other manners, such as an indication that new analyzed and/or predicted information corresponding to the current display is available for display, an indication that a timer has expired (e.g., if the display is updated periodically, such as every time period, every 5 minutes, every minute, every second, etc.), an indication from the GDP system or service (e.g., from another module of the GDP system) to update the display, etc.
If it is instead determined in block 9050 that the instruction or indication received in block 9040 does not result in the display of new information, the routine continues instead to block 9090 to optionally perform one or more other indicated operations as appropriate, and to then continue to block 9095. For example, if a user or other entity indicates that the current information display session is complete (e.g., based on a user logout, inactivity for a predetermined amount of time, etc.), the routine may continue to block 9095 and then to block 9099. Alternatively, the user or other entity may provide information to be stored for later use, indicate a change in a current user or other entity, etc.
After block 9090, the routine continues to block 9095 to determine whether to continue, such as until an explicit indication to terminate is received. If it is determined to continue, the routine returns to block 9005, such as to wait until a next request or other indication is received. If it is instead determined in block 9095 not to continue, the routine instead continues to block 9099 and returns, such as to return to block 475 of 
As discussed in greater detail elsewhere, in at least some such embodiments, the information display or other providing of routine 9000 may occur in a real-time or near-real-time manner with respect to corresponding events or activities to which the information corresponds (e.g., within seconds, minutes or hours of such events or activities), such as the receipt of constituent content items on based the displayed information is based at least in part. A display of information may, for example, indicate multiple categories, and visually indicate information about relative or absolute values for respective categories based on each of multiple selected factors, such as based on use of one or more of the following information for a displayed representation of a category: size, horizontal location, vertical location, color, an associated halo, horizontal movement, vertical movement, displayed tags or other visual indicators, etc. —as one example, such information may be provided for a current time period and/or next future time period and for selected factors that include total quantity of constituent content items, rate of change in total quantity, and aggregate sentiment. A display of information may, for example, indicate multiple comment groups that each represents an associated topic or other attribute of constituent content items in the comment group (whether instead of or in addition to one or more category indicators), and visually indicate information about relative or absolute values for respective comment groups based on each of multiple selected factors, such as based on use of one or more of the following information for a displayed representation of a category: size, horizontal location, vertical location, color, an associated halo, horizontal movement, vertical movement, displayed tags or other visual indicators, etc. —as one example, such information may be provided for a current time period and/or next future time period and for selected factors that include total quantity of constituent content items, rate of change in total quantity, and aggregate sentiment. Additional details related to such display of information about analyzed aspects of distributed group discussions and/or about further predicted aspects of the discussions is included herein, including with respect to 
It will also be appreciated that in some embodiments the functionality provided by the routines discussed above may be provided in alternative ways, such as being split among more routines or consolidated into fewer routines. Similarly, in some embodiments illustrated routines may provide more or less functionality than is described, such as when other illustrated routines instead lack or include such functionality respectively, or when the amount of functionality that is provided is altered. In addition, while various operations may be illustrated as being performed in a particular manner (e.g., in serial or in parallel) and/or in a particular order, those skilled in the art will appreciate that in other embodiments the operations may be performed in other orders and in other manners. It will similarly be appreciated that the data structures discussed above may be structured in different manners, including for databases or user interface screens/pages or other types of data structures, such as by having a single data structure split into multiple data structures or by having multiple data structures consolidated into a single data structure. Similarly, in some embodiments illustrated data structures may store more or less information than is described, such as when other illustrated data structures instead lack or include such information respectively, or when the amount or types of information that is stored is altered.
Non-exclusive example embodiments described herein are further described in the following clauses.
A. A computer-implemented method comprising:
obtaining, by one or more configured computing systems, information about one or more distributed group discussions having a plurality of content items, wherein the plurality of content items include multiple content items for each of one or more time periods, and wherein the obtained information includes, for each time period, information about one or more categories that correspond to the distributed group discussions and/or about one or more attributes of the multiple content items for the time period;
determining, by the one or more configured computing systems, multiple selected factors of the obtained information to display to one or more users, and determining multiple visual aspects to use to display the multiple selected factors;
generating, by the one or more configured computing systems, information for display to the one or more users, the generated information including, for each of the multiple selected factors, a subset of the obtained information that corresponds to the selected factor and that is represented using at least one of the multiple visual aspects; and
providing, by the one or more configured computing systems, the generated information for display to the one or more users.
B. The method of clause A wherein the one or more time periods include at least one current or recent time period and at least one future time period.
C. The method of clause A wherein the one or more time periods are part of a time window having multiple prior time periods.
D. The method of clause A wherein the one or more time periods are part of a time window having multiple future time periods.
E. The method of any of clauses A-D wherein the one or more configured computing systems are part of a group discussion prediction service.
F. The method of any of clauses A-E wherein the plurality of content items include a plurality of comments supplied by a plurality of users in a plurality of geographical locations to a plurality of social networking sites, the plurality of comments being at least some of the distributed group discussions.
G. The method of any of clauses A-F wherein the plurality of content items are supplied by a plurality of users to a plurality of data sources, the plurality of content items being at least some of the distributed group discussions.
H. The method of any of clauses A-G wherein the obtained information includes information about multiple attributes, and wherein at least some of the multiple attributes are conversation topics.
I. The method of any of clauses A-H wherein the obtained information includes information about multiple attributes, and wherein at least some of the multiple attributes are terms supplied by users within the plurality of content items.
J. The method of any of clauses A-I wherein the obtained information includes information about multiple attributes, and wherein at least some of the multiple attributes are from metadata associated with the plurality of content items.
K. The method of any of clauses A-J wherein the obtained information includes information about multiple categories.
L. The method of any of clauses A-K wherein the multiple selected factors include at least one of a group including total quantity of content items during a time period that are associated with a comment group representing at least one of the one or more attributes, rate of change in total quantity of content items between two or more time periods that are associated with a comment group representing at least one of the one or more attributes, aggregate sentiment of content items during a time period that are associated with a comment group representing at least one of the one or more attributes, total quantity of content items during a time period that are associated with at least one of the one or more categories, rate of change in total quantity of content items between two or more time periods that are associated with at least one of the one or more categories, and aggregate sentiment of content items during a time period that are associated with at least one of the one or more categories.
M. The method of any of clauses A-L wherein the multiple selected factors include total quantity of content items during a time period that are associated with a comment group representing at least one of the one or more attributes, rate of change in total quantity of content items between two or more time periods that are associated with a comment group representing at least one of the one or more attributes, and aggregate sentiment of content items during a time period that are associated with a comment group representing at least one of the one or more attributes.
N. The method of any of clauses A-M wherein the multiple selected factors include total quantity of content items during a time period that are associated with at least one of the one or more categories, rate of change in total quantity of content items between two or more time periods that are associated with at least one of the one or more categories, and aggregate sentiment of content items during a time period that are associated with at least one of the one or more categories.
O. The method of any of clauses A-N wherein the multiple visual aspects include at least one of a group including absolute size of a displayed item, relative size between two or more displayed items, absolute vertical location of a displayed item, relative vertical location of a displayed item to at least one other location, absolute horizontal location of a displayed item, relative horizontal location of a displayed item to at least one other location, absolute vertical movement of a displayed item, relative vertical movement of a displayed item to at least one location, absolute horizontal movement of a displayed item, relative horizontal movement of a displayed item to at least one location, a halo associated with a displayed item, a shadow associated with a displayed item, color of a displayed item, highlighting of a displayed item, bolding of a displayed item, italicizing of a displayed item, underlining of a displayed item, displayed textual content, and displayed graphical content.
P. The method of any of clauses A-O wherein the multiple visual aspects include absolute size of a displayed item, absolute vertical location of a displayed item, absolute horizontal location of a displayed item, a halo associated with a displayed item, and color of a displayed item.
Q. The method of any of clauses A-P wherein the multiple visual aspects include relative size between two or more displayed items, relative vertical location of a displayed item to at least one other location, relative horizontal location of a displayed item to at least one other location, a halo associated with a displayed item, and color of a displayed item.
R. The method of any of clauses A-Q wherein each of the multiple selected factors is associated with one of the multiple visual aspects.
S. The method of any of clauses A-R wherein one or more of the multiple selected factors is configurable by at least one of the one or more users.
T. The method of any of clauses A-S wherein one or more of the multiple visual aspects is configurable by at least one of the one or more users.
U. The method of any of clauses A-T wherein one or more of the multiple selected factors is automatically personalized for at least one of the one or more users.
V. The method of any of clauses A-U wherein one or more of the multiple visual aspects is automatically personalized for at least one of the one or more users.
W. The method of any of clauses A-V wherein the multiple selected factors include total quantity of content items during a time period that are associated with at least one of the one or more categories, rate of change in total quantity of content items between two or more time periods that are associated with at least one of the one or more categories, and aggregate sentiment of content items during a time period that are associated with at least one of the one or more categories, wherein the multiple visual aspects include relative size between two or more displayed items, relative vertical location of a displayed item to at least one other location, relative horizontal location of a displayed item to at least one other location, a halo associated with a displayed item, and color of a displayed item, wherein the total quantity for a current or past time period having actual data is represented using the relative size, wherein the total quantity for a future time period having predicted data is represented using the halo, wherein the rate of change of quantity is represented using at least one of the vertical location and the color, and wherein the aggregate sentiment is represented using the relative horizontal location.
X. The method of any of clauses A-W wherein the providing of the generated information includes initiating the display of the generated information to the one or more users on one or more client devices of the one or more users.
Y. A computer-implemented method comprising:
receiving, by a configured client device of a user, information generated by one or more configured computing systems, wherein the received information is about one or more distributed group discussions having a plurality of content items for one or more time periods, wherein the received information reflects information about one or more categories that correspond to the distributed group discussions and/or about one or more attributes of at least some of the plurality of content items, and wherein the generated information is based on multiple selected factors and multiple visual aspects such that the generated information includes, for each of the multiple selected factors, a subset of the obtained information that corresponds to the selected factor and that is represented using at least one of the multiple visual aspects; and
initiating, by the configured client device, display of the received information to the user.
Z. The method of clause Y combined with any of clauses A-X.
AA. A configured system comprising:
one or more hardware processors of one or more computing systems; and
one or more modules that are configured to, when executed by at least one of the one or more hardware processors, perform the method of any of clauses A-Z.
BB. A non-transitory computer-readable medium having stored contents that, when executed, configure a computing system to perform the method of any of clauses A-Z.
From the foregoing it will be appreciated that, although specific embodiments have been described herein for purposes of illustration, various modifications may be made without deviating from the spirit and scope of the invention. Accordingly, the invention is not limited except as by specified claims and the elements recited therein. In addition, while certain aspects of the invention may be presented in certain claim forms at certain times, the inventors contemplate the various aspects of the invention in any available claim form. For example, while only some aspects of the invention may be recited as being embodied in a computer-readable medium at particular times, other aspects may likewise be so embodied.
Zimmermann, Bernhard G., Browning, Randolph, Snelling, David Aaron, Papp, Stefan Nicholas, Bressler, Benjamin Ralph, Young, Jr., James Robert, Harrison, Trinity Alana
| Patent | Priority | Assignee | Title | 
| 10061761, | Jul 22 2016 | International Business Machines Corporation | Real-time dynamic visual aid implementation based on context obtained from heterogeneous sources | 
| 10090002, | Dec 11 2014 | International Business Machines Corporation | Performing cognitive operations based on an aggregate user model of personality traits of users | 
| 10282409, | Dec 11 2014 | International Business Machines Corporation | Performance modification based on aggregation of audience traits and natural language feedback | 
| 10366707, | Dec 11 2014 | International Business Machines Corporation | Performing cognitive operations based on an aggregate user model of personality traits of users | 
| 10540787, | Apr 19 2017 | AVEVA SOFTWARE, LLC | Automated trend chart graphical display generation in a historian system | 
| 10558706, | Dec 17 2014 | YAHOO ASSETS LLC | Method and system for determining user interests based on a correspondence graph | 
| 10572524, | Feb 29 2016 | Microsoft Technology Licensing, LLC | Content categorization | 
| 10592539, | Jul 11 2014 | TWITTER, INC | Trends in a messaging platform | 
| 10592574, | May 05 2015 | Snap Inc. | Systems and methods for automated local story generation and curation | 
| 10601749, | Jul 11 2014 | TWITTER, INC | Trends in a messaging platform | 
| 10628737, | Jan 13 2017 | Verizon Patent and Licensing Inc | Identifying constructive sub-dialogues | 
| 11108717, | Jul 11 2014 | Twitter, Inc. | Trends in a messaging platform | 
| 11244013, | Jun 01 2018 | International Business Machines Corporation | Tracking the evolution of topic rankings from contextual data | 
| 11334715, | Dec 13 2016 | Kabushiki Kaisha Toshiba; Toshiba Digital Solutions Corporation | Topic-identifying information processing device, topic-identifying information processing method, and topic-identifying computer program product | 
| 11392633, | May 05 2015 | Snap Inc. | Systems and methods for automated local story generation and curation | 
| 11449539, | May 05 2015 | Snap Inc. | Automated local story generation and curation | 
| 11500908, | Jul 11 2014 | Twitter, Inc. | Trends in a messaging platform | 
| 11748797, | Feb 16 2017 | The University of Tulsa | System and method for providing recommendations to a target user based upon review and ratings data | 
| 9813484, | Dec 31 2014 | MOTOROLA SOLUTIONS, INC. | Method and apparatus analysis of event-related media | 
| 9881094, | May 05 2015 | SNAP INC | Systems and methods for automated local story generation and curation | 
| 9992150, | Dec 15 2014 | Meta Platforms, Inc | Event planning on online social networks | 
| Patent | Priority | Assignee | Title | 
| 8176046, | Oct 22 2008 | AMERICAN EXPRESS KABBAGE INC | System and method for identifying trends in web feeds collected from various content servers | 
| 8261193, | Apr 21 2009 | ARIS USA, INC | Method and system for capturing mashup data for collective intelligence and user-generated knowledge | 
| 8285598, | May 12 1999 | VALENTINE COMMUNICATIONS LLC | Promoting offers through social network influencers | 
| 8312056, | Sep 13 2011 | Xerox Corporation | Method and system for identifying a key influencer in social media utilizing topic modeling and social diffusion analysis | 
| 8321792, | Apr 21 2009 | ARIS USA, INC | Method and system for capturing and using mashup data for trend analysis | 
| 8364171, | Apr 08 2007 | Meta Platforms, Inc | Systems and methods to determine the current popularity of physical business locations | 
| 8380607, | Dec 17 2010 | Indiana University Research and Technology Corporation | Predicting economic trends via network communication mood tracking | 
| 8732584, | Nov 09 2010 | Xerox Corporation | System and method for generating an information stream summary using a display metric | 
| 20050102619, | |||
| 20100100537, | |||
| 20110276681, | |||
| 20110302006, | |||
| 20110313842, | |||
| 20110314007, | |||
| 20110320373, | |||
| 20120011158, | |||
| 20120016948, | |||
| 20120117475, | |||
| 20120143694, | |||
| 20120150645, | |||
| 20120166278, | |||
| 20120323908, | |||
| 20130036107, | |||
| 20130198204, | |||
| 20140012927, | |||
| 20140278676, | 
| Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc | 
| Mar 07 2014 | Blab, Inc. | (assignment on the face of the patent) | / | |||
| Sep 22 2014 | SNELLING, DAVID AARON | BLAB, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034083/ | 0973 | |
| Sep 29 2014 | BROWNING, RANDOLPH | BLAB, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034083/ | 0973 | |
| Sep 29 2014 | PAPP, STEFAN NICHOLAS | BLAB, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034083/ | 0973 | |
| Sep 29 2014 | ZIMMERMANN, BERNHARD G | BLAB, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034083/ | 0973 | |
| Sep 29 2014 | BRESSLER, BENJAMIN RALPH | BLAB, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034083/ | 0973 | |
| Oct 29 2014 | HARRISON, TRINITY ALANA | BLAB, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034083/ | 0973 | |
| Oct 30 2014 | YOUNG, JAMES ROBERT, JR | BLAB, INC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 034083/ | 0973 | |
| Jun 15 2016 | BLAB, INC | Pacific Western Bank | SECURITY INTEREST SEE DOCUMENT FOR DETAILS | 038973/ | 0842 | |
| Jun 29 2017 | BLAB, INC | Deloitte Development LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 042925/ | 0624 | |
| Jun 29 2017 | Pacific Western Bank | Deloitte Development LLC | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 042925/ | 0551 | 
| Date | Maintenance Fee Events | 
| Aug 01 2017 | STOL: Pat Hldr no Longer Claims Small Ent Stat | 
| Apr 23 2020 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity. | 
| Sep 16 2024 | REM: Maintenance Fee Reminder Mailed. | 
| Mar 03 2025 | EXP: Patent Expired for Failure to Pay Maintenance Fees. | 
| Date | Maintenance Schedule | 
| Jan 24 2020 | 4 years fee payment window open | 
| Jul 24 2020 | 6 months grace period start (w surcharge) | 
| Jan 24 2021 | patent expiry (for year 4) | 
| Jan 24 2023 | 2 years to revive unintentionally abandoned end. (for year 4) | 
| Jan 24 2024 | 8 years fee payment window open | 
| Jul 24 2024 | 6 months grace period start (w surcharge) | 
| Jan 24 2025 | patent expiry (for year 8) | 
| Jan 24 2027 | 2 years to revive unintentionally abandoned end. (for year 8) | 
| Jan 24 2028 | 12 years fee payment window open | 
| Jul 24 2028 | 6 months grace period start (w surcharge) | 
| Jan 24 2029 | patent expiry (for year 12) | 
| Jan 24 2031 | 2 years to revive unintentionally abandoned end. (for year 12) |