Methods and apparatus, including computer program products for tracking usage of business data elements include receiving an electronic document that includes instances of business data elements and identifying an instance of a particular business data element in the electronic document. A counter associated with the particular business data element is incremented based on the identified instance. The electronic document has a format defined by a business communication schema, and the business communication schema includes a set of predefined business data elements for use in transacting electronic business.

Patent
   7711676
Priority
Nov 12 2004
Filed
Nov 12 2004
Issued
May 04 2010
Expiry
Nov 20 2026
Extension
738 days
Assg.orig
Entity
Large
35
100
all paid
16. A system for tracking usage of business data elements, the system comprising:
a) means for identifying instances of business data elements in a first plurality of electronic documents being communicated between business entities during a first time duration, each electronic document in the first plurality having a format corresponding to a business communication schema, wherein the business communication schema includes a set of predefined business data elements for use in electronically communicating business data from a first business entity to a second business entity;
b) means for tracking usage of the business data elements across the first plurality of electronic documents in the set of predefined business data elements in response to identifying instances of the business data elements in the set of predefined business data elements in each electronic document;
c) means for comparing a quantity of instances of the business data elements with threshold values, each threshold value corresponding to a particular business data element;
d) means for removing the business data elements from the business communication schema when the quantity of instances of the business data elements is less than their corresponding threshold values during the first time duration;
e) means for presenting a user with at least one of the quantity of instances and the business data elements prior to removing the business data elements from the business communication schema based on the quantity of instances being less than their corresponding threshold values;
f) means for receiving an instruction from the user to remove the business data elements from the business communication schema;
g) means for resetting the quantity of instance based on the quantity being at least equal to the corresponding threshold values;
h) means for initiating a second time duration upon resetting of the quantity; and
i) means for repeating the functions of b) through d) for a second plurality of electronic documents being communicated between business entities during the second time duration.
1. A computer program product, embodied in a computer-readable medium, the computer program product being operable to cause data processing apparatus to:
a) receive, during a first time duration, a first electronic document being communicated between business entities, the first electronic document comprising instances of a plurality of business data elements, the first electronic document having a format corresponding to a business communication schema, wherein the business communication schema includes a set of predefined business data elements for use in electronically communicating business data from a first business entity to a second business entity;
b) identify an instance of a particular business data element in the first electronic document;
c) increment a counter associated with the particular business data element in response to identifying the instance of the particular business data element in the first electronic document;
d) receive, during the first time duration, a second electronic document being communicated between business entities;
e) identify an instance of the particular business data element in the second electronic document;
f) increment the counter associated with the particular business data element in response to identifying the instance of the particular business data element in the second electronic document;
g) subsequent to elapse of the first time duration, compare the counter with a threshold value;
h) based on the counter being less than the threshold value, delete the particular business data element from the business communication schema;
i) prior to deleting the particular business data element from the business communication schema, notify a user if the counter is less than the threshold value;
j) receive an instruction from the user to remove the particular business data element from the business communication schema:
k) reset the counter when the counter is at least equal to the threshold value;
l) begin a second time duration upon resetting of the counter; and
m) repeat operations b) through h) for a third electronic document being communicated between business entities during the second time duration.
22. A computer-implemented method for tracking usage of business data elements, the method comprising:
a) receiving, during a first time duration, a first electronic document being communicated between business entities, the first electronic document comprising instances of a plurality of business data elements, the first electronic document having a format corresponding to a business communication schema, wherein the business communication schema includes a set of predefined business data elements for use in electronically communicating business data from a first business entity to a second business entity;
b) identifying an instance of a particular business data element in the first electronic document;
c) incrementing a counter associated with the particular business data element in response to identifying the instance of the particular business data element in the first electronic document;
d) receiving, during the first time duration, a second electronic document being communicated between business entities;
e) identifying an instance of the particular business data element in the second electronic document;
f)incrementing the counter associated with the particular business data element in response to identifying the instance of the particular business data element in the second electronic document;
g) subsequent to elapse of the first time duration, comparing the counter with a threshold value;
h) based on the counter being less than the threshold value, removing the particular business data element from the business communication schema
i) prior to removing the particular business data element from the business communication schema based on the counter being less than the threshold value, presenting a user with at least one of the counter and the particular business data element;
j) receiving an instruction from the user to remove the particular business data element from the business communication schema;
k) resetting the counter based on the counter being at least equal to the threshold value;
l) initiating a second time duration upon resetting of the counter; and
m) repeating steps b) through h) for a third electronic document being communicated between business entities during the second time duration.
2. The computer program product of claim 1 wherein the particular business data element is a first business data element of the plurality of business data elements, the counter is a first counter of a plurality of counters associated with the plurality of business data elements, wherein the first counter is associated with the first business data element, the computer program product being operable to cause data processing apparatus to:
identify an instance of the first business data element in the first electronic document;
increment the first counter in response to identifying the instance of the first business data element;
identify an instance of a second business data element of the plurality of business data elements in the first electronic document; and
increment a second counter of the plurality of counters, wherein the second counter is associated with the second business data element in response to identifying The instance of the second business data element.
3. The computer program product of claim 1 wherein the first electronic document is received from a translation module, the translation module operable to translate the first electronic document into the format corresponding to the business communication schema from a different communication schema format.
4. The computer program product of claim 3 wherein one of the formats corresponding to the business communication schema or the different communication schema format comprises a format compatible with Core Components Technical Specification.
5. The computer program product of claim 3 wherein one of the different communication schema format or the format corresponding to the business communication schema comprises an intermediary format in a business document translation infrastructure, the intermediary format comprising a format used for translating electronic documents from a format corresponding to a first communication schema into a format corresponding to a second communication schema, with the first communication schema and the second communication schema selected from a set of available communication schemas.
6. The computer program product of claim 3 wherein the computer program product is operable to cause data processing apparatus to translate the first electronic document from the format corresponding to the business communication schema into a different communication schema format.
7. The computer program product of claim 3 wherein the counter is a first counter of a plurality of counters associated with the plurality of business data elements, wherein the plurality of counters associated with the plurality of business data elements are used by the translation module in selecting business data elements for translating electronic documents.
8. The computer program product of claim 7 wherein the translation module uses the plurality of counters to determine preferences among similar business data elements for translating electronic documents.
9. The computer program product of claim 2 wherein the plurality of counters are used to perform one of modeling a business communication schema or modifying a business communication schema.
10. The computer program product of claim 2 wherein the plurality of counters associated with the plurality of business data elements are used to identify business data elements as candidates for deletion from the business communication schema.
11. The computer program product of claim 1 wherein the counter associated with the particular business data element is incremented only once in response to identifying more than one instance of the particular business data element.
12. The computer program product of claim 1 wherein the computer program product is operable to cause data processing apparatus to:
identify, in the first electronic document, a plurality of instances of the particular business data element; and
increment the counter associated with the particular business data element for each identified instance of the plurality of instances.
13. The computer program product of claim 2 wherein the plurality of counters is used to identify usage trends of business data elements.
14. The computer program product of claim 1, the computer program product being operable to cause data processing apparatus to store a date for the first electronic document in response to identifying an instance of the particular business data element, wherein the stored date is associated with the particular business data element.
15. The computer program product of claim 1 wherein the first electronic document comprises a message defined in the business communication schema and the message includes the particular business data element, the computer program product being operable to cause data processing apparatus to store an identifier for the message in response to identifying an instance of the particular business data element, wherein the stored identifier for the message is associated with the particular business data element.
17. The system of claim 16 wherein each electronic document has a format corresponding to one of a plurality of available business communication schemas and each available business communication schema has a corresponding set of predefined business data elements.
18. The system of claim 17 further comprising means for translating each electronic document at least from a first one of the plurality of available business communication schemas to a second one of the plurality of available business communication schemas, wherein the means for tracking usage of the business data elements tracks usage of business data elements for at least one of the first one of the plurality of available business communication schemas or the second one of the plurality of available business communication schemas.
19. The system of claim 16 further comprising means for monitoring the tracked usage of the business data elements.
20. The system of claim 16 wherein the tracked usage of the business data elements is used in selecting among business data elements in the set of predefined business data elements during a translation of an electronic document from a first one of a plurality of available business communication schemas to a second one of the plurality of available business communication schemas.
21. The system of claim 16 wherein the means for tracking usage comprises means for tracking usage trends.
23. The method of claim 22, wherein repeating the steps during the second time duration comprises:
receiving, during the second time duration, the third electronic document being communicated between business entities;
identifying at least one instance of the particular business data element in the third electronic document;
incrementing the reset counter associated with the particular business data element for each instance of the particular business data element in the third electronic document;
subsequent to elapse of the second time duration, comparing the reset counter with a threshold value; and
based on the reset counter being less than the threshold value, removing the particular business data element from the business communication schema.
24. The method of claim 22 further comprising translating the first electronic document into the format corresponding to the business communication schema from a different communication schema format.
25. The method of claim 24 wherein one of the different communication schema format or the format corresponding to the business communication schema comprises an intermediary format in a business document translation infrastructure, the intermediary format comprising a format used for translating electronic documents from a format corresponding to a first communication schema into a format corresponding to a second communication schema, with the first communication schema and the second communication schema selected from a set of available communication schemas.
26. The method of claim 24 further comprising translating the first electronic document from the format corresponding to the business communication schema into a different communication schema format.
27. The method of claim 24 further comprising using the counter in selecting business data elements for translating electronic documents.
28. The method of claim 22 wherein the counter is used to identify usage trends of business data elements.
29. The computer program product of claim 1 wherein the business communication schema comprises a first business communication schema, the second electronic document has a format corresponding to a second business communication schema, and the first business communication schema is disparate from the second business communication schema.
30. The computer program product of claim 1 wherein the first electronic document being communicated between business entities is an electronic document being communicated among a first set of business entities, the second electronic document being communicated between business entities is an electronic document being communicated among a second set of business entities, and at least one of the first set of business entities is disparate from the second set of business entities.

The present invention relates to data processing by digital computer, and more particularly to tracking usage of data elements in electronic business communications.

Companies have conventionally exchanged electronic business information using Electronic Data Interchange (EDI). While EDI has allowed companies to communicate more efficiently than through the use of traditional paper-based communications, smaller companies face challenges to participate in electronic business (or electronic collaboration). These companies need to invest in complex and expensive computer systems to be installed at local computers, or to register with marketplaces at remote computers accessible through the Internet. In either case, the companies are bound by the particulars of the local or remote computer systems. Changes lead to further costs for software, hardware, user training, registration, and the like.

More recently, the development of the Extensible Markup Language (XML) has offered an alternative way to define formats for exchanging business data. XML provides a syntax that can be used to enable more open and flexible applications for conducting electronic business transactions, but does not provide standardized semantics for messages used in business processes. Initiatives to define standardized frameworks for using XML to exchange electronic business data have produced specifications such as the Electronic Business Extensible Markup Language (ebXML) Core Components Technical Specification (CCTS) and ISO 11179, which is incorporated in ebXML CCTS. Despite the efforts of such initiatives to develop a single standard for conducting electronic business, a number of competing XML and non-XML-based standards and proprietary formatting schema have been developed and are in common use, including cXML, ebXML, SAP IDoc, SAP IFR XML, OAG BOD, ANSI X12, EDIFACT, SWIFT, FIX, RosettaNet, and xCBL. In addition, some companies continue to use EDI-based systems based at least in part on their substantial investments in EDI integration. The large number of available schemas complicates efforts to conduct electronic business because a company will often adopt a particular schema that is not supported by all of the company's trading partners. Further complicating matters is the fact that many of the schema are constantly evolving (i.e., new business data elements are regularly added).

The present invention provides methods and apparatus, including computer program products, that implement techniques for tracking usage of business data elements from one or more communication schemas used for transacting electronic business.

In one general aspect, the techniques feature an electronic document having a format defined by a business communication schema. The business communication schema includes a set of predefined business data elements (e.g., business information entities) for use in transacting electronic business. An electronic document that includes instances of business data elements is received, and an instance of a particular business data element in the electronic document is identified. A counter associated with the particular business data element is incremented as a result of identifying the instance.

The invention can be implemented to include one or more of the following advantageous features. The electronic document includes instances of multiple business data elements from the set of predefined business data elements, and each of the business data elements has an associated counter. The electronic document is received from a translation module that is operable to translate the electronic document into the format defined by the business communication schema from a different communication schema format. One of the communication schema formats is an intermediary format in a business document translation infrastructure. The intermediary format is a format used for translating electronic documents from a first communication schema into a second communication schema. The first communication schema and the second communication schema are selected from a set of available communication schemas. The counters associated with each business data element is used by the translation module in selecting business data elements for translating electronic documents.

The translation module uses the counters to determine preferences among similar business data elements for translating electronic documents and/or the counters are used to identify business data elements as candidates for deletion from the business communication schema. In the latter case, a counter associated with each business data element is compared with a threshold value, and a user is notified if the counter is less than the threshold value. The counter for a particular business data element is incremented once in response to identifying one or more instances of the particular business data element in an electronic document or is incremented for each identified instance. A date for the electronic document is stored in association with the particular business data element as a result of identifying an instance of the particular business data element. The electronic document includes a message defined in the business communication schema, and the message includes the particular business data element. An identifier for the message is stored in association with the particular business data element as a result of identifying an instance of the particular business data element.

The invention can be implemented to realize one or more of the following advantages. Statistics and data on the actual usage of business data elements can be automatically collected during the exchange of electronic business documents. These statistics and data can be used to assist in automatic or semi-automatic mapping of business data elements between different business data schemas. The statistics can be used to establish a preference within the mapping procedure for more frequently used business data elements in cases where two or more similar business data elements are potentially applicable. For example, the most frequently used business data element from a set of similar elements can be automatically selected unless certain conditions for using another business data element are met.

By establishing preferences, the statistics and data can also help avoid the use of incorrect business data elements during mapping (e.g., elements that do not correctly represent the semantic meaning of the underlying data). The statistics and data can be used to identify business data elements that are not used or that are used infrequently. A schema that includes business data elements can be modified to selectively delete unused or infrequently used business data elements. Such deletions allow the schema to evolve toward a more streamlined set of elements, which can increase the efficiency of both mapping between different schemas and generating electronic documents based on the schema by removing the need to consider business data elements that have become obsolete as a result of evolving business requirements. CCTS-based schemas can be evolved to follow business requirements by extending the schemas to include new elements and restricting the schemas by removing unused elements. A selectively streamlined schema can also help avoid potential semantic misunderstandings resulting from the use of obsolete business data elements. The statistics can be used in semi-automatic modeling. Semi-automatic modeling provides and recommends only the more-frequently used business data elements and/or business data elements with a positive usage trend, among business data elements that are semantically similar. For example, a modeler writes a definition in a specific context for his required additional business information. The semi-automatic modeling approach searches for same or similar business data elements in the library that are appropriate for the defined requirements. If the system finds more than one similar business data elements in the repository, additional information about the frequency of usage of each business data element can be used to select an appropriate business data element. The statistics can also be used for modeling and optimization of new data models (e.g., if a complete new business document is required). Preferences for the more used business data elements and/or business data elements with a positive usage trend for use in the new data model can define a more efficient data model. Implementations of the invention provide one or more of the above advantages.

Details of one or more implementations of the invention are set forth in the accompanying drawings and in the description below. Further features, aspects, and advantages of the invention will become apparent from the description, the drawings, and the claims.

FIG. 1 is a block diagram of a system for transacting electronic business using one or more business communication schemas.

FIG. 2 is an illustration of counters for exemplary business data elements in an intermediary communication schema.

FIG. 3 is an excerpt from a sample XML definition of a communication schema.

FIG. 4 is a flow diagram of a process for tracking usage of business data elements in received messages.

FIG. 5 is a flow diagram of a process for reviewing statistics and information relating to the business data elements in a communication schema.

Like reference numbers and designations in the various drawings indicate like elements.

In general, electronic business communications can be conducted using electronic documents. An electronic document does not necessarily correspond to a file. A document may be stored in a portion of a file that holds other documents, in a single file dedicated to the document in question, or in multiple coordinated files. An electronic document used in transacting electronic business is formatted in accordance with one of a wide variety of available business communication schemas (e.g., EDIFACT, X12, xCBL, a CCTS-based schema, or IDoc). An electronic document can correspond to an electronic business message, transaction, group of related messages, or group of related transactions.

Each business communication schema includes a set of business data elements from which electronic documents can be constructed. For example, a purchase order electronic document can be constructed using an aggregation of business data elements that specify the buyer and the seller, identify the document as a purchase order, list the ordered products, specify delivery terms, and provide other relevant information. A communication schema can be defined using XML.

Business data elements for a communication schema are used to represent object classes (e.g., a person or an address), characteristics of an object class (e.g., a postal code), a business context (e.g., chemical industry), a data type (e.g., valid values for a characteristic), associations (e.g., between a person and an address), representation classes, messages, transactions, and the like. Each business data element in an electronic business framework typically includes a unique name. The unique name identifies the element and/or to indicate what is represented by values or sub-elements within the element. In some communication schemas, elements are defined by code names. For example, xCBL uses a numerical code to identify a data type for each instance of a business data element. Other communication schemas define elements using semantic names, which can include multiple concatenated terms that describe characteristics of the component. For example, ISO 11179, which is incorporated in CCTS, defines a naming convention in which each data element is described by a name that semantically describes the business data element (“address.street” or “party.company”) and how it is represented (e.g., text or code).

A typical business communication schema will enable a large number of different transactions and/or messages using a large set of business data elements. Some of the business data elements are shared among multiple different transaction and/or message types while other business data elements may be associated with a particular transaction and/or message type. Business data elements in an electronic document are organized according to a format for the business communication schema, which can include a format that is common between different transaction and/or message types, and/or formatting characteristics that are specific to each transaction and/or message type.

FIG. 1 is a block diagram of a system 100 for transacting electronic business using one or more business communication schemas. The system 100 includes a first monitor 105 connected to a first computer 110 and a second monitor 125 connected to a second computer 120. Electronic business communications between the first computer 110 and the second computer 120 are conducted over a network 115, such as the Internet, in accordance with a business communication schema. To facilitate electronic business communications, the first computer 110 includes a data storage device 130 containing a first schema repository 135 and the second computer 120 includes a data storage device 140 containing a second schema repository 145. Each of the first schema repository 135 and the second schema repository 145 store metadata describing one or more formats defined by a business communication schema.

The monitor 105 displays user interfaces for allowing a user to enter or otherwise define business data to be included in an electronic document. The first computer 110 generates the electronic document in accordance with the metadata stored in the first schema repository 135. In particular, the first computer 110 organizes the data entered by the user according to a communications schema format defined in the first schema repository 135. The generated electronic document can then be transmitted over the network 115 to a receiving entity, such as the second computer 120. The second computer 120 is capable of interpreting received electronic documents in accordance with the metadata stored in the second schema repository 145. In particular, the second computer 120 interprets data contained in a received electronic document according to a communications schema format defined in the second schema repository 145.

One or more communications schemas can be defined in each schema repository 135 and 145. In some cases, two enterprises that wish to transact electronic business agree to use a particular communication schema that both enterprises support. In other words, the same communication schema is defined in both the first schema repository 135 and the second schema repository 145. In such a case, an electronic document generated by the first computer 110 using the particular communication schema can be interpreted by the second computer 120 using the metadata in the second schema repository 145, and the monitor 125 can display user interfaces that include the data contained in the electronic document.

In other situations, two enterprises that wish to transact electronic business use communication schemas that are incompatible with one another. For example, a first communication schema is defined in the first schema repository 135, and a different communication schema is defined in the second schema repository 145. If an electronic document is generated by the first computer 110 and sent directly to the second computer 120, the second computer 120 is unable to interpret the electronic document because the second schema repository 145 does not include information for mapping business data elements between different schemas.

To exchange an electronic document between computers 110 and 120 that support different communication schemas, it is possible to translate the electronic document from the first communication schema format to the second communication schema format using a translation infrastructure 165 in an intermediary computer 150. The intermediary computer 150 includes a storage device 155 containing an intermediary schema repository 160. The intermediary schema repository 160 includes metadata defining both the first and second communication schemas. To communicate with the second computer 120, the first computer 110 sends an electronic document generated using the first communication schema to the translation infrastructure 165. The translation infrastructure 165 translates the electronic document from the first communication schema format to the second communication schema format, including translating the business data elements, using the metadata stored in the intermediary schema repository 160. The translated electronic document is sent to the second computer 120, which interprets the included data using the second communication schema metadata stored in the second schema repository 145.

In some implementations, the translation infrastructure 165 translates electronic documents from the first communication schema format directly to the second communication schema format. In other implementations, the translation infrastructure 165 translates electronic documents from the first communication schema format to an intermediary communication schema format and then from the intermediary communication schema format to the second communication schema format. The intermediary communication schema format is based on the UN/CEFACT XML Naming and Design Rules for UN/CEFACT CCTS. The translation infrastructure 165 can generally translate between any number of different communication schema formats using metadata describing the various different communication schema formats stored in the intermediary schema repository 160. Metadata defining a library of business data elements can also include data necessary for mapping the business data elements to other formats. The translation infrastructure 165 can be, for example, the Exchange Infrastructure (XI), available from SAP AG of Walldorf (Baden), Germany.

In addition to translating electronic documents, the translation infrastructure 165 also collects statistics and other information relating to electronic documents that pass through the intermediary computer 150. For example, the translation infrastructure 165 counts instances of business data elements that are included in electronic documents. A counter corresponding to each business data element in a communication schema is incremented when the translation infrastructure 165 identifies an instance of the business data element. In some implementations, the counter is incremented for each separate instance of the business data element in an electronic document. Alternatively, the counter is incremented only once for each electronic document that includes the business data element regardless of how many instances are included in the electronic document.

Additional data that can be collected includes a name or other identifier for each message or transaction that includes one or more instances of the business data element and/or a date or time that the electronic document is sent, received, or translated. The additional data can also include statistics used to identify usage trends (e.g., increasing, decreasing, and cyclical usage trends) by storing counter values and other data for multiple different time periods. The value of the various counters and the additional data collected is stored in a statistics database 175 contained in a storage device 170 for the intermediary computer 150. The statistics database 175 can also be stored in the storage device 155 that contains the intermediary schema repository 160 or even as part of the metadata defining the communication schema that is stored in the storage device 155 (e.g., the counter and other data for each business data element can be stored as additional attributes of the complex Type element definition as illustrated in FIG. 3). The counter values and the other data can be used to determine which business data elements should be favored in mapping procedures, which business data elements can be deleted, which business data elements are used only with specific messages or transactions, how to model new schema (e.g., an intermediary format), and how to optimize existing schema.

Statistics and other data can be collected for business data elements of the first communication schema, the intermediary communication schema, and/or the second communication schema as well as any other communication schemas supported by the translation infrastructure 165. Accordingly, the statistics database 170 can collect usage information for a number of different schemas. In addition, statistics and other data can be collected for business data elements independent of any translation involving a schema that includes the business data elements. For example, the statistics database 175 can be incorporated into the first computer 110 and/or the second computer 120 for collecting statistics on electronic documents sent from or received at the first computer 110 and/or the second computer 120, respectively. In addition, the translation infrastructure 165 and associated intermediary schema repository 160 do not need to be in a separate location or computer 150 but can be incorporated into the first computer 110 or second computer 120. In cases where a business data element includes multiple different instances (e.g., a generic element that is used in multiple different electronic documents), the instances can be related to one another in a hierarchy, and each instance can have its own corresponding counter. Furthermore, incrementing of counters can be based on the location of the business data element, which is expressed by an xPath navigation path.

FIG. 2 is an illustration of counters for exemplary business data elements in an intermediary communication schema 200. The intermediary communication schema 200 includes multiple different business data elements 205, including an “Address.Details” element 205(1), a “Batch.Details” element 205(2), a “Price Component.Details” element 205(3), a “Batch.Toll Free.Indicator” element 205(4), and a “Price Component.Base Amount” element 205(5). Each business data element 205 can be an aggregation of more basic business data elements 205 and/or can be combined with other business data elements 205 to form a higher-level business data element 205 (e.g., a complex element or a message). Each business data element 205 has an associated counter 210 that is incremented when an instance of the corresponding business data element 205 is identified.

An electronic document formatted in accordance with an IDoc schema format 215 is received at a translation module 220. The translation module 220 translates the electronic document into an intermediary schema format 225. For each different business data element 205 of the intermediary schema that includes at least one instance, the counter 210 corresponding to the business data element 205 is incremented. For example, if the electronic document, after translation into the intermediary schema format, includes one or more instances of the “Price Component. Details” element 205(3), the corresponding counter 210(3) is incremented (as indicated at 230). After inspecting the electronic document for instances of the various business data elements of the intermediary schema 200, the electronic document is delivered to a translation module 235 for translation into one or more of an EDIFACT schema format 240, an xCBL schema format 245, and/or an X12 schema format 250.

Similar counters can also be used for counting instances of business data elements for incoming electronic document formats (e.g., the IDoc schema format 215) and outgoing electronic document formats (e.g., the EDIFACT schema format 240, the xCBL schema format 245, and/or the X12 schema format 250). In addition, during translation of electronic documents, the translation modules 220 and 235 can use the values stored in the counters 210 and/or counters for elements in the incoming and outgoing schema formats as well as other data relating to the business data elements 205 (e.g., stored in the statistics database 175). For example, a translation module 220 might compare the counter value 210(3) for the “Price Component.Details” element 205(3) and the counter value 210(5) for the “Price Component.Base Amount” element 205(5) to select the “Price Component.Details” element 205(3) over the “Price Component.Base Amount” element 205(5). In some cases, the selection might also take into account the messages in which each of multiple possible elements have historically been used (e.g., to select an element that, although used infrequently, is used with a particular message type while a more frequently used element has not previously been used with the particular message type).

FIG. 3 is an excerpt from a sample XML definition 300 of a communication schema. The XML definition 300 includes a “PeriodType” element 305 that is an aggregation of a number of sub-elements, including a “DurationDateTime” element 310 and an “Indicator” element 315. Each of the “DurationDateTime” element 310 and the “Indicator” element 315 include a “FrequencyofUsage” set of attributes 320. A “counter value” attribute 325 stores a value corresponding to the number of electronic documents in which an instance of the element 310 or 315 appeared. The value is incremented each time another electronic document that includes an instance of the element 310 or 315 is received. The “counter value” attribute 325 is used to determine whether a particular element 310 or 315 is used relatively frequently or infrequently.

A “last usage date” attribute 330 stores a date on which the most recent usage in an electronic document occurred. The “last usage date” attribute 330 can be used to determine if the particular element 310 or 315 has been used recently. A “used in messages” attribute 335 identifies one or more messages (e.g., message types) in which an instance of the element 310 or 315 appeared and can be used to identify elements 310 or 315 that are used with only a limited number of messages. A “counting duration” attribute 340 defines a time period for which statistics are collected before analyzing the results and/or clearing the counter. A “counting minimum value” attribute 345 defines a threshold counter value. If the value in the “counter value” element 325 is less than the threshold value, the element 310 will be flagged for possible deletion at the end of the time period defined in the “counting duration” attribute 340.

Additional attributes can also be included. For example, the XML definition 300 can include an “always required indicator” attribute for elements that should not be deleted from the XML definition 300. It is possible to maintain statistics and information on usage of such elements (e.g., for use in selecting among similar business data elements during translation processes), but the statistics will not be used to determine whether the element can be deleted.

FIG. 4 is a flow diagram of a process 400 for tracking usage of business data elements in received messages. An electronic business message is received (405) at a translation module or during a process of sending or receiving a message from a transmitting computer (e.g., the first computer 110 of FIG. 1) or at a receiving computer (e.g., the second computer 120 of FIG. 1). The received message is interpreted, and a business data element for updating statistics is selected from the set of business data elements for a particular communication schema (410). It is determined whether an instance of the business data element exists in the message (415). If not, it is determined whether additional business data elements for the particular communication schema have not yet been checked (435). If there are additional data elements to be checked, the process 400 selects a new business data element (410).

If an instance of the business data element does appear in the message, a counter associated with the business data element is incremented (420). Data indicating the last usage date is updated to reflect the date of the current message (425), and the message name (e.g., describing the message type) is added to the usage information associated with the business data element (430). It is determined whether additional business data elements for the particular communication schema have not yet been checked (435). If so, the process 400 selects a new business data element (410). Otherwise, the process 400 ends (440).

FIG. 5 is a flow diagram of a process 500 for reviewing statistics and other information relating to the business data elements in a communication schema. Once a time period for counting instances of one or more business data elements (e.g., as defined by the “counting duration” attribute 340 of FIG. 3) has elapsed, a business data element from the communication schema is selected for purposes of reviewing the associated statistics (505). It is determined whether an instance counter corresponding to the selected business data element is less than a predefined threshold for the business data element (510). If not, it can be assumed that the business data element is used sufficiently frequently to warrant maintaining the business data element in the communication schema. Accordingly, the counter for the business data element is reset (525) to begin a new time period for counting instances, and it is determined whether statistics for additional elements still need to be checked (535).

If the instance counter corresponding to the selected business data element is less than a predefined threshold for the business data element, a user is notified and presented with statistics and/or other information relating to the business data element (515). For example, the user can be presented with the count value, the last usage date, and the message names in which the business data element appeared. In some cases, multiple different status levels corresponding to different threshold values can be used to indicate a level of urgency for deleting the particular business data element. For example, if usage of the business data element is significantly below the minimum threshold, it may indicate that the element is a more likely candidate for deletion. If the user decides to delete the element (520), the element is removed from the communication schema (530). Otherwise, the counter for the business data element is reset (525) to begin a new time period for counting instances. Once the element is deleted or the counter is reset, it is determined whether statistics for additional elements still need to be checked (535). If so, another business data element is selected (505) and the process 500 repeats. Otherwise, the process 500 ends (540).

The invention and all of the functional operations described in this specification can be implemented in digital electronic circuitry, or in computer software, firmware, or hardware, including the structural means disclosed in this specification and structural equivalents thereof, or in combinations of them. The invention can be implemented as one or more computer program products, i.e., one or more computer programs tangibly embodied in an information carrier, e.g., in a machine readable storage device or in a propagated signal, for execution by, or to control the operation of, data processing apparatus, e.g., a programmable processor, a computer, or multiple computers. A computer program (also known as a program, software, software application, or code) can be written in any form of programming language, including compiled or interpreted languages, and it can be deployed in any form, including as a stand alone program or as a module, component, subroutine, or other unit suitable for use in a computing environment. A computer program does not necessarily correspond to a file. A program can be stored in a portion of a file that holds other programs or data, in a single file dedicated to the program in question, or in multiple coordinated files (e.g., files that store one or more modules, sub programs, or portions of code). A computer program can be deployed to be executed on one computer or on multiple computers at one site or distributed across multiple sites and interconnected by a communication network.

The processes and logic flows described in this specification, including the method steps of the invention, can be performed by one or more programmable processors executing one or more computer programs to perform functions of the invention by operating on input data and generating output. The processes and logic flows can also be performed by, and apparatus of the invention can be implemented as, special purpose logic circuitry, e.g., an FPGA (field programmable gate array) or an ASIC (application specific integrated circuit).

Processors suitable for the execution of a computer program include, by way of example, both general and special purpose microprocessors, and any one or more processors of any kind of digital computer. Generally, the processor will receive instructions and data from a read only memory or a random access memory or both. The essential elements of a computer are a processor for executing instructions and one or more memory devices for storing instructions and data. Generally, a computer will also include, or be operatively coupled to receive data from or transfer data to, or both, one or more mass storage devices for storing data, e.g., magnetic, magneto optical disks, or optical disks. Information carriers suitable for embodying computer program instructions and data include all forms of non volatile memory, including by way of example semiconductor memory devices, e.g., EPROM, EEPROM, and flash memory devices; magnetic disks, e.g., internal hard disks or removable disks; magneto optical disks; and CD ROM and DVD-ROM disks. The processor and the memory can be supplemented by, or incorporated in, special purpose logic circuitry.

To provide for interaction with a user, the invention can be implemented on a computer having a display device, e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor, for displaying information to the user and a keyboard and a pointing device, e.g., a mouse or a trackball, by which the user can provide input to the computer. Other kinds of devices can be used to provide for interaction with a user as well; for example, feedback provided to the user can be any form of sensory feedback, e.g., visual feedback, auditory feedback, or tactile feedback; and input from the user can be received in any form, including acoustic, speech, or tactile input.

The invention can be implemented in a computing system that includes a back-end component, e.g., as a data server, or that includes a middleware component, e.g., an application server, or that includes a front-end component, e.g., a client computer having a graphical user interface or a Web browser through which a user can interact with an implementation of the invention, or any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication, e.g., a communication network. Examples of communication networks include a local area network (“LAN”) and a wide area network (“WAN”), e.g., the Internet.

The computing system can include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.

The invention has been described in terms of particular embodiments, but other embodiments can be implemented and are within the scope of the following claims. For example, the operations of the invention can be performed in a different order and still achieve desirable results. As one example, the process depicted in FIG. 4 does not require the particular order shown, or sequential order, to achieve desirable results (e.g., the operations 420, 425, and 430 can be performed in a different order). In certain implementations, multitasking and parallel processing may be preferable. Other embodiments are within the scope of the following claims.

Stuhec, Gunther

Patent Priority Assignee Title
10192202, Dec 31 2014 SAP SE Mapping for collaborative contribution
10229440, Jun 30 2014 SAP SE Accelerated price calculation using in-memory technology
10311134, Sep 21 2011 PatNotate LLC Data processing systems, devices, and methods for content analysis
10325011, Sep 21 2011 PatNotate LLC Data processing systems, devices, and methods for content analysis
10382421, Mar 01 2006 Oracle International Corporation Flexible framework for secure search
10482133, Sep 07 2016 International Business Machines Corporation Creating and editing documents using word history
10505873, Dec 30 2014 SAP SE Streamlining end-to-end flow of business-to-business integration processes
11038867, Mar 01 2006 Oracle International Corporation Flexible framework for secure search
11232251, Sep 21 2011 PatNotate LLC Data processing systems, devices, and methods for content analysis
11830266, Sep 21 2011 PatNotate LLC Data processing systems, devices, and methods for content analysis
8316007, Jun 28 2007 Oracle International Corporation Automatically finding acronyms and synonyms in a corpus
8332430, Mar 01 2006 Oracle International Corporation Secure search performance improvement
8352475, Feb 28 2001 Oracle International Corporation Suggested content with attribute parameterization
8412717, Jun 27 2007 Oracle International Corporation Changing ranking algorithms based on customer settings
8433712, Mar 01 2006 Oracle International Corporation Link analysis for enterprise environment
8595255, Mar 01 2006 Oracle International Corporation Propagating user identities in a secure federated search system
8601028, Mar 01 2006 Oracle International Corporation Crawling secure data sources
8626794, Mar 01 2006 Oracle International Corporation Indexing secure enterprise documents using generic references
8707451, Mar 01 2006 Oracle International Corporation Search hit URL modification for secure application integration
8725770, Mar 01 2006 Oracle International Corporation Secure search performance improvement
8868540, Mar 01 2006 Oracle International Corporation Method for suggesting web links and alternate terms for matching search queries
8875249, Mar 01 2006 Oracle International Corporation Minimum lifespan credentials for crawling data repositories
9081816, Mar 01 2006 Oracle International Corporation Propagating user identities in a secure federated search system
9177124, Mar 01 2006 Oracle International Corporation Flexible authentication framework
9251364, Mar 01 2006 Oracle International Corporation Search hit URL modification for secure application integration
9311429, Jul 23 2013 SAP SE Canonical data model for iterative effort reduction in business-to-business schema integration
9430720, Sep 21 2011 PatNotate LLC Data processing systems, devices, and methods for content analysis
9442832, Jul 07 2014 SAP SE User workflow replication for execution error analysis
9467437, Mar 01 2006 Oracle International Corporation Flexible authentication framework
9479494, Mar 01 2006 Oracle International Corporation Flexible authentication framework
9508027, Sep 21 2011 PatNotate LLC Data processing systems, devices, and methods for content analysis
9558402, Sep 21 2011 PatNotate LLC Data processing systems, devices, and methods for content analysis
9626451, Jul 23 2013 SAP SE Canonical data model for iterative effort reduction in business-to-business schema integration
9853962, Mar 01 2006 Oracle International Corporation Flexible authentication framework
9953013, Sep 21 2011 PatNotate LLC Data processing systems, devices, and methods for content analysis
Patent Priority Assignee Title
4714995, Sep 13 1985 Northrop Grumman Corporation Computer integration system
4864502, Oct 07 1987 VANTAGE TECHNOLOGY HOLDINGS, LLC Sentence analyzer
5089814, Apr 28 1989 Motorola, Inc. Automatic time zone adjustment of portable receiver
5130924, Jun 30 1988 International Business Machines Corporation; INTERNATIONAL BUSINESS MACHINES CORPORATION, ARMONK, NEW YORK, 10504, A CORP OF NEW YORK System for defining relationships among document elements including logical relationships of elements in a multi-dimensional tabular specification
5339392, Jul 27 1989 Thomson Reuters Global Resources Unlimited Company Apparatus and method for creation of a user definable video displayed document showing changes in real time data
5444841, Nov 08 1991 International Business Machines Corporation Graphical user interface control for replicating data fields in forms
5694598, Oct 12 1994 Qwest Communications International Inc Method for mapping data between a relational format and an object-oriented format
5717923, Nov 03 1994 Intel Corporation Method and apparatus for dynamically customizing electronic information to individual end users
5797137, Mar 26 1996 BULL HN INFORMATIONS SYSTEMS INC Method for converting a database schema in relational form to a schema in object-oriented form
5899988, Feb 28 1997 Oracle International Corporation Bitmapped indexing with high granularity locking
6018742, Jul 07 1998 Perigis Corporation Constructing a bifurcated database of context-dependent and context-independent data items
6124391, Aug 18 1998 Evonik Degussa GmbH Superabsorbent polymers having anti-caking characteristics
6125391, Oct 16 1998 Red Hat, Inc Market makers using documents for commerce in trading partner networks
6163781, Sep 11 1997 SYNTIRO HEALTHCARE SERVICES, INC Object-to-relational data converter mapping attributes to object instance into relational tables
6226674, Jun 16 1998 Alcatel Method for extending OSI ping function capability
6226675, Oct 16 1998 Red Hat, Inc Participant server which process documents for commerce in trading partner networks
6366917, Apr 01 1998 WEBPUTTY, INC Method of modifying a populated database structure by modifying metadata describing the database structure
6401085, Mar 05 1999 Accenture Global Services Limited Mobile communication and computing system and method
6421681, Sep 25 1998 International Business Machines Corporation Framework for representation and manipulation of record oriented data
6490695, Jan 22 1999 Oracle America, Inc Platform independent memory image analysis architecture for debugging a computer program
6535919, Jun 29 1998 Canon Kabushiki Kaisha Verification of image data
6542912, Oct 16 1998 Red Hat, Inc Tool for building documents for commerce in trading partner networks and interface definitions based on the documents
6560608, Jun 09 2000 Silicon Valley Bank Method and apparatus for automatically selecting a rule
6571239, Jan 31 2000 International Business Machines Corporation Modifying a key-word listing based on user response
6591260, Jan 28 2000 Red Hat, Inc Method of retrieving schemas for interpreting documents in an electronic commerce system
6606462, Jan 11 2002 Xerox Corporation Reliability model based copy count correction with self modification during system recovery for predictive diagnostics
6662237, Jun 24 1999 Silicon Valley Bank System for documenting application interfaces and their mapping relationship
6694338, Aug 29 2000 Silicon Valley Bank Virtual aggregate fields
6757739, Jun 05 2000 Silicon Valley Bank Method and apparatus for automatically converting the format of an electronic message
6785538, Mar 23 2000 NEC Corporation Communication system, communication method, and storage medium storing communication program for mobile device users
6789216, Jan 22 1999 Sun Microsystems, Inc. Platform independent memory image analysis architecture for debugging a computer program
6801201, Dec 17 2001 TRADING CENTRAL CANADA INC Method for chart markup and annotation in technical analysis
6879994, Jun 22 1999 Mavenir LTD System and method for processing and presenting internet usage information to facilitate user communications
6910182, Jan 31 2000 XMLCities, Inc.; XMLCITIES, INC Method and apparatus for generating structured documents for various presentations and the uses thereof
6938044, Dec 28 2001 TERADATA US, INC Tracking usage of resources of a database system
6941511, Aug 31 2000 International Business Machines Corporation; IBM Corporation High-performance extensible document transformation
6959416, Jan 30 2001 International Business Machines Corporation Method, system, program, and data structures for managing structured documents in a database
6985905, Mar 03 2000 RADIANT LOGIC INC System and method for providing access to databases via directories and other hierarchical structures and interfaces
7028312, Mar 23 1998 SOFTWARE AG USA, INC XML remote procedure call (XML-RPC)
7058645, Mar 03 2003 International Business Machines Corporation Mapping between native data type instances
7069020, Oct 26 2001 LG Electronics Inc. Automatic time setting method of asynchronous mobile communication terminal
7080083, Dec 21 2001 CHARTOLEAUX KG LIMITED LIABILITY COMPANY Extensible stylesheet designs in visual graphic environments
7127516, Jun 29 1998 Canon Kabushiki Kaisha Verification of image data
7136467, Jan 04 2002 TANGOE US, INC Customer-oriented telecommunications data aggregation and analysis method and object oriented system
7146399, May 25 2001 International Business Machines Corporation Run-time architecture for enterprise integration with transformation generation
7155665, Jun 18 2001 Microsoft Technology Licensing, LLC Method for creating an embedded database in a spreadsheet
7181463, Oct 24 2003 Microsoft Technology Licensing, LLC System and method for managing data using static lists
7194695, Mar 31 2003 Unisys Corporation Logistics management system presenting user interface for performing multiple freight management tasks
7225203, Apr 05 2001 Canon Kabushiki Kaisha Information storage system and information management system
7245924, May 10 2001 MAXELL HOLDINGS, LTD ; MAXELL, LTD Cellular phone and a base station thereof
7246128, Jun 12 2002 GLOBAL CONNECT TECHNOLOGY, INC Data storage, retrieval, manipulation and display tools enabling multiple hierarchical points of view
7275079, Aug 08 2000 PayPal, Inc Common application metamodel including C/C++ metamodel
7281018, May 26 2004 Microsoft Technology Licensing, LLC Form template data source change
7313756, Dec 15 2003 Microsoft Technology Licensing, LLC Schema editor extensions
7373595, Jun 27 2002 Microsoft Technology Licensing, LLC System and method for validating an XML document and reporting schema violations
7421395, Feb 18 2000 Microsoft Technology Licensing, LLC System and method for producing unique account names
20010034733,
20020116389,
20020120506,
20020145944,
20020147730,
20020147748,
20030028857,
20030083077,
20030120665,
20030149934,
20030212904,
20040158567,
20040162871,
20040177160,
20040203620,
20040205621,
20050033719,
20050144277,
20050166223,
20050198068,
20050198074,
20050228803,
20050240875,
20050278372,
20060025987,
20060085450,
20060095288,
20060101068,
20060106746,
20060106824,
20060136489,
20060178868,
20060184539,
20070101391,
20070118354,
20070150387,
EP1239375,
EP1293850,
EP1424643,
EP1793288,
WO1055891,
WO153967,
WO163477,
WO9840795,
///
Executed onAssignorAssigneeConveyanceFrameReelDoc
Nov 11 2004STUHEC, GUNTHERSAP AktiengesellschaftASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0156340744 pdf
Nov 12 2004SAP Aktiengesellschaft(assignment on the face of the patent)
Jul 07 2014SAP AGSAP SECHANGE OF NAME SEE DOCUMENT FOR DETAILS 0336250334 pdf
Date Maintenance Fee Events
May 26 2010ASPN: Payor Number Assigned.
Oct 24 2013M1551: Payment of Maintenance Fee, 4th Year, Large Entity.
Oct 24 2017M1552: Payment of Maintenance Fee, 8th Year, Large Entity.
Oct 27 2021M1553: Payment of Maintenance Fee, 12th Year, Large Entity.


Date Maintenance Schedule
May 04 20134 years fee payment window open
Nov 04 20136 months grace period start (w surcharge)
May 04 2014patent expiry (for year 4)
May 04 20162 years to revive unintentionally abandoned end. (for year 4)
May 04 20178 years fee payment window open
Nov 04 20176 months grace period start (w surcharge)
May 04 2018patent expiry (for year 8)
May 04 20202 years to revive unintentionally abandoned end. (for year 8)
May 04 202112 years fee payment window open
Nov 04 20216 months grace period start (w surcharge)
May 04 2022patent expiry (for year 12)
May 04 20242 years to revive unintentionally abandoned end. (for year 12)