In some aspects of the present application, a method for identifying significant events related to machine problems is disclosed. The method includes receiving one or more machine problems; determining one or more machine problem classifications based on the one or more machine problems; generating a probability distribution that the one or more machine problems are related to the one or more machine problem classifications; determining if one or more events are associated with the one or more machine problems during a predetermined time interval; and determining if the one or more events are significantly related to the one or more machine problem classifications using a statistical algorithm.
|
1. A method for identifying significant events related to machine problems, the method comprising:
receiving one or more machine problems and associated one or more fault codes;
determining one or more machine problem classifications based on the one or more machine problems and the associated one or more fault codes that were received;
generating a log of the one or more machine problems, the associated one or more fault codes, and the one or more machine problem classifications that were determined from a plurality of users for a predetermined time interval;
generating a baseline probability distribution comprising the one or more machine problems with respect to the one or more machine problem classifications;
for each fault code that was received, calculating a secondary probability distribution comprising a number of times that each of the associated fault codes occurred for the one or more machine problem classifications that were determined;
determining that two or more fault codes in the log are significantly related to the one or more machine problems using a Chi-squared distribution test to compare the baseline probability distribution against each of the secondary probability distributions; and
diagnosing the machine problems using the one or more fault codes that were determined to be significantly related.
13. A non-transitory computer-readable storage medium containing instructions which, when executed on a processor, perform a method for identifying significant events related to machine problems comprising:
receiving one or more machine problems and associated one or more fault codes;
determining one or more machine problem classifications based on the one or more machine problems and the associated one or more fault codes that were received;
generating a log of the one or more machine problems, the associated one or more fault codes, and the one or more machine problem classifications that were determined from a plurality of users for a predetermined time interval;
generating a baseline probability distribution comprising the one or more machine problems with respect to the one or more machine problem classifications;
for each fault code that was received, calculating a secondary probability distribution comprising a number of times that each of the associated fault codes occurred for the one or more machine problem classifications that were determined;
determining that two or more fault codes in the log are significantly related to the one or more machine problems using a Chi-squared distribution test to compare the baseline probability distribution against each of the secondary probability distributions; and
diagnosing the machine problems using the one or more fault codes that were determined to be significantly related.
7. An apparatus arranged to identify significant sequences of fault codes related to machine problems, the apparatus comprising:
a processor in communication with a memory having instructions, that when executed by the processor cause the processor to:
receive one or more machine problems and associated one or more fault codes;
determine one or more machine problem classifications based on the one or more machine problems and the associated one or more fault codes that were received;
generate a log of the one or more machine problems, the associated one or more fault codes, and the one or more machine problem classifications that were determined from a plurality of users for a predetermined time interval;
generate a baseline probability distribution comprising the one or more machine problems with respect to the one or more machine problem classifications;
for each fault code that was received, calculate a secondary probability distribution comprising a number of times that each of the associated fault codes occurred for the one or more machine problem classifications that were determined;
determine that two or more fault codes in the log are significantly related to the one or more machine problems using a Chi-squared distribution test to compare the baseline probability distribution against each of the secondary probability distributions; and
diagnose the machine problems using the one or more fault codes that were determined to be significantly related.
2. The method according to
3. The method according to
4. The method according to
5. The method according to
6. The method according to
8. The apparatus according to
9. The apparatus according to
10. The apparatus according to
11. The apparatus according to
12. The apparatus according to
14. The non-transitory computer-readable storage medium according to
15. The non-transitory computer-readable storage medium according to
|
1. Field of the Disclosure
The present application is directed to an identification technique of significant sequences of fault codes by statistical hypothesis testing.
2. Background of the Disclosure
In conventional maintenance assistance, service is performed based on an individual's domain expertise. The service is therefore highly heuristic and involves a great deal of randomness. For example, when fixing a machine problem, a customer service technician would look at the machine logs. From the machine log history, the technician would pick out whatever information he or she thought “useful” for diagnosing the problem, and would keep trying until the problem was successfully solved. On average, this repair process is long and costly.
What is needed is an improved method to identify significant sequences of fault codes to reduce machine downtime and improve machine performance.
In accordance with some aspects of the present disclosure, a method for identifying significant sequences of fault codes related to machine problems is disclosed. The method can include receiving one or more machine problems; determining one or more machine problem classifications based on the one or more machine problems; generating a probability distribution that the one or more machine problems are related to the one or more machine problem classifications; determining if one or more events are associated with the one or more machine problems during a predetermined time interval; and determining if the one or more events are significantly related to the one or more machine problem classifications using a statistical algorithm.
In some aspects, the one or more events can include one or more fault codes related to a fault associated with the one or more machine problem classifications.
In some aspects, the predetermined time interval can be between about 2 and 10 days.
In some aspects, the method can include receiving an identification of the one or more machine problems, one or more machine problem classifications, a fault code related to the one or more machine problems, or combinations thereof from a user.
In some aspects, the method can include determining if two or more events for a particular machine problem are significantly related using hypothesis testing; and arranging the two or more events into an event sequence if the two or more events are determined to be significant.
In some aspects, the determining can include determining that the event sequence is significant if and only if any two events in the event sequence are pairwise favorably associated. Additionally or alternatively, the determining can include determining that the event sequence is significant if and only if there is a path of events in the event sequence that is stepwise favorably associated.
In some aspects, the machine problem classifications can include one or more subsystems related to a machine type.
In some aspects, the probability distribution can be a multinomial distribution.
In accordance with aspects of the present disclosure, an apparatus arranged to identify significant sequences of fault codes related to machine problems is disclosed. The apparatus can include a processor in communication with a memory having instructions that, when executed by the processor, cause the processor to: receive one or more machine problems; determine one or more machine problem classifications based on the one or more machine problems; generate a probability distribution that the one or more machine problems are related to the one or more machine problem classifications; determine if one or more events are associated with the one or more machine problems during a predetermined time interval; and determine if the one or more events are significantly related to the one or more machine problem classifications using a statistical algorithm.
In some aspects, the one or more events can include one or more fault codes related to a fault associated with the one or more machine problem classifications.
In some aspects, the predetermined time interval can be between about 2 and 10 days.
In some aspects, the processor can be further configured to receive an identification of the one or more machine problems, one or more machine problem classifications, a fault code related to the one or more machine problems, or combinations thereof from a user.
In some aspects, the processor can be further configured to determine if two or more events for a particular machine problem are significantly related using hypothesis testing and arrange the two or more events into an event sequence if the two or more events are determined to be significant.
In some aspects, the event sequence is significant if and only if any two events in the event sequence are pairwise favorably associated. Additionally or alternatively, the event sequence is significant if and only if there is a path of events in the event sequence that is stepwise favorably associated.
In some aspects, the machine problem classifications can include one or more subsystems related to a machine type.
In some aspects, the probability distribution can be a multinomial distribution.
In accordance with aspects of the present disclosure, an article of manufacture is disclosed that includes a storage medium having instructions that, when executed by a processor, cause the processor to: receive one or more machine problems; determine one or more machine problem classifications based on the one or more machine problems; generate a probability distribution that the one or more machine problems are related to the one or more machine problem classifications; determine if one or more events are associated with the one or more machine problems during a predetermined time interval; and determine if the one or more events are significantly related to the one or more machine problem classifications using a statistical algorithm.
In some aspects, the article can include instructions to cause the processor to determine if two or more events for a particular machine problem are significantly related using hypothesis testing and arrange the two or more events into an event sequence if the two or more events are determined to be significant.
In some aspects, the event sequence is significant if and only if any two events in the event sequence are pairwise favorably associated. Additionally or alternatively, the event sequence is significant if and only if there is a path of events in the event sequence that is stepwise favorably associated.
It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the disclosure, as claimed.
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate several embodiments of the disclosure and, together with the description, serve to explain the principles of the disclosure.
Reference will now be made in detail to various exemplary embodiments of the present application, examples of which are illustrated in the accompanying drawings. Wherever possible, the same reference numbers will be used throughout the drawings to refer to the same or like parts.
In accordance with aspects of the present disclosure, a method is described that filters out randomness and identifies the fault codes that are statistically significantly related to a machine problem. When the significant fault codes are too numerous, which means that the information they provide is too broad and vague, a path of fault codes found by sequential or parallel hypothesis testing is used to narrow them down to a manageable body of knowledge.
At 110, the machine problem classification is determined based on the information supplied by the user. At 115, a baseline probability distribution is created for machine problems (customer calls) over machine problem classifications (subsystems, problem code, etc).
This distribution is taken as the baseline when identifying the association between fault codes and machine problem classifications. The distribution can be a multinomial distribution, where $K$ is the number of problem classifications and $N$ is the total number of machine problems. $X_i = l$ if the $i$th machine problem falls in class $l$, for $i = 1, \ldots, N$ and $l = 1, \ldots, K$. $P_l$ is the probability of a machine problem being in class $l$, which is estimated by $\frac{1}{N}\sum_{i=1}^{N} I\{X_i = l\}$, where $I\{\cdot\}$ is an indicator function. In some aspects, customer calls can be treated as machine problems and subsystems can be treated as machine problem classifications.
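The baseline estimate described above can be sketched in a few lines. This is a minimal illustration, not the disclosed implementation: the function name `baseline_distribution` and the class labels in the sample data are hypothetical and do not come from the disclosure.

```python
from collections import Counter

def baseline_distribution(problem_classes, all_classes):
    """Estimate P_l = (number of machine problems in class l) / N for each class l."""
    n = len(problem_classes)
    if n == 0:
        raise ValueError("at least one machine problem is required")
    counts = Counter(problem_classes)
    # Classes that never occur get probability 0 rather than being omitted.
    return {c: counts.get(c, 0) / n for c in all_classes}

# Hypothetical log: one class label per machine problem (customer call).
calls = ["fuser", "feeder", "fuser", "network",
         "fuser", "feeder", "fuser", "network"]
classes = ["feeder", "fuser", "network"]

P = baseline_distribution(calls, classes)
print(P)
```

Because every machine problem falls in exactly one class, the estimated probabilities sum to 1.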
Returning to
The relevance of the one or more fault codes to the machine problem classification is determined. For example, $Z_{ij} = 1$ if event (or fault code) $j$ is present in machine problem $i$, and $Z_{ij} = 0$ otherwise, for $i = 1, \ldots, N$ and $j = 1, \ldots, M$, where $M$ is the total number of events. If $S$ is an event combination or sequence, $Z_{iS} = \prod_{j \in S} Z_{ij}$. In some aspects, a fault code can be considered the unique diagnostic variable, so that "event" here means fault code. In some aspects, events can represent other meaningful (useful) variables.
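The indicator definitions above can be illustrated as follows. This is only a sketch under assumed inputs: the event names "A", "B", and "C", the sample log, and both function names are hypothetical.

```python
def event_indicators(problem_events, all_events):
    """Z[i][j] = 1 if event j is present in machine problem i, else 0."""
    return [[1 if e in events else 0 for e in all_events]
            for events in problem_events]

def sequence_indicator(z_row, all_events, sequence):
    """Z_iS = product of Z_ij over events j in S (1 only if every event is present)."""
    index = {e: j for j, e in enumerate(all_events)}
    result = 1
    for e in sequence:
        result *= z_row[index[e]]
    return result

# Hypothetical log: the set of events observed for each machine problem.
problem_events = [{"A", "B"}, {"B"}, {"A", "B", "C"}, set()]
all_events = ["A", "B", "C"]

Z = event_indicators(problem_events, all_events)
Z_S = [sequence_indicator(row, all_events, ["A", "B"]) for row in Z]
print(Z)
print(Z_S)
```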
Continuing with the example above,
A distribution can be generated for each event over the problem classifications and compared with the baseline distribution to establish whether the association between the corresponding event and the machine problem classifications is significant. For $i = 1, \ldots, N$ and $j = 1, \ldots, M$, $Y_{ij} = Z_{ij} X_i$. $Q_{jl}$ is the probability of event $j$ in class $l$, which is estimated by $\frac{1}{N}\sum_{i=1}^{N} I\{Y_{ij} = l\}$. If $Q_{jl}$ is statistically different from $P_l$ across $l$, then event $j$ is statistically associated with one or more classes, rather than reflecting only the random association between machine problems and classes.
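A minimal sketch of the per-event estimate follows, taking the definitions literally: classes are indexed from 1 to K, and the count is divided by the total number of machine problems N. The function name and the sample vectors are hypothetical.

```python
def event_class_distribution(X, Z_j, K):
    """Q_jl = (1/N) * #{i : Y_ij = l}, where Y_ij = Z_ij * X_i and l = 1..K."""
    if len(X) != len(Z_j):
        raise ValueError("X and Z_j must have one entry per machine problem")
    N = len(X)
    # Y_ij equals the class index when event j is present, and 0 otherwise.
    Y = [z * x for z, x in zip(Z_j, X)]
    return [sum(1 for y in Y if y == l) / N for l in range(1, K + 1)]

# Hypothetical data for N = 8 machine problems and K = 3 classes.
X = [1, 2, 2, 3, 2, 1, 2, 3]    # class index of each machine problem
Z_j = [0, 1, 1, 0, 1, 0, 1, 0]  # 1 where event j was logged for the problem

Q_j = event_class_distribution(X, Z_j, 3)
print(Q_j)
```

In this sample the event appears only in problems of class 2, so all of its mass falls on that class.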
At 125, a determination is made as to whether events are significant. In some aspects, significant-event testing can be performed as follows. Let $P = (P_1, P_2, \ldots, P_K)$ and $Q_j = (Q_{j1}, Q_{j2}, \ldots, Q_{jK})$, for $j = 1, \ldots, M$. A hypothesis test is used. The null hypothesis, denoted by the subscript 0, is $H_0: Q_j = P$, and the alternative hypothesis, denoted by the subscript A, is $H_A: Q_j \neq P$.
The test statistic is derived through the log-likelihood ratio: $G = 2\sum_{l=1}^{K} Q_{jl}\ln(Q_{jl}/P_l)$. $G$ is approximately Chi-square distributed. The null hypothesis is rejected if $G > \mathrm{Chi}(K-1, 0.95)$, where $\mathrm{Chi}(K-1, 0.95)$ is the 95th percentile of the Chi-square distribution with $K-1$ degrees of freedom. The test can be controlled by setting the default type 1 (false positive) error to 0.05.
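The significance test can be sketched as a log-likelihood ratio (G) test. Two assumptions are made here that are not stated in the text: the statistic is computed in its conventional count-based form, $G = 2\sum_l O_l \ln(O_l / E_l)$ with $E_l = n P_l$, which equals $n$ times the probability-based expression when the event distribution is normalized to sum to 1; and the Chi-square tail probability is evaluated with a standard-library series for the incomplete gamma function. The function names and sample counts are hypothetical.

```python
import math

def chi2_sf(x, df):
    """Upper-tail probability of a Chi-square variable with df degrees of freedom."""
    if x <= 0:
        return 1.0
    a, half = df / 2.0, x / 2.0
    # Series for the lower incomplete gamma function gamma(a, half).
    term = 1.0 / a
    total = term
    n = 0
    while abs(term) > 1e-15 * abs(total):
        n += 1
        term *= half / (a + n)
        total += term
    lower = total * math.exp(-half + a * math.log(half) - math.lgamma(a))
    return max(0.0, 1.0 - lower)

def g_test(event_counts, baseline_probs, alpha=0.05):
    """Compare per-class counts of one event with the baseline distribution.

    Assumes every baseline probability is greater than 0.
    """
    n = sum(event_counts)
    g = 0.0
    for observed, p in zip(event_counts, baseline_probs):
        if observed > 0:  # a zero count contributes nothing to the sum
            g += 2.0 * observed * math.log(observed / (n * p))
    df = len(baseline_probs) - 1
    p_value = chi2_sf(g, df)
    return g, p_value, p_value < alpha

# Hypothetical example: K = 3 classes, event seen in 40 machine problems.
P = [0.25, 0.5, 0.25]
g, p_value, significant = g_test([2, 30, 8], P)
print(g, p_value, significant)
```

Rejecting when the p-value is below 0.05 is the same decision as rejecting when G exceeds the 95th percentile of the Chi-square distribution with K-1 degrees of freedom.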
Only significant events are considered for inclusion in a sequence. A significant sequence of events is defined as one whose events are not randomly associated but favorably associated with each other, which is a different definition of significance from the one used above for significant events. Events A and B are favorably associated, rather than randomly associated (independent), if $P(A \text{ and } B) > P(A)\,P(B)$, where “>” means statistically greater.
In accordance with some aspects of the present disclosure, different ways of testing a sequence can be used. For example, suppose $S = \{e_1, e_2, \ldots, e_V\}$, where $V = |S|$ and $e_1, e_2, \ldots$ are the events in the sequence $S$. $S$ is significant if and only if any two events in $S$ are pairwise favorably associated: for any $i \neq j$ in $1, \ldots, V$, $P(e_i \text{ and } e_j) > P(e_i)\,P(e_j)$. This is equivalent to testing $V(V-1)/2$ hypotheses simultaneously: $H_0: P(e_i \text{ and } e_j) = P(e_i)\,P(e_j)$ vs. $H_A: P(e_i \text{ and } e_j) > P(e_i)\,P(e_j)$.
In this example, the overall type 1 error (0.05) can be controlled; thus, if each individual type 1 error is denoted $\alpha_t$, where $t$ indexes the $V(V-1)/2$ tests, then $\sum_t \alpha_t \le 0.05$. This example is a conservative method of selecting a significant sequence: the sequence is considered significant only if all the events in the sequence are pairwise positively correlated.
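The conservative pairwise criterion can be sketched as follows. The text does not name the test used for each pair, so this sketch assumes a one-sided Fisher exact test (hypergeometric upper tail) and splits the overall error equally across the pairs so that the individual levels sum to 0.05. The function names, event names, and indicator vectors are hypothetical.

```python
from itertools import combinations
from math import comb

def positive_association_p(z_a, z_b):
    """One-sided Fisher exact p-value for H_A: P(A and B) > P(A) * P(B)."""
    n = len(z_a)
    n_a, n_b = sum(z_a), sum(z_b)
    both = sum(x & y for x, y in zip(z_a, z_b))
    # Probability of at least `both` co-occurrences under independence.
    tail = sum(comb(n_a, k) * comb(n - n_a, n_b - k)
               for k in range(both, min(n_a, n_b) + 1))
    return tail / comb(n, n_b)

def pairwise_significant(indicators, sequence, overall_alpha=0.05):
    """True only if every pair of events in `sequence` is favorably associated."""
    pairs = list(combinations(sequence, 2))
    if not pairs:
        raise ValueError("a sequence needs at least two events")
    per_test_alpha = overall_alpha / len(pairs)  # individual levels sum to overall_alpha
    return all(positive_association_p(indicators[a], indicators[b]) < per_test_alpha
               for a, b in pairs)

# Hypothetical 0/1 indicators over N = 20 machine problems.
indicators = {
    "A": [1] * 8 + [0] * 12,
    "B": [1] * 8 + [0] * 12,
    "C": [1] * 7 + [0] + [1] + [0] * 11,
    "D": [1, 0] * 10,  # spread evenly, unrelated to A and B
}

print(pairwise_significant(indicators, ["A", "B", "C"]))
print(pairwise_significant(indicators, ["A", "B", "D"]))
```

A single pair that fails its test is enough to reject the whole sequence, which is what makes this criterion conservative.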
By way of another example, $S$ is significant if and only if there is a path of events in $S$ that is stepwise favorably associated. That is, there exists an ordering $(i_1, i_2, \ldots, i_V)$ such that $P(\{e_{i_1}, \ldots, e_{i_{k-1}}\} \text{ and } e_{i_k}) > P(\{e_{i_1}, \ldots, e_{i_{k-1}}\})\,P(e_{i_k})$ for $k = 2, \ldots, V$. This is equivalent to testing $V-1$ hypotheses one by one: $H_0: P(\{e_{i_1}, \ldots, e_{i_{k-1}}\} \text{ and } e_{i_k}) = P(\{e_{i_1}, \ldots, e_{i_{k-1}}\})\,P(e_{i_k})$ vs. $H_A: P(\{e_{i_1}, \ldots, e_{i_{k-1}}\} \text{ and } e_{i_k}) > P(\{e_{i_1}, \ldots, e_{i_{k-1}}\})\,P(e_{i_k})$, where $k = 2, \ldots, V$.
In this example, each individual test's type 1 error can be controlled to be no more than the overall default (0.05). This example is a more generous method of selecting a significant sequence. It includes the situation in which an event is associated with an event sequence but may not be associated with each individual event in that sequence.
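The stepwise criterion can be sketched in the same spirit. As an assumption not taken from the text, each step uses a one-sided Fisher exact test at level 0.05, and the existence of a valid ordering is checked by trying every permutation, which is practical only for short sequences. All names and data are hypothetical.

```python
from itertools import permutations
from math import comb

def positive_association_p(z_a, z_b):
    """One-sided Fisher exact p-value for H_A: P(A and B) > P(A) * P(B)."""
    n = len(z_a)
    n_a, n_b = sum(z_a), sum(z_b)
    both = sum(x & y for x, y in zip(z_a, z_b))
    tail = sum(comb(n_a, k) * comb(n - n_a, n_b - k)
               for k in range(both, min(n_a, n_b) + 1))
    return tail / comb(n, n_b)

def stepwise_path(indicators, sequence, alpha=0.05):
    """Return an ordering that is stepwise favorably associated, or None."""
    for order in permutations(sequence):
        prefix = list(indicators[order[0]])  # indicator of the events so far
        ok = True
        for event in order[1:]:
            if positive_association_p(prefix, indicators[event]) >= alpha:
                ok = False
                break
            # Extend the prefix: 1 only where all events so far are present.
            prefix = [p & z for p, z in zip(prefix, indicators[event])]
        if ok:
            return order
    return None

# Hypothetical indicators over N = 40 machine problems. C always accompanies
# the combination of A and B, yet is not associated with A or B on its own.
indicators = {
    "A": [1] * 12 + [0] * 28,
    "B": [1] * 8 + [0] * 4 + [1] * 4 + [0] * 24,
    "C": [1] * 8 + [0] * 8 + [1] * 19 + [0] * 5,
}

print(stepwise_path(indicators, ["A", "B", "C"]))
print(positive_association_p(indicators["A"], indicators["C"]))
```

In this sample the ordering A, B, C passes every step even though the pair A and C would fail a pairwise test, which illustrates why the stepwise criterion is the more generous of the two.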
Continuing with the example above, the test statistic, or log-likelihood ratio (G), for fault code “16.602.28” is 17.27, where the number of degrees of freedom is 6 (the number of subsystem classes (7) minus 1). Referring to the Chi-square distribution at significance level 0.01, this fault code is significantly related to certain machine problems, i.e., Electrical/UI/Net Contrlr and Integrated System, and should therefore be used to diagnose machine problems. In some aspects, the significance level can be between 0.01 and 0.05, depending on how much tolerance for error is permitted.
Out of 818 fault codes, only 225 are significant, meaning that a large portion of the fault codes (those that occur randomly or are triggered by other significant fault codes) can be filtered out, and only the informative ones are kept for the diagnosis.
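The quoted figures can be checked arithmetically. For an even number of degrees of freedom the Chi-square upper-tail probability has a closed form, so no statistics library is needed; the function name below is hypothetical.

```python
import math

def chi2_sf_even_df(x, df):
    """Upper-tail probability of a Chi-square variable with an even df.

    For df = 2m: sf(x) = exp(-x/2) * sum_{k=0}^{m-1} (x/2)**k / k!
    """
    if df <= 0 or df % 2 != 0:
        raise ValueError("df must be a positive even integer")
    half = x / 2.0
    return math.exp(-half) * sum(half ** k / math.factorial(k)
                                 for k in range(df // 2))

# Values quoted in the text: G = 17.27 with 7 - 1 = 6 degrees of freedom.
p_value = chi2_sf_even_df(17.27, 6)
print(p_value)
print(p_value < 0.01)  # significant at the 0.01 level
```

The resulting p-value is below 0.01, which is consistent with the conclusion that the fault code is significant at that level.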
Typically, a single fault code is not enough to identify the machine problem, since it tends to provide limited and vague information. If other fault codes are associated with it, a better understanding of the problem can be achieved, since the fault code combination narrows the possible causes of the machine problem to a very small range. Therefore, the search for fault code sequences using the disclosed hypothesis testing provides a better chance that machine problems can be addressed more efficiently. For example, using the example above with the fault code “16.602.28,” five significant fault code pairs including “16.602.28” are found, i.e., “16.602.28” with “16.630.46,” “16.602.28” with “16.750.46,” “16.602.28” with “16.754.14,” “16.602.28” with “16.760.19,” and “16.602.28” with “16.761.19.” If the fault code pairs are extended to fault code triples, no triples are found using the conservative method; however, if the less conservative example is used, the significant fault code combination can be extended to “16.602.28” with “16.630.46” with “16.750.46” with “16.754.14” with “16.760.19” with “16.761.19.” The intersection of the information contained in each fault code can be much clearer and more closely related to the machine problem, which can expedite the whole diagnosis process and save service time (cost).
For the purposes of this specification and appended claims, unless otherwise indicated, all numbers expressing quantities, percentages or proportions, and other numerical values used in the specification and claims, are to be understood as being modified in all instances by the term “about.” Accordingly, unless indicated to the contrary, the numerical parameters set forth in the following specification and attached claims are approximations that can vary depending upon the desired properties sought to be obtained by the present disclosure. At the very least, and not as an attempt to limit the application of the doctrine of equivalents to the scope of the claims, each numerical parameter should at least be construed in light of the number of reported significant digits and by applying ordinary rounding techniques.
It is noted that, as used in this specification and the appended claims, the singular forms “a,” “an,” and “the,” include plural referents unless expressly and unequivocally limited to one referent. Thus, for example, reference to “an acid” includes two or more different acids. As used herein, the term “include” and its grammatical variants are intended to be non-limiting, such that recitation of items in a list is not to the exclusion of other like items that can be substituted or added to the listed items.
While particular embodiments have been described, alternatives, modifications, variations, improvements, and substantial equivalents that are or can be presently unforeseen can arise to applicants or others skilled in the art. Accordingly, the appended claims as filed and as they can be amended are intended to embrace all such alternatives, modifications, variations, improvements, and substantial equivalents.
Patent | Priority | Assignee | Title |
6535865, | Jul 14 1999 | HEWLETT-PACKARD DEVELOPMENT COMPANY, L P | Automated diagnosis of printer systems using Bayesian networks |
6763482, | Jun 19 2001 | Xerox Corporation | Printer diagnostics method |
6782345, | Oct 03 2000 | Xerox Corporation | Systems and methods for diagnosing electronic systems |
6782495, | Jun 19 2001 | Xerox Corporation | Method for analyzing printer faults |
7174264, | Jul 14 2003 | Fuji Xerox Co., Ltd. | Fault diagnosis apparatus |
7243045, | Apr 21 2004 | FUJIFILM Business Innovation Corp | Failure diagnosis method, failure diagnosis apparatus, image forming apparatus, program, and storage medium |
7467841, | Sep 07 2006 | Kabushiki Kaisha Toshiba; Toshiba Tec Kabushiki Kaisha | Maintenance scheduling system, maintenance scheduling method and image forming apparatus |
7778943, | Feb 09 2007 | Honeywell International Inc. | Stochastic evidence aggregation system of failure modes utilizing a modified dempster-shafer theory |
20070028220, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Aug 12 2011 | ZHAO, SHI | Xerox Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 026748 | /0614 | |
Aug 15 2011 | Xerox Corporation | (assignment on the face of the patent) | / | |||
Nov 07 2022 | Xerox Corporation | CITIBANK, N A , AS AGENT | SECURITY INTEREST SEE DOCUMENT FOR DETAILS | 062740 | /0214 | |
May 17 2023 | CITIBANK, N A , AS AGENT | Xerox Corporation | RELEASE OF SECURITY INTEREST IN PATENTS AT R F 062740 0214 | 063694 | /0122 | |
Jun 21 2023 | Xerox Corporation | CITIBANK, N A , AS COLLATERAL AGENT | SECURITY INTEREST SEE DOCUMENT FOR DETAILS | 064760 | /0389 | |
Nov 17 2023 | Xerox Corporation | JEFFERIES FINANCE LLC, AS COLLATERAL AGENT | SECURITY INTEREST SEE DOCUMENT FOR DETAILS | 065628 | /0019 | |
Feb 06 2024 | Xerox Corporation | CITIBANK, N A , AS COLLATERAL AGENT | SECURITY INTEREST SEE DOCUMENT FOR DETAILS | 066741 | /0001 | |
Feb 06 2024 | CITIBANK, N A , AS COLLATERAL AGENT | Xerox Corporation | TERMINATION AND RELEASE OF SECURITY INTEREST IN PATENTS RECORDED AT RF 064760 0389 | 068261 | /0001 |
Date | Maintenance Fee Events |
Jan 29 2015 | ASPN: Payor Number Assigned. |
Aug 28 2018 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity. |
Aug 31 2022 | M1552: Payment of Maintenance Fee, 8th Year, Large Entity. |
Date | Maintenance Schedule |
Mar 03 2018 | 4 years fee payment window open |
Sep 03 2018 | 6 months grace period start (w surcharge) |
Mar 03 2019 | patent expiry (for year 4) |
Mar 03 2021 | 2 years to revive unintentionally abandoned end. (for year 4) |
Mar 03 2022 | 8 years fee payment window open |
Sep 03 2022 | 6 months grace period start (w surcharge) |
Mar 03 2023 | patent expiry (for year 8) |
Mar 03 2025 | 2 years to revive unintentionally abandoned end. (for year 8) |
Mar 03 2026 | 12 years fee payment window open |
Sep 03 2026 | 6 months grace period start (w surcharge) |
Mar 03 2027 | patent expiry (for year 12) |
Mar 03 2029 | 2 years to revive unintentionally abandoned end. (for year 12) |