Searching a multi-lingual database

Searching a multi-lingual database
US8027994

A system and method of searching a database in which documents of different languages are included. The system includes a synonym or keyword dictionary which is bi-directional and allows for translation of keywords between a first language and other languages. The translated words keywords for the document are stored in an inverted index which is then used for searching, either in a selected language, a second language or in all languages, as determined by the user. This use of multiple searching and a translated synonym dictionary avoids the need for translation of the entire document and avoids inaccuracies which may result from translations.

PTO Wrapper PDF
Dossier Espace Google

Patent 8027994
Priority Feb 01 2002
Filed Aug 21 2008
Issued Sep 27 2011
Expiry Jul 13 2023 Extension 527 days
Inventors Drissi, Yo…
Assg.orig Internatio…
Assg.curr Internatio…
Entity Large
Referenced by 3
References 132
Maint.: EXPIRED<2yrs

REFERENCE TO RELATED…
BACKGROUND OF THE IN…
SUMMARY OF THE INVEN…
BRIEF DESCRIPTION OF…
DETAILED DESCRIPTION…

1. A computerized method of searching documents written in a plurality of languages, the method comprising:

receiving a first query that includes at least one keyword in a first language;

generating a second query by translating the at least one keyword into a second language;

applying the first query against documents including at least one document written in the first language and at least one document written in the second language with the at least one keyword in the first language;

applying the second query against documents written in the second language;

generating a first set of results based on the first query, wherein the first set of results includes each document written in the first language that matches the first query; and

generating a second set of results based on the first and second queries, wherein the second set of results includes each document written in the second language that matches at least one of the first query or the second query.

11. A computer system comprising:

a system for searching documents written in a plurality of languages, the system comprising at least one computer, wherein the searching is implemented using a method including:

receiving a first query that includes at least one keyword in a first language;

generating a second query by translating the at least one keyword into a second language;

applying the second query against documents written in the second language;

generating a first set of results based on the first query, wherein the first set of results includes each document written in the first language that matches the first query; and

generating a second set of results that includes each document written in the second language based on the first and second queries, wherein the second set of results matches at least one of the first query or the second query.

16. A document searching program stored on a computer-useable medium, which causes a computer system to perform a method when executed on the computer system, wherein the documents are written in a plurality of languages, the method comprising:

receiving a first query that includes at least one keyword in a first language;

generating a second query by translating the at least one keyword into a second language;

applying the second query against documents written in the second language;

generating a first set of results based on the first query, wherein the first set of results includes each document written in the first language that matches the first query; and

2. The method of claim 1, wherein the generating a second query uses a keyword dictionary that supports translation between the plurality of languages.

3. The method of claim 2, wherein the keyword dictionary includes a synonym listing that translates a keyword in any of the plurality of languages into a corresponding keyword in another of the plurality of languages.

4. The method of claim 1, wherein the generating a second query translates the at least one keyword using the Unicode system.

5. The method of claim 1, further comprising providing at least one of the first and second sets of results for display to a user.

6. The method of claim 1, wherein the applying the first query includes:

applying the first query against an inverted index in the first language; and

applying the first query against an inverted index in the second language.

7. The method of claim 1, further comprising:

identifying keywords from each of the plurality of documents;

translating each identified keyword into each of the plurality of languages; and

creating an index in each of the plurality of languages, wherein the applying the first query uses at least one of the plurality of indexes.

8. The method of claim 7, further comprising creating a single inverted index from the translated keywords.

9. The method of claim 7, wherein the translating uses a keyword dictionary that supports translation between the plurality of languages.

10. The method of claim 1, further comprising determining a native language of a user.

12. The system of claim 11, wherein the generating a second query uses a keyword dictionary that supports translation between the plurality of languages.

13. The system of claim 11, the method further including providing at least one of the first and second sets of results for display to a user.

14. The system of claim 11, the method further including determining a native language of a user.

15. The system of claim 11, the method further including:

identifying keywords from each of the plurality of documents;

translating each identified keyword into each of the plurality of languages; and

creating an index in each of the plurality of languages, wherein the system for applying the first query uses at least one of the plurality of indexes.

17. The program of claim 16, wherein the generating a second query uses a keyword dictionary that supports translation between the plurality of languages.

18. The program of claim 16, the method further comprising a providing at least one of the first and second sets of results for display to a user.

19. The program of claim 16, the method further comprising a determining a native language of a user.

20. The program of claim 16, the method further comprising:

identifying keywords from each of the plurality of documents;

translating each identified keyword into each of the plurality of languages; and

creating an index in each of the plurality of languages, wherein the applying the first query uses at least one of the plurality of indexes.

REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No. 11/151,047, filed on 13 Jun. 2005 now U.S. Pat. No. 7,433,894, which is a continuation of U.S. patent application Ser. No. 10/066,346, filed on 1 Feb. 2002 now U.S. Pat. No. 6,952,691, both of which are hereby incorporated herein by reference.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to the field of searching a database using search term(s) entered by a user. More particularly, the present invention is a system and method for searching on a database including material in different languages where the search term(s) are entered in one of the languages where the database need not be translated into the different languages.

2. Background Art

Various methods have been proposed for searching a database wherein the database includes material in multiple languages. One approach is to translate the entire database into the language in which a search term is entered or the language of the user. However, this could involve a large amount of translation for a sizable database (and multiple translations if the database is used by users in different languages). Further, each process of translating a document has the potential for losing (or distorting) some of the meaning of the original text.

For these reasons, it is desirable to avoid translating the documents to allow for a search in a particular language.

Another approach is to use synonym list and apply it to the search term(s) entered in one language. That is, the text of the documents in the database remain in the original language and synonyms in each language for each search term(s) are used for the search of the database. This system may work in some cases but is undesirable in other cases because considering all of synonyms in the different languages could lead to incorrect results. The word for “network” in Spanish is “red” and a search on “network” which blindly translates the search term would incorrectly find English documents which include the color “red”.

Further, some of the documents include text in one language and key words presented in a different language to avoid changing the meaning. Thus, it is desirable to search a database which includes these terms but would not be effective to search only for the translated form of the word.

As will be apparent to one skilled in the relevant art, the process of translating and searching in multiple languages can consume substantial computing resources. Many of the multi-language database searching techniques require a powerful computer or take an inordinate amount of time to process a single search, the amount depending on the size of the database, the number of supported languages and the nature of the queries. However, the computing resources have a cost associated with them, either in requiring a larger or faster system or in terms of tying up the computer while a large task is running to the exclusion of other users. Further, a search which takes a long period of time may prevent the user from interactively modifying the search to obtain meaningful results. Accordingly, it is desirable to avoid using large computing resources.

Accordingly, existing systems methods for searching databases have undesirable disadvantages and limitations which will be apparent to those skilled in the art in view of the following description of the present invention.

SUMMARY OF THE INVENTION

The present invention overcomes the disadvantages and limitations of the prior art systems by providing a simple, yet effective, method and system for searching a database including documents in multiple supported languages. The present invention also supports searching a database in which the text is comprised of documents written in multiple languages, including those documents which are written in one language but which include words or phrases from a second language.

The present invention has the advantage that a translation of the documents in the database into each of the supported languages is not required.

The present invention also has the advantage that the meaning of the original document is not lost or distorted through a translation process to allow searching of the document in different languages.

The present invention also allows for the searching of a database in a native or natural language while finding documents which are written in other languages.

Other objects and advantages of the system and method of the present invention will be apparent to those skilled in the relevant art, in view of the following description of the preferred embodiment, taken together with the accompanying drawings and the appended claims.

BRIEF DESCRIPTION OF THE DRAWINGS

Having thus described some of the objects and advantages of the present invention, other objects and advantages will be apparent to those skilled in the art in view of the following description of the invention taken in conjunction with the accompanying drawings in which:

FIG. 1 is a diagrammatic view of a traditional search technique in which documents exist in two different languages;

FIG. 2 is a diagrammatic view of a diagram of an improved multi-lingual document database index system of the present invention;

FIG. 3 is a dual language (or multi-language) database search system of the present invention;

FIG. 4 is a flow chart illustrating sample logic performed in practicing the present invention; and

FIG. 5 is a synonym table of the type which is useful in carrying out the present invention as described in connection with FIGS. 2-4, associating a word in one language with its counterpart in another language.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT

In the following description of the preferred embodiment, the best implementation of practicing the invention presently known to the inventor will be described with some particularity. However, this description is intended as a broad, general teaching of the concepts of the present invention describing a specific embodiment but is not intended to be limiting the present invention to that as shown in this embodiment, especially since those skilled in the relevant art will recognize many variations and changes to the specific structure and operation shown and described with respect to these figures.

FIG. 1 illustrates a traditional search system, that is, one of the prior art, in which documents in English (a first language) are represented by the symbol 102 and documents in a second language such as a national language (NL) are represented by the symbol 122. While each set of documents is maintained separately, each is indexed through a process of extracting the keywords and creating an index, represented by the box 104 for the English documents 102 and the box 124 for the second language documents 122. The next step is that an inverted index is performed for each set of documents, the English inverted index at block 106 and the second language index represented by block 126. Then, a search or query is formatted and applied against a selected one of the databases, represented by an English query at 108 and a national language query at block 128. The results of the English query are shown by block 110 and the results of a national language query are represented by the box 130. Thus, the steps of the process are carried out separately for each database and including indexing the document at block 112, creating an inverted index at block 114 and conducting a search and providing an output at block 116. While the steps are the same regardless of which type of database is used, each database is kept separate and each is searched separately and each generates separate results. Since this same structure could be applied to any number of separate databases, this system could expand to support the number of languages desired.

However, some technical documents are written in a native language (such as Spanish) but use technical terms from another language (for example, from English). In such a system, searching the national language database for the national language equivalent of a search term will not find the search term if it is included in the document in another language.

FIG. 2 illustrates a system for merging documents in different languages into a single index. As shown in this Figure, documents in a first language (English) are represented by the symbol 202 and documents in a second language (a national language) are represented by the symbol 204. Keywords are identified from each document in a step 206, then translated into each supported language at block 208. Separate indices 210, 212 in each language are created from the translated keywords. Then, an inverted index 214 is created from the translated keywords. The translation of keywords is preferably accomplished using a keyword dictionary 220 which included words in English associated with the corresponding keywords in the national language (and vice versa) to form a synonym listing which effectively translates a keyword in one language into the corresponding term in another language (and vice versa). This listing of synonyms accomplishes the translation of keywords in the creation of the indices and for later searching as will be described in connection with FIG. 3. In order to manage various languages, it is proposed to translate each term using the Unicode system (UTF8), although any other system which is accurate and consistent could also be used to advantage in the present invention.

Thus, the process of creating an inverted index involves steps of creating in block 232 an index in each language and in creating a merged inverted index in block 234 using the keyword dictionary 220 which includes synonyms in each supported language. While two languages are shown in the figures of the present invention, the present invention can easily be expanded to support the desired number of languages, and, while English is described as one language for the documents and for the searches, the present invention is not limited to serving documents in English and another language could be substituted, if desired.

FIG. 3 illustrates a search system of the type which is useful in the present invention. A query is input at block 310 then passed to a keyword dictionary represented by block 320. The keyword dictionary 320 includes a bi-directional translation system which translates keywords from the English (or first) language 322 to a national (or second) language 324 and vice versa, using, in its preferred embodiment, a stored synonym list in the form of a bi-directional table such as is illustrated and described later, particularly in connection with FIG. 5. The synonym table is designed to support a plurality of languages and allow translation between the supported languages. The result is a pair of queries, one query 330 in the first language (e.g., English) and a second query 340 in a second language (such as the national language). The English language query 330 is applied against both the English inverted index 334 and the national language index 334, and the national language query 340 is applied against the national language index 344, and generate results: an English-language hitlist 338 and a national language hitlist 348. The user then can select (represented by the box 350) which results are of interest to the user, at least to start the process, since it is possible that the user will select one, determine that it is inappropriate and try another selection. If the user has limited capabilities in understanding English, he may prefer to look at the results 348 in the national language. If the national language results 348 are not sufficient (or nonexistent), then he may go on to the English language results 338. In the alternative, the user may recognize that the results of interest are most likely to be the English results 338 and may start with those results. In another alternative, the user finds so many results in English that he decides to review the more selective list in his national language.

FIG. 4 illustrates a flow chart of one process of practicing the present invention. As shown in this FIG. 4, the process begins with a determination of the language of the user and whether the user wishes to limit his universe to documents written in his native language. The first step is to determine the language of the user at block 410. Perhaps the user has entered his native or national language or perhaps it is determined from his entries, such as a query in a given language. Then, at block 420 the user enters the query in terms of keywords. Those keywords are translated at block 430 and the queries produced are submitted to the searching mechanism at block 440. Results are obtained at the block 450 and a set of results may be selected at block 460.

In FIG. 5, a portion of synonym table is shown by the reference numeral 500. The table includes a plurality of columns, each associated with a different language. In the FIG. 5 as shown, these supported languages are English in column 510, Spanish in column 520, French in column 530 and Italian in column 540. An additional column 550 is shown provided for another language such as German or Japanese, recognizing, of course, that some languages have different type of characters from English and some languages have so many different symbols that it may be necessary to use a double byte character set to represent some of such languages like Japanese. Two sets of synonyms are shown in rows in this FIG. 5, one associated with the English word “network” in row 560 and one associated with the English word “processor” in row 570. In practice, the synonym table 500 may have additional columns as desired as shown by the symbol 590 (or may have fewer columns if fewer languages are supported and the selection of supported languages is a matter of design choice and not a feature of the present invention) and will have a row for each keyword, shown by the symbol 580. It is important to note that each entry is associated with a language so that it is possible to associate a word with its language and distinguish between the Spanish word for network (red) from the English word for the color red, if desired. While the table is shown in tabular form for ease in understanding the concept of a synonym table, the table may exist in other known formats in storage according to conventional data processing techniques.

The present invention, it will be recognized, is especially adapted for use in a data processing system such as a general purpose computer with a stored program containing computer program means including a plurality of instructions. Those instructions will generally be written in a high level language which is readable by a human and translated into machine language, that is, simple instructions which are understood by the data processing system. In an appropriate instance such instructions could be directly written in a machine language programming language, if desired, a system which allows for efficiency of execution but which is more difficult to program. The present invention is not limited to any particular input language.

As used in the present document, software, computer program and computer program means are used interchangeably. Software in the present context means any expression, in any language, code or notation, of a set of instructions intended to cause a system having an information processing capability to perform a particular function either directly or after either or both of the following a) conversion to another language, code or notation; b) reproduction in a different material form. The use of the Unicode system for managing different languages has been used in the description of the preferred embodiment but other suitable methods for representing different languages could also be used to advantage in the present invention, if desired.

The term national language has been used to represent a language associated with a user of the system. This language could be any language supported by the system, and might include different languages for different users. So, “national language” might represent Spanish for a Mexican or a person from Spain and might represent French for a person from France or other French-speaking locales. Appropriate synonym tables are available for a variety of common languages as are systems for locating key words and separating common text with little uniqueness from key words which are descriptive of the document under consideration. Such key word locating systems are often technologically directed and identify words which are of interest to the technology under consideration.

Of course, many modifications of the present invention will be apparent to those skilled in the relevant art in view of the foregoing description of the preferred embodiment, taken together with the accompanying drawings and the appended claims. For example, the present invention has been described in connection with documents and searches in English and in a national language whereas the number of supported languages need not be 2 and need not be a single national language. Further, in some circumstances, the documents could be written in a combination of supported languages. Additionally, some elements of the present invention can be used to advantage without the corresponding use of other elements. For example, the use of the synonym or keyword dictionary is not the only way to accomplish the translation of keywords into other language. Further, various other devices could be substituted to advantage depending on the environmental circumstances. Accordingly, the foregoing description of the preferred embodiment should be considered as merely illustrative of the principles of the present invention and not in limitation thereof.

INVENTORS:

Drissi, Youssef, Kim, Moon Ju, Kozakov, Lev, Leon Rodriguez, Juan

THIS PATENT IS REFERENCED BY THESE PATENTS:

Patent	Priority	Assignee	Title
10671251,	Dec 22 2017	FATHOM TECHNOLOGIES, LLC	Interactive eReader interface generation based on synchronization of textual and audial descriptors
11443646,	Dec 22 2017	FATHOM TECHNOLOGIES, LLC	E-Reader interface system with audio and highlighting synchronization for digital books
11657725,	Dec 22 2017	FATHOM TECHNOLOGIES, LLC	E-reader interface system with audio and highlighting synchronization for digital books

THIS PATENT REFERENCES THESE PATENTS:

Patent	Priority	Assignee	Title
5136505,	Aug 03 1988	Sharp Kabushiki Kaisha	Electronic translator apparatus for translating words or phrases and auxiliary information related to the words or phrases
5398302,	Feb 07 1990		Method and apparatus for adaptive learning in neural networks
5499366,	Aug 15 1991	Borland Software Corporation	System and methods for generation of design images based on user design inputs
5737734,	Sep 15 1995	BHW INFO EDCO COM, LLC	Query word relevance adjustment in a search of an information retrieval system
5794178,	Sep 20 1993	Fair Isaac Corporation	Visualization of information using graphical representations of context vector based relationships and attributes
5819263,	Jul 19 1996	AMERIPRISE FINANCIAL, INC	Financial planning system incorporating relationship and group management
5878423,	Apr 21 1997	GOOGLE LLC	Dynamically processing an index to create an ordered set of questions
5893092,	Dec 06 1994	University of Central Florida Research Foundation, Inc	Relevancy ranking using statistical ranking, semantics, relevancy feedback and small pieces of text
5899991,	May 12 1997	Teleran Technologies, L.P.	Modeling technique for system access control and management
5956708,	Mar 06 1997	International Business Machines Corporation	Integration of link generation, cross-author user navigation, and reuse identification in authoring process
5956711,	Jan 16 1997	Walter J., Sullivan, III	Database system with restricted keyword list and bi-directional keyword translation
5956740,	Oct 23 1996	Mazda Motor Corporation	Document searching system for multilingual documents
5987457,	Nov 25 1997	HANGER SOLUTIONS, LLC	Query refinement method for searching documents
5991713,	Nov 26 1997	GOOGLE LLC	Efficient method for compressing, storing, searching and transmitting natural language text
6005860,	May 30 1997	HANGER SOLUTIONS, LLC	Using a routing architecture to route information between an orignation module and a destination module in an information retrieval system
6008817,	Dec 31 1997	Comparative Visual Assessments, Inc.; COMPARATIVE VISUAL ASSESSMENTS, INC	Comparative visual assessment system and method
6041326,	Nov 14 1997	International Business Machines Corporation	Method and system in a computer network for an intelligent search engine
6055528,	Jul 25 1997	JUSTSYSTEMS EVANS RESEARCH INC	Method for cross-linguistic document retrieval
6065026,	Jan 09 1997	CANTOR FITZGERALD SECURITIES, AS SUCCESSOR AGENT; GLEACHER PRODUCTS CORP , AS ADMINISTRATIVE AGENT AND COLLATERAL AGENT	Multi-user electronic document authoring system with prompted updating of shared language
6081774,	Aug 22 1997	RPX Corporation	Natural language information retrieval system and method
6085162,	Oct 18 1996	Gedanken Corporation	Translation system and method in which words are translated by a specialized dictionary and then a general dictionary
6085186,	Sep 20 1996	AT HOME BONDHOLDERS LIQUIDATING TRUST	Method and system using information written in a wrapper description language to execute query on a network
6094647,	Jun 14 1989	GOOGLE LLC	Presearch type document search method and apparatus
6102969,	Sep 20 1996	AT HOME BONDHOLDERS LIQUIDATING TRUST	Method and system using information written in a wrapper description language to execute query on a network
6111572,	Sep 10 1998	International Business Machines Corporation	Runtime locale-sensitive switching of calendars in a distributed computer enterprise environment
6141005,	Sep 10 1998	INTERNATIONAL BUSINESS MACHINES CORPORATIONB	Combined display of locale-sensitive calendars in a distributed computer enterprise environment
6163785,	Sep 04 1992	Caterpillar Inc.	Integrated authoring and translation system
6169986,	Jun 15 1998	Amazon Technologies, Inc	System and method for refining search queries
6219646,	Oct 18 1996	Gedanken Corp.	Methods and apparatus for translating between languages
6226638,	Mar 18 1998	Fujitsu Limited	Information searching apparatus for displaying an expansion history and its method
6237011,	Oct 08 1997	Nuance Communications, Inc	Computer-based document management system
6240408,	Jun 08 1998	KCSL, Inc.	Method and system for retrieving relevant documents from a database
6240412,	Mar 06 1997	International Business Machines Corporation	Integration of link generation, cross-author user navigation, and reuse identification in authoring process
6259933,	Jul 20 1998	Lucent Technologies Inc	Integrated radio and directional antenna system
6262725,	Sep 10 1998	International Business Machines Corporation	Method for displaying holidays in a locale-sensitive manner across distributed computer enterprise locales
6275789,	Dec 18 1998		Method and apparatus for performing full bidirectional translation between a source language and a linked alternative language
6275810,	Sep 10 1998	International Business Corporation	Method for scheduling holidays in distributed computer enterprise locales
6278967,	Aug 31 1992	CANTENA SERVICE AGENT CORPORATION; CATENA SERVICE AGENT CORPORATION	Automated system for generating natural language translations that are domain-specific, grammar rule-based, and/or based on part-of-speech analysis
6327590,	May 05 1999	GOOGLE LLC	System and method for collaborative ranking of search results employing user and group profiles derived from document collection content analysis
6338055,	Dec 07 1998	INNOVATION TECHNOLOGY GROUP, INC	Real-time query optimization in a decision support system
6349307,	Dec 28 1998	Gula Consulting Limited Liability Company	Cooperative topical servers with automatic prefiltering and routing
6360196,	May 20 1998	Sharp Kabushiki Kaisha	Method of and apparatus for retrieving information and storage medium
6424973,	Jul 24 1998	Jarg Corporation	Search system and method based on multiple ontologies
6453159,	Feb 25 1999	Extreme Networks, Inc	Multi-level encryption system for wireless network
6463430,	Jul 10 2000	KOFAX, INC	Devices and methods for generating and managing a database
6516312,	Apr 04 2000	International Business Machine Corporation	System and method for dynamically associating keywords with domain-specific search engine queries
6523026,	Feb 08 1999	Huntsman International LLC	Method for retrieving semantically distant analogies
6526440,	Jan 30 2001	GOOGLE LLC	Ranking search results by reranking the results based on local inter-connectivity
6560634,	Aug 15 1997	Verisign, Inc; VERISIGN REGISTRY SERVICES, INC	Method of determining unavailability of an internet domain name
6571249,	Sep 27 2000	Siemens Aktiengesellschaft	Management of query result complexity in hierarchical query result data structure using balanced space cubes
6581072,	May 18 2000	HEWLETT-PACKARD DEVELOPMENT COMPANY, L P	Techniques for identifying and accessing information of interest to a user in a network environment without compromising the user's privacy
6602300,	Feb 03 1998	Fujitsu Limited	Apparatus and method for retrieving data from a document database
6604099,	Mar 20 2000	GOOGLE LLC	Majority schema in semi-structured data
6604101,	Jun 28 2000	QNATURALLY SYSTEMS INC	Method and system for translingual translation of query and search and retrieval of multilingual information on a computer network
6629097,	Apr 28 1999	Douglas K., Keith	Displaying implicit associations among items in loosely-structured data sets
6636848,	May 31 2000	International Business Machines Corporation	Information search using knowledge agents
6643661,	Apr 27 2000	Oracle International Corporation	Method and apparatus for implementing search and channel features in an enterprise-wide computer system
6654734,	Aug 30 2000	GOOGLE LLC	System and method for query processing and optimization for XML repositories
6711568,	Nov 25 1997	R2 SOLUTIONS LLC	Method for estimating coverage of web search engines
6718333,	Jul 15 1998	NEC Corporation	Structured document classification device, structured document search system, and computer-readable memory causing a computer to function as the same
6738764,	May 08 2001	VALTRUS INNOVATIONS LIMITED	Apparatus and method for adaptively ranking search results
6738767,	Mar 20 2000	Meta Platforms, Inc	System and method for discovering schematic structure in hypertext documents
6766316,	Jan 18 2001	Leidos, Inc	Method and system of ranking and clustering for document indexing and retrieval
6772150,	Dec 10 1999	A9 COM, INC	Search query refinement using related search phrases
6778979,	Aug 13 2001	III Holdings 6, LLC	System for automatically generating queries
6813496,	Jul 30 1999	Nokia Corporation	Network access control
6829599,	Oct 02 2002	Xerox Corporation	System and method for improving answer relevance in meta-search engines
6836777,	Nov 15 2001	WHOWHATWARE, LLC	System and method for constructing generic analytical database applications
6901399,	Jul 22 1997	Microsoft Technology Licensing, LLC	System for processing textual inputs using natural language processing techniques
6928432,	Apr 24 2000	BOARD OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIVERSITY, THE	System and method for indexing electronic text
6941294,	Aug 28 2000	BEN GROUP, INC	Method and apparatus for digital media management, retrieval, and collaboration
6952691,	Feb 01 2002	International Business Machines Corporation	Method and system for searching a multi-lingual database
7027974,	Oct 27 2000	Leidos, Inc	Ontology-based parser for natural language processing
7039625,	Nov 22 2002	International Business Machines Corporation	International information search and delivery system providing search results personalized to a particular natural language
7051023,	Apr 04 2003	R2 SOLUTIONS LLC	Systems and methods for generating concept units from search queries
7117199,	Feb 22 2000	Nokia Technologies Oy	Spatially coding and displaying information
7124364,	Nov 21 2001	Contecs:DD LLC	Data dictionary method
7127456,	Dec 05 2002	TERADATA US, INC	System and method for logging database queries
7136845,	Jul 12 2001	Microsoft Technology Licensing, LLC	System and method for query refinement to enable improved searching based on identifying and utilizing popular concepts related to users' queries
7174564,	Sep 03 1999	Intel Corporation	Secure wireless local area network
7197508,	Jul 25 2003		System and method for obtaining, evaluating, and reporting market information
7318057,	May 31 2000	International Business Machines Corporation	Information search using knowledge agents
20010021947,
20020002452,
20020007364,
20020007384,
20020016787,
20020042789,
20020059289,
20020091671,
20020095594,
20020095621,
20020107992,
20020156776,
20020156792,
20020184206,
20030126136,
20030142128,
20030144982,
20030149686,
20030149687,
20030177111,
20030221171,
20030225722,
20030225747,
20040019588,
20040024745,
20040024748,
20040030690,
20040044669,
20040068486,
20040111408,
20040181511,
20040181525,
20040205656,
20040214570,
20040220905,
20040249808,
20040254920,
20050055341,
20050065773,
20050065774,
20050154708,
20060036588,
20060191996,
20090036159,
EP851368,
EP964344,
EP1072984,
JP10187752,
JP11219368,
WO201400,

ASSIGNMENT RECORDS Assignment records on the USPTO

Executed on	Assignor	Assignee	Conveyance	Frame	Reel	Doc
Aug 21 2008		International Business Machines Corporation	(assignment on the face of the patent)

MAINTENANCE FEES AND DATES: Maintenance records on the USPTO

Date	Maintenance Fee Events
Jan 21 2015	M1551: Payment of Maintenance Fee, 4th Year, Large Entity.
Jan 15 2019	M1552: Payment of Maintenance Fee, 8th Year, Large Entity.
May 15 2023	REM: Maintenance Fee Reminder Mailed.
Oct 30 2023	EXP: Patent Expired for Failure to Pay Maintenance Fees.

Date	Maintenance Schedule
Sep 27 2014	4 years fee payment window open
Mar 27 2015	6 months grace period start (w surcharge)
Sep 27 2015	patent expiry (for year 4)
Sep 27 2017	2 years to revive unintentionally abandoned end. (for year 4)
Sep 27 2018	8 years fee payment window open
Mar 27 2019	6 months grace period start (w surcharge)
Sep 27 2019	patent expiry (for year 8)
Sep 27 2021	2 years to revive unintentionally abandoned end. (for year 8)
Sep 27 2022	12 years fee payment window open
Mar 27 2023	6 months grace period start (w surcharge)
Sep 27 2023	patent expiry (for year 12)
Sep 27 2025	2 years to revive unintentionally abandoned end. (for year 12)