The present invention provides a method and system for efficient information storage and retrieval of information. The method includes the steps of: scanning/selecting/capturing a selected portion of text of the information wherein the selected portion of text scanned is typically a close-to-unique identifier of the text from which the portion was excerpted and serves as a key when the information is accessed electronically; and placing the key in an electronically available index/directory to facilitate retrieval of the information. The method may further include retrieving and storing the information associated with the key and using it to index, organize, and make available for search and retrieval the full information originally viewed by the user.
|
1. A method for efficient information storage and retrieval of information, comprising the steps of:
capturing with a scanner a selected portion of text of printed information wherein the selected portion of text captured is user defined topic classification indicia keys with a close-to-unique identifier of the text for recommending user defined topic matters, and serves as a key when the information is accessed electronically;
placing the key in an electronically available index to facilitate retrieval of the information; and
recommending topic matters based on the user defined topic classification indicia keys of previously scanned text of printed information and previously stored electronic information that are related to the printed information being captured during retrieval of the printed information.
18. A computer-readable medium having computer-executable instructions for performing steps for efficient information storage and retrieval of information, comprising the steps of:
capturing with a scanner a selected portion of text of printed information wherein the selected portion of text scanned is a user defined topic classification indicia key with a close-to-unique identifier of the text and serves as a topic classification key for recommending user defined topic matters and where the information is stored and accessed electronically;
placing the key in an electronically available index to facilitate retrieval of the information; and
recommending topic matters based on the user defined topic classification indicia keys of previously scanned text of printed information and previously stored electronic information that are related to the information being captured during retrieval of the printed information.
27. A computer-readable medium having computer-executable instructions for performing steps for efficient information storage and retrieval of information, comprising the steps of:
capturing with a scanner a selected portion of text of incoming printed information wherein the selected portion of text scanned is user defined topic classification indicia keys with a close-to-unique identifier of the text and serves as a topic classification key for recommending user defined topic matters when the incoming printed information is stored and accessed electronically;
placing the key in an electronically available index on a web page accessible area to facilitate search and retrieval of desired incoming information; and
recommending topic matters based on the user defined topic classification indicia keys of previously scanned text of printed information and previously stored electronic information that are related to the information being captured during retrieval of the printed information.
9. A system for efficient information storage and retrieval of information, comprising:
an index information retrieval unit, arranged to send a selected portion of text of printed information to an index storage unit, for capturing with a scanner the selected portion of text of the information wherein the selected portion of text is a user defined topic classification indicia key with a close-to-unique identifier of the text and serves as a topic classification key for recommending user defined topic matters when the information is desired to be accessed electronically;
the index storage unit, arranged to receive and store the user defined topic classification indicia key from the index information retrieval unit, for placing the key in an electronically available index to facilitate retrieval of the information from a storage medium that is accessible electronically; and
the storage medium that is accessible electronically, wherein the storage medium is arranged to be searched using the key held in the index to identify information in the storage medium that corresponds to the key,
wherein a computer unit is arranged to send and receive information from and to the index storage unit and the storage medium, and is arranged to request retrieval of information based on the key and configured for recommending topic matters based on keys of previously scanned text of printed information and previously stored electronic information that are related to the printed information being captured during retrieval of the information.
2. The method of
3. The method of
4. The method of
5. The method of
6. The method of
7. The method of
8. The method of
10. The system of
11. The system of
12. The system of
13. The system of
15. The system of
16. The system of
17. The system of
19. The computer-readable medium of
20. The computer-readable medium of
21. The computer-readable medium of
22. The computer-readable medium of
23. The computer-readable medium of
24. The computer-readable medium of
25. The computer-readable medium of
26. The computer-readable medium of
28. The computer-readable medium of
29. The computer-readable medium of
30. The computer-readable medium of
31. The computer-readable medium of
32. The computer-readable medium of
33. The computer-readable medium of
34. The computer-readable medium of
|
The present invention relates generally to information organization and retrieval, and more particularly to facilitating organization and retrieval of electronic information.
The amount of information generated and available in today's environment tends to be overwhelming. The most important information to an individual is the information that he/she has examined and determined to be useful to him/her. Today, most important information is available or passes through electronic form, even though it may ultimately be distributed primarily in printed form. When information is received electronically, such information takes little physical space and may be indexed manually or using an automatic index and classification system so that it may be stored and retrieved at a later time. However, when the information is printed, it is often inconvenient or difficult to connect the printed information with its electronic counterpart. This may be true even if the individual printed the information from its electronic form, such with a personal printer.
Often, the individual receiving the information may wish to read it and keep it for future reference, but does not have the opportunity to read the information when it is received. For example, people may quickly review a magazine and bend down page corners of interesting articles or rip out the articles, often reading only headlines or a one to two paragraphs of the article. Items of interest, such as travel, ornithology, enology, viticulture and gardening may be placed on a countertop in a “holding” pattern, stacked in piles, put in boxes and folders, and moved several times before either being filed in a huge, often unsorted, pile or folder to be read at some future date, or filed in the trash.
Many people have no time for categorizing and organizing this saved information, and it ends up in a growing number of piles, often with years worth of unsorted information that is difficult to access in an efficient manner. Hence, the original purpose of saving the article for later reference or reading is defeated. For example, consider the subject of travel. An individual may sort all travel articles into a folder; if not, the travel articles are all mixed up in multiple places. Among the mass of saved articles, there may be an article on “The Best Places to Stay in Hawaii”, another article on “Maui's Best Molokini Excursions”, and many other articles on Europe, Tahiti, and the Virgin Islands.
If the family desires to go to Maui, finding the needed information in the saved articles may present a large task. First, the general location of the travel articles in piles or folders must be determined. Then, if the articles are categorized, the individual must sort through the articles. If the articles were classified rather definitively, under what title would the articles on the places to stay in Hawaii be stored? In a Hawaii folder? Are there sub-folders for each island? If the articles are not categorized, it will be even more difficult since the individual will have to sort the entire pile or box/boxes of articles. Clearly, organizing and retrieving the desired information that may be useful in the future is not a trivial task and is time-consuming.
Thus, there is a need for a method and system to facilitate organizing and retrieving both printed and electronic information efficiently.
The present invention provides a method for efficient information storage and retrieval of information. The method includes the step of scanning/selecting/capturing a selected portion of text of the information wherein the selected portion of text captured is a fragment of information content that may be used as an approximate identifier or “key” for the entire information and where the information may be stored and accessed electronically. Then, the selected portion of the text or “key” is placed in an electronically available index/directory to facilitate retrieval of the information from a storage medium having the information stored electronically.
The present invention may also be utilized as a system for efficient information storage and retrieval of information. In this embodiment, the invention includes an index/directory information retrieval unit, an index/directory storage unit, and a storage medium. The index/directory information retrieval unit is arranged to send a portion of text of the information to an index/directory storage unit. The index/directory information retrieval unit is used for scanning/selecting/capturing the portion of text of the information wherein the portion of text is a fragment of information content that may be used as an approximate identifier or “key” for the entire information. The information is stored and accessed electronically. The index/directory storage unit is arranged to receive and store the key from the index/directory information retrieval unit. The index/directory storage unit is used for storage of the key in an electronically available index/directory to facilitate retrieval of the information from a storage medium that is accessible electronically. The index/directory is arranged to be searched to identify information in the storage medium that corresponds to the key. Where the information is already in electronic form, the information is stored in a storage medium and made available for search and retrieval based on text fragments. Where the information is printed, the information is scanned and processed to facilitate search and retrieval based on text fragments and also ideally retain the form and presentation of the information to provide an electronic copy of the information which is then stored in a storage medium and made available for search and retrieval based on text fragments. For example, the information may be stored in a personal or organization-wide database, or may be from a commercially accessible database. Typically, a computer/user unit is coupled to or arranged to receive/send information from/to the index/directory unit and the storage medium and is used to request retrieval of information based on the key.
In one embodiment, the invention may be implemented by a computer-readable medium that has computer-executable instructions for performing steps for efficient information storage and retrieval of information. The computer-executable instructions perform the steps described above.
The present invention provides a method and device for organizing and retrieving both printed and electronic information in which the information is identified by the person viewing the information by selection of a portion of the text, using the selection as a close-to-unique identifying “key”, and where the information is not in electronic form, obtaining an electronic copy of the information and storing the information electronically, thus allowing one to locate the information quickly using the key. The invention applies to any given package of information from small to large that includes written or spoken language that can be processed as text sequences, and may also include other forms of information such as charts, pictures, audio, and video. For example, common information packages include an article and its associated charts or graphics, a book, a document, a webpage, a letter, a resume, a form, a press release, and so forth. While the term “article” is used for clarity of writing, those skilled in the art will understand it to be representative of many kinds of information packages.
For example, as shown in
A scanner, such as are available commercially presently, may be used to implement the capturing of text. The scanner captures the text 102, either as an image that facilitates later Optical Character Recognition (OCR) on a different system or doing OCR directly to produce the text. The information captured by the scanner is sent to one or more systems that process the scan. The scanner may be coupled to the system in any desired known manner such as, for example, using infrared (IR) transmission, a synchronizing system, a Universal Serial Bus (USB) connection or wireless or the like. While any scanner may be used, the use of small hand scanners that do OCR onboard the scanner, such as the C-Pen 600, which is available from C Technologies AB, Ideon Research Park, Scheelevagen 15, SE-223 70 LUND, Sweden, is quite convenient. The C-Pen 600 is as small as a highlighter pen and is easy to use on any size and shape paper, making scanning especially practical and easy.
Given the richness of human language, the small amount of text typically is a close-to-unique identifier of the complete article from which it was excerpted and can be used as a key. The key is stored in an index or directory 104. In one embodiment of the invention, when a new key is added to the index/directory and found to be in use already, the system sends a message to the user that the key is already in use. Generally, this will not be a problem, as the number of matching articles will be small, but it may train the user to scan or elect longer text fragments in the future. The key may be stored in an index/directory 104, and the index/directory may be used to locate the article or set of articles that match the key. Where desired, the index/directory containing the keys may be managed by a personal database, organization-wide database, or service operated by other organizations.
Where the article is in electronic form, the article may be stored in an electronic database. In one embodiment of the present invention, the article in toto is stored electronically in a database 106 to improve responsiveness and reduce dependence for article access on remotely operated systems. Clearly, this may be accomplished by scanning the article, or may be accomplished by utilizing information electronically available from commercial companies such as LexisNexis®, NorthernLight®, and the like, which have undertaken to collect, index, classify, and make available electronic versions of thousands of print collections, and information printed from electronic content located on the Internet.
Both the keys and the articles may be used to provide useful classifications and indices and to allow search, discovery and retrieval of information. Systems may be used to provide personalized keyword indices, subject and other classifications and to allow for searching through a set or subset of the information identified as being of interest. Optionally, systems may recommend relevant and/or related to the topic matters of the information being captured or retrieved such as URLs, maps, books, and other related articles. Systems may also be used to provide desired information such as personal notes for the user on a particular topic or pictures.
In practice, as information is read, users no longer need to rip out articles or fold corners of articles of interest and stack them in piles or organize them in folders. Instead, users may scan a few lines of text from the articles, for example, using a small hand scanner, where the scanned lines are used as keys and occasionally upload the scanned keys to a system that may locate and process the electronic version of the articles referenced by the keys. Thus, at a future time, when the person is interested in retrieving the articles, key words may be entered, for example on a personal computer 108, and the search conducted across the set of articles 110 that have been identified by the user as being of interest. In the example shown in
The present invention provides for, upon a user's reading an article in print or electronically, wherein the article is of interest to the user, the user's scanning/selecting/capturing a small selected amount of text. As used hereinafter, and for purposes of claim construction, the term “scanning/selecting/capturing” refers to substantially equivalent processes by which a user who wishes to retrieve at a later date, an article or document of interest, first identifies or selects some small portion or aspect of the article or document to be used a key, or index, to retrieve the entire document, which is electronically stored elsewhere. In the case of documents or printed articles, the selected key can be “scanned” and turned into computer-storable data using an optical scanner. In the case of a document in an electronic form, the portion of the electronic document selected to be a key can be simply “selected” much as text is selected using a word processor by holding down certain keys and dragging a mouse pointer icon over portions of text on screen. In the case of graphics e.g. a portion of a picture might be “captured” or a spoken description thereof might be “captured.” For purposes of claim construction, any process or data that provides for the digitization of information from or describing a document, and that is used as an index to the stored versions of such documents, is considered to be equivalent to scanning or selecting or otherwise capturing (“scanning/selecting/capturing”) a portion of the document as a key. The scanned text is used as a close-to-unique identifier or key for the article. When the key is uploaded to a processing system that has access to a collection of electronic versions of printed information as well as electronic information, the key may be used to locate the electronic version of the printed/electronic article. Once the electronic version of the article is available, indices and classifications of the article may be created, manually or automatically, that allow for rapid search and retrieval using words, concepts, classifications, and the like that apply to the whole article and not just the key. Where desired, such searches may be limited only to articles that the user has seen and identified to be of interest, and hence has a known level of interest to the person. However, the present invention may also include, upon searching, providing a list of associations with other articles that may be relevant to the user, but which the user has not yet reviewed yet may be likely to desire to access.
As shown in
As shown in
The keys and/or information to be stored electronically may be transmitted using an infrared connection, a wired connection, and/or a telephone connection. The keys and/or information may be sent to a personal computer or other device having a storage medium. A camera using a wireless, wired, or infrared connection may be arranged to provide information to the system.
Although the present invention has been described in relation to particular preferred embodiments thereof, many variations, equivalents, modifications and other uses will become apparent to those skilled in the art. It is preferred, therefore, that the present invention be limited not by the specific disclosure herein, but only by the appended claims.
Patent | Priority | Assignee | Title |
7801905, | Nov 25 2003 | APPLIED E, INC | Knowledge archival and recollection systems and methods |
8179563, | Aug 23 2004 | Kyocera Corporation | Portable scanning device |
8301611, | Jan 09 2008 | GEMINI LEGAL SUPPORT, INC | Records management system and method |
8418055, | Feb 18 2009 | Kyocera Corporation | Identifying a document by performing spectral analysis on the contents of the document |
8447066, | Mar 12 2009 | Kyocera Corporation | Performing actions based on capturing information from rendered documents, such as documents under copyright |
8458155, | Jan 09 2008 | GEMINI LEGAL SUPPORT, INC | Records management system and method with excerpts |
8505090, | Apr 01 2004 | Kyocera Corporation | Archive of text captures from rendered documents |
8600196, | Sep 08 2006 | Kyocera Corporation | Optical scanners, such as hand-held optical scanners |
8620083, | Dec 03 2004 | Kyocera Corporation | Method and system for character recognition |
8638363, | Feb 18 2009 | Kyocera Corporation | Automatically capturing information, such as capturing information using a document-aware device |
8781228, | Apr 01 2004 | Kyocera Corporation | Triggering actions in response to optically or acoustically capturing keywords from a rendered document |
8799099, | May 17 2004 | Kyocera Corporation | Processing techniques for text capture from a rendered document |
8831365, | Apr 01 2004 | Kyocera Corporation | Capturing text from rendered documents using supplement information |
8874504, | Dec 03 2004 | Kyocera Corporation | Processing techniques for visual capture data from a rendered document |
8892495, | Feb 01 1999 | Blanding Hovenweep, LLC; HOFFBERG FAMILY TRUST 1 | Adaptive pattern recognition based controller apparatus and method and human-interface therefore |
8953886, | Aug 23 2004 | Kyocera Corporation | Method and system for character recognition |
8990235, | Mar 12 2009 | Kyocera Corporation | Automatically providing content associated with captured information, such as information captured in real-time |
9075779, | Mar 12 2009 | Kyocera Corporation | Performing actions based on capturing information from rendered documents, such as documents under copyright |
9081799, | Dec 04 2009 | GOOGLE LLC | Using gestalt information to identify locations in printed information |
9116890, | Apr 01 2004 | Kyocera Corporation | Triggering actions in response to optically or acoustically capturing keywords from a rendered document |
9143638, | Apr 01 2004 | Kyocera Corporation | Data capture from rendered documents using handheld device |
9268852, | Apr 01 2004 | Kyocera Corporation | Search engines and systems with handheld document data capture devices |
9275051, | Jul 19 2004 | Kyocera Corporation | Automatic modification of web pages |
9305030, | Jan 09 2008 | GEMINI LEGAL SUPPORT, INC | Records management system and methods |
9323784, | Dec 09 2009 | Kyocera Corporation | Image search using text-based elements within the contents of images |
9514134, | Apr 01 2004 | Kyocera Corporation | Triggering actions in response to optically or acoustically capturing keywords from a rendered document |
9535563, | Feb 01 1999 | Blanding Hovenweep, LLC; HOFFBERG FAMILY TRUST 1 | Internet appliance system and method |
9633013, | Apr 01 2004 | Kyocera Corporation | Triggering actions in response to optically or acoustically capturing keywords from a rendered document |
RE46881, | Apr 05 2004 | appliedE, Inc. | Knowledge archival and recollection systems and methods |
Patent | Priority | Assignee | Title |
4965763, | Mar 03 1987 | International Business Machines Corporation | Computer method for automatic extraction of commonly specified information from business correspondence |
5265242, | Aug 23 1985 | Document retrieval system for displaying document image data with inputted bibliographic items and character string selected from multiple character candidates | |
5819259, | Dec 17 1992 | HARTFORD FIRE INSURANCE COMPANY | Searching media and text information and categorizing the same employing expert system apparatus and methods |
5991755, | Nov 29 1995 | Panasonic Intellectual Property Corporation of America | Document retrieval system for retrieving a necessary document |
6038561, | Oct 15 1996 | IP COM I, LLC | Management and analysis of document information text |
6094649, | Dec 22 1997 | HANGER SOLUTIONS, LLC | Keyword searches of structured databases |
6178417, | Jun 29 1998 | Xerox Corporation | Method and means of matching documents based on text genre |
6182090, | Apr 28 1995 | Ricoh Company, Ltd. | Method and apparatus for pointing to documents electronically using features extracted from a scanned icon representing a destination |
6208988, | Jun 01 1998 | BHW INFO EDCO COM, LLC | Method for identifying themes associated with a search query using metadata and for organizing documents responsive to the search query in accordance with the themes |
6263121, | Sep 16 1998 | Canon Kabushiki Kaisha | Archival and retrieval of similar documents |
6327589, | Jun 24 1998 | Microsoft Technology Licensing, LLC | Method for searching a file having a format unsupported by a search engine |
6446061, | Jul 31 1998 | International Business Machines Corporation | Taxonomy generation for document collections |
6505196, | Feb 23 1999 | Clinical Focus, Inc. | Method and apparatus for improving access to literature |
6522782, | Dec 15 2000 | Meta Platforms, Inc | Image and text searching techniques |
6625624, | Feb 03 1999 | AT&T Corp | Information access system and method for archiving web pages |
6678694, | Nov 08 2000 | COGIA GMBH | Indexed, extensible, interactive document retrieval system |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Nov 17 2000 | VAN ZEE, PIETER J | Hewlett-Packard Company | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 011689 | /0393 | |
Dec 15 2000 | Hewlett-Packard Development Company, L.P. | (assignment on the face of the patent) | / | |||
Sep 26 2003 | Hewlett-Packard Company | HEWLETT-PACKARD DEVELOPMENT COMPANY L P | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 014061 | /0492 |
Date | Maintenance Fee Events |
Dec 08 2008 | M1551: Payment of Maintenance Fee, 4th Year, Large Entity. |
Oct 02 2012 | M1552: Payment of Maintenance Fee, 8th Year, Large Entity. |
Sep 26 2016 | M1553: Payment of Maintenance Fee, 12th Year, Large Entity. |
Date | Maintenance Schedule |
Jun 07 2008 | 4 years fee payment window open |
Dec 07 2008 | 6 months grace period start (w surcharge) |
Jun 07 2009 | patent expiry (for year 4) |
Jun 07 2011 | 2 years to revive unintentionally abandoned end. (for year 4) |
Jun 07 2012 | 8 years fee payment window open |
Dec 07 2012 | 6 months grace period start (w surcharge) |
Jun 07 2013 | patent expiry (for year 8) |
Jun 07 2015 | 2 years to revive unintentionally abandoned end. (for year 8) |
Jun 07 2016 | 12 years fee payment window open |
Dec 07 2016 | 6 months grace period start (w surcharge) |
Jun 07 2017 | patent expiry (for year 12) |
Jun 07 2019 | 2 years to revive unintentionally abandoned end. (for year 12) |