A feature enhancement and data augmentation method for detecting at least one action of at least one tested subject is provided. The feature enhancement and data augmentation method includes obtaining at least one spectrogram; and performing a feature enhancement processing on the at least one spectrogram, to enhance at least one feature corresponding to at least one action in the at least one spectrogram and generate at least one feature enhanced spectrogram. The feature enhancement processing comprises a directional filtering.
|
1. A feature enhancement and data augmentation method, for an action detection device, configured for detecting the at least one action of at least one tested subject, the feature enhancement and data augmentation method comprising:
obtaining, by the action detection device, at least one spectrogram; and
performing, by the action detection device, a feature enhancement processing on the at least one spectrogram, to enhance at least one feature corresponding to the at least one action in the at least one spectrogram and generate at least one feature enhanced spectrogram;
wherein the feature enhancement processing comprises a directional filtering.
11. An action detection device, for detecting the at least one action of at least one tested subject, comprising:
a processor, for executing a program; and
a storage unit, coupled to the processor, for storing the program; wherein the program is utilized for instructing the processor to perform following steps of:
obtaining at least one spectrogram; and
performing a feature enhancement processing on the at least one spectrogram, to enhance at least one feature corresponding to the at least one action in the at least one spectrogram and generate at least one feature enhanced spectrogram;
wherein the feature enhancement processing comprises a directional filtering.
2. The feature enhancement and data augmentation method of
processing at least one radar reflected radio-frequency signal, to obtain the at least one spectrogram, wherein the at least one action comprises a falling action.
3. The feature enhancement and data augmentation method of
4. The feature enhancement and data augmentation method of
5. The feature enhancement and data augmentation method of
6. The feature enhancement and data augmentation method of
7. The feature enhancement and data augmentation method of
8. The feature enhancement and data augmentation method of
processing at least one radar reflected radio-frequency signal, to obtain the at least one spectrogram; and
performing data augmentation on the at least one feature enhanced spectrogram, to generate a plurality of augmented feature enhanced spectrograms to train a deep learning engine;
wherein a number of the plurality of augmented feature enhanced spectrograms is greater than a number of the at least one feature enhanced spectrogram.
9. The feature enhancement and data augmentation method of
performing a non-linear distortion on the at least one feature enhanced spectrogram.
10. The feature enhancement and data augmentation method of
performing at least one shielding on the at least one feature enhanced spectrogram, wherein the at least one shielding does not completely cover a main feature region of the at least one feature enhanced spectrogram.
12. The action detection device of
a radar transceiver circuit, coupled to the processor, for receiving at least one radar reflected radio-frequency signal, and processing the at least one radar reflected radio-frequency signal to obtain a raw signal, wherein the processor converts the raw signal, to obtain at least one spectrogram;
wherein the at least one action comprises a falling action.
13. The action detection device of
14. The action detection device of
15. The action detection device of
16. The action detection device of
17. The action detection device of
18. The action detection device of
a radar transceiver circuit, coupled to the processor, for receiving at least one radar reflected radio-frequency signal, and processing the at least one radar reflected radio-frequency signal to obtain a raw signal, wherein the program further instructs the processor to perform following steps of:
converting the raw signal, to obtain the at least one spectrogram; and
performing data augmentation on the at least one feature enhanced spectrogram, to generate a plurality of augmented feature enhanced spectrograms to train a deep learning engine;
wherein a number of the plurality of augmented feature enhanced spectrograms is greater than a number of the at least one feature enhanced spectrogram.
19. The action detection device of
performing a non-linear distortion on the at least one feature enhanced spectrogram.
20. The action detection device of
performing at least one shielding on the at least one feature enhanced spectrogram, wherein the at least one shielding does not completely cover a main feature region of the at least one feature enhanced spectrogram.
|
The present disclosure relates to a feature enhancement and data augmentation method and action detection device thereof, and more particularly, to a feature enhancement and data augmentation method and action detection device thereof capable of enhancing a feature of a falling action in a spectrogram to avoid false alarms, and increasing the amount of falling data via data augmentation algorithm to strengthen model generalization capability.
For current action detection technology, the action detection technology may determine whether the tested subject (such as the elderly, the patient or the child) has fallen, slipped or collided according to the rapid movement of the tested subject, thereby providing an alarm to inform the caregiver of the tested subject. Falling detection using continuous radar waves is non-imaging type, has good privacy, and may be more user-friendly in addition to long-term monitoring.
In this case, a spectrogram of a potential falling action has random vibration-like noise disturbances, and the main feature for distinguishing falling action and non-falling action is differences in spectrogram energy distribution. However, features of falling actions and some non-falling actions are similar in the spectrogram, and the model is still prone to generate false alarms even after training.
In addition, a part of the training set data is the actual data collected in the case field, and another part is falling data simulated in the laboratory. However, the falling data is difficult to collect, and the simulated self-falling action also results in a certain degree of risk, so the training set data is distributed quite unevenly (falling actions are much less than non-falling actions), so that the weight for the model to learn falling feature may be lower.
Therefore, it is necessary to improve the prior art.
It is therefore an objective of the present disclosure to provide a feature enhancement and data augmentation method and action detection device thereof capable of enhancing a feature of a falling action in a spectrogram to avoid false alarms, and increasing the amount of falling data via data augmentation algorithm to strengthen model generalization capability.
The present disclosure provides a feature enhancement and data augmentation method for detecting at least one action of at least one tested subject is provided. The feature enhancement and data augmentation method includes obtaining at least one spectrogram; and performing a feature enhancement processing on the at least one spectrogram, to enhance at least one feature corresponding to at least one action in the at least one spectrogram and generate at least one feature enhanced spectrogram. The feature enhancement processing comprises a directional filtering.
The present disclosure further provides an action detection device. The action detection device includes a processor, for executing a program; and a storage unit, coupled to the processor, for storing the program. The program is utilized for instructing the processor to perform following steps of obtaining at least one spectrogram; and performing a feature enhancement processing on the at least one spectrogram, to enhance at least one feature corresponding to at least one action in the at least one spectrogram and generate at least one feature enhanced spectrogram. The feature enhancement processing comprises a directional filtering.
These and other objectives of the present invention will no doubt become obvious to those of ordinary skill in the art after reading the following detailed description of the preferred embodiment that is illustrated in the various figures and drawings.
Please refer to
In the present embodiment, the action detection device 10 performs signal processing on the reflected RF signal SR to obtain the raw signal SRD. In the present embodiment, the transceiver circuit of the action detection device 10 is a radar transceiver circuit, and coupled to the processor. The transceiver circuit receives at least one radar reflected RF detection signal SD, and processes the at least one radar reflected RF signal SR to obtain the raw signal SRD. More specifically, the mixer of the radar RF circuit performs down-conversion on the reflected RF signal SR (the radar reflected RF signal SR) to obtain an analog raw signal SRA. Then, the analog to digital converter of the radar RF circuit samples the analog raw signal SRA to obtain the raw signal SRD. In the present embodiment, the raw signal SRD represents Doppler components of the reflected RF signal SR in response to any body action in the field DR. Thus, the action detection device 10 determines whether a tested subject (e.g., a detected object U) in the field DR occurs a specific action according to the raw signal SRD, and provides an alarm ALM corresponding to the specific operation accordingly.
In the present embodiment, the action detection device 10 is a continuous wave (CW) radar or frequency modulated continuous wave (FMCW) radar. The field DR is the detected field of the action detection device 10. The specific action may be falling action, getting up action, rehabilitation action, training action, or other actions. The action detection device 10 may be installed in a space with only one single detected target, such as a room of an elderly living alone, a single room in a nursing center, etc.
In another embodiment of the present disclosure, the action detection device 10 is a frequency modulated continuous wave radar. The field DR is the detected field of the action detection device 10. The action detection device 10 (the frequency modulated continuous wave radar) may be installed in a space with multiple detected targets, such as a gym, indoor stadium, multi-person room in a nursing home and the like. The specific action may be respective falling action, getting up action, rehabilitation action, training action, or other actions of the multiple detected targets.
In this case, please refer to
In this case, the spectrogram TFS of potential falling action has random vibration-like noise disturbances, and the main feature for distinguishing falling action and non-falling action is differences in spectrogram energy distribution. That is, because of the instantaneous action changes, falling action may be more likely produce energy at high frequency components, while non-falling action may cause considerable noise disturbance at low frequency components.
However, please refer to
Please refer
In this case, since a feature related to falling action in the at least one feature enhanced spectrogram FSTFS is enhanced, a deep learning engine trained by the at least one feature enhanced spectrogram FSTFS may effectively and correctly identify a feature enhanced spectrogram corresponding to a falling action, and then the action detection device 10 may generate the alarm ALM accordingly. As a result, the present disclosure may enhance the feature of the falling action in the spectrogram, and improve discernment of features of different actions, to avoid false positive results.
Specifically, after obtaining the raw signal SRD through, for example, a continuous wave radar, the action detection device 10 performs short time Fourier transform (STFT) on the raw signal SRD, thereby obtaining at least one spectrogram TFS. Next, since falling action has dramatic change in velocity, there will be significant energy at high frequency component. Thus, the feature enhancement processing 400 may include a normalization processing, for normalizing the at least one spectrogram TFS along a frequency axis and generating at least one normalized spectrogram NTFS, to effectively make the features of high-frequency energy prominent.
For example, please refer
Please continue to refer to
For example, please refer
Please continue to refer to
In detail, in the application of radar wave for falling detection, the energy of the falling action has an instantaneous energy response at high frequency component, and the response is most significant in a direction of 90° in the spectrogram TFS. Environmental noise will be more concentrated in the low frequency component, and has corresponding energy response along the horizontal axis in the spectrogram TFS, such that the response is significant in a direction of 0°. In this case, please refer
Specifically, please refer
For example, please refer to
It is worth noting that the above embodiment enhances the feature of the falling action in the spectrogram, and improve discernment of features of different actions, to avoid false positive results. Those skilled in the art may make modifications or alterations accordingly, which are not limited to this. For example, the above embodiment enhances the feature of the falling action in the spectrogram to improve discernment from other actions. In other embodiments, corresponding features of other actions in the spectrogram may be enhanced depending on the actual needs, to increase the discernment of other actions.
Furthermore, in the above embodiment, the deep learning engine trained according to the at least one feature enhanced spectrogram FSTFS may effectively and correctly identify a feature enhanced spectrogram corresponding to a falling action. However, as shown in
Specifically, please refer to
On the other hand, the data augmentation processing performs at least one shielding on the feature enhanced spectrogram FSTFS, wherein the at least one shielding does not completely cover a main feature region of the feature enhanced spectrogram FSTFS. For example, as shown in left of
In this case, speech recognition and radar wave for falling detection both utilize spectrograms for machine learning. Because the input of sound includes a variety of noises, it may be regarded as a few words are missing when people speak or words are not clearly pronounced, where people can still recognize the meaning of the sentence according to the context. Similarly, the radar wave also includes a variety of environmental noises (such as arm waving, object falling, etc.) which will cause disturbance to radar wave. In a situation that these events happen alone or together with falling action, if a falling action still needs to be identified, the model needs to have learned generalized feature of events by trained with data which is partially missing or changed.
For example, the original collected training set data only includes 1613 pieces of falling data and 11299 pieces of non-falling data. After data augmentation, the augmented training data set includes 4839 pieces of falling data and 11299 pieces of non-falling data. After a depth learning engine is trained by the augmented training data set to obtain a model and applied in an environment of a bedroom of an elderly living alone in practice, all 75 falling events have been identified, and there is no missing report and no false negative results. There are only 6 false positive (false alarms) in 839 non-falling events. Thus, precision is 100% and recall is 99.3%.
In addition, the action detection device 10 may include a processor, a transceiver circuit, and a storage unit. The processor is electrically connected to the transceiver circuit and the storage unit, respectively. The processor may be a microprocessor or an application-specific integrated circuit (ASIC). The transceiver circuit is a radar transceiver. The RF transceiver includes at least one transmitting antenna and/or at least one receiving antenna, an oscillator, a mixer, a digital to analog converter (DAC), an analog to digital converter (ADC), etc. The storage unit may be any data storage device for storing a code. The processor reads and executes the code, to execute each step of the feature enhancement and data augmentation process 40. The storage unit may be subscriber identity module (SIM), read-only memory (ROM), random-access memory (RAM), CD read-only memory (CD-ROM), magnetic tapes, floppy disk, optical data storage device, etc., but is not limited to these.
In another embodiment of the present disclosure, a transmission module of the action detection device 10 transmits the aforementioned raw signal SRD or the aforementioned spectrogram TFS to an edge device or a server. Then, the edge device or the server performs the aforementioned feature enhancement and data augmentation process 40. The edge device or the server may receive multiple aforementioned raw signals SRD or multiple aforementioned spectrograms TFS from multiple action detection devices 10.
On the other hand, the feature enhancement and data augmentation process 40 may be executed individually or collectively by one or more of the action detection devices 10, the edge device or the server.
In summary, the present disclosure may enhance the feature of the falling action in the spectrogram, and improve discernment of features of different actions, to avoid false positive results. Besides, the present disclosure may increase the amount of falling data via data augmentation algorithm to strengthen model generalization capability, and improve model processing capability for a variety of information to increase the effectiveness of the model.
Those skilled in the art will readily observe that numerous modifications and alterations of the device and method may be made while retaining the teachings of the invention. Accordingly, the above disclosure should be construed as limited only by the metes and bounds of the appended claims.
Chou, Hao-Gong, Chang, Hsuan-Tsung
Patent | Priority | Assignee | Title |
Patent | Priority | Assignee | Title |
10984646, | Jun 23 2016 | Mayo Foundation for Medical Education and Research | Proximity based fall and distress detection systems and methods |
11660023, | Dec 23 2019 | Fall detection systems and methods | |
5442462, | Jun 10 1992 | D V P TECHNOLOGIES LTD 18 HATZOUK STREET | Apparatus and method for smoothing images |
5771318, | Jun 27 1996 | Siemens Aktiengesellschaft | Adaptive edge-preserving smoothing filter |
5799111, | Jun 10 1992 | D.V.P. Technologies, Ltd. | Apparatus and methods for smoothing images |
7272431, | Aug 01 2002 | California Institute of Technology | Remote-sensing method and device |
7567200, | Apr 27 2006 | THE OSTERWEIL FAMILY TRUST | Method and apparatus for body position monitor and fall detect ion using radar |
7916066, | Apr 27 2006 | THE OSTERWEIL FAMILY TRUST | Method and apparatus for a body position monitor and fall detector using radar |
8454528, | Apr 03 2008 | ResMed Sensor Technologies Limited | Non-contact physiologic motion sensors and methods for use |
9103899, | Apr 29 2011 | The Invention Science Fund I, LLC | Adaptive control of a personal electronic device responsive to a micro-impulse radar |
20060020203, | |||
20100272340, | |||
20130051674, | |||
20130176158, | |||
20150379767, | |||
20160063681, | |||
20160377704, | |||
20170131395, | |||
20180061013, | |||
20190139389, | |||
20190216393, | |||
20190219687, | |||
20210158525, | |||
CN109394229, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Feb 26 2021 | CHANG, HSUAN-TSUNG | Wistron Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 055952 | /0281 | |
Feb 26 2021 | CHOU, HAO-GONG | Wistron Corporation | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 055952 | /0281 | |
Apr 19 2021 | Wistron Corporation | (assignment on the face of the patent) | / |
Date | Maintenance Fee Events |
Apr 19 2021 | BIG: Entity status set to Undiscounted (note the period is included in the code). |
Date | Maintenance Schedule |
Apr 02 2027 | 4 years fee payment window open |
Oct 02 2027 | 6 months grace period start (w surcharge) |
Apr 02 2028 | patent expiry (for year 4) |
Apr 02 2030 | 2 years to revive unintentionally abandoned end. (for year 4) |
Apr 02 2031 | 8 years fee payment window open |
Oct 02 2031 | 6 months grace period start (w surcharge) |
Apr 02 2032 | patent expiry (for year 8) |
Apr 02 2034 | 2 years to revive unintentionally abandoned end. (for year 8) |
Apr 02 2035 | 12 years fee payment window open |
Oct 02 2035 | 6 months grace period start (w surcharge) |
Apr 02 2036 | patent expiry (for year 12) |
Apr 02 2038 | 2 years to revive unintentionally abandoned end. (for year 12) |