Research of Machine Learning Method for Specific Information Recognition on the Internet
Main Author: | |
---|---|
Format: | Conference Proceeding |
Language: | English |
Subjects: | |
Online Access: | Request full text |
Summary: | As the resources available on the Internet grow, a large amount of harmful information is spreading and seriously affecting people's work and daily life. Harmful data streams must therefore be recognized and filtered out effectively. After analyzing harmful content in Internet information streams, we present a new method that recognizes specific information by machine learning (ML). Using ML, we extract key information from a number of corpora and learn its part-of-speech (POS) Transfer-Form, with key information matched on identical pronunciation. Furthermore, a testing value for the key information is computed on a real corpus: the average POS transfer probability of the key information measures the likelihood between the matching rules found in the information stream and those learnt from the corpora. From these values, the testing value for the whole real data stream is obtained. The experiment showed that the method is efficient at recognizing certain harmful Internet information. |
---|---|
DOI: | 10.1109/ICMI.2002.1166998 |
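
The summary describes scoring a data stream by the average POS transfer probability around key information. The Python sketch below is one possible reading of that idea, not the authors' implementation. The function names (`learn_transitions`, `score_sentence`, `score_stream`), the bigram window around each key term, and the exact-string matching of key terms are all assumptions made for illustration. The pronunciation-based matching mentioned in the summary is not modelled, and the input is assumed to be already POS-tagged.

```python
from collections import defaultdict


def learn_transitions(tagged_sentences, key_terms):
    """Estimate P(next_tag | tag) from POS bigrams in sentences containing a key term.

    tagged_sentences: list of sentences, each a list of (word, pos_tag) pairs.
    key_terms: set of words treated as key information (assumption: exact match).
    """
    counts = defaultdict(lambda: defaultdict(int))
    for sentence in tagged_sentences:
        if not any(word in key_terms for word, _ in sentence):
            continue  # only learn from sentences that mention key information
        tags = [tag for _, tag in sentence]
        for current_tag, next_tag in zip(tags, tags[1:]):
            counts[current_tag][next_tag] += 1
    probs = {}
    for current_tag, followers in counts.items():
        total = sum(followers.values())
        probs[current_tag] = {t: c / total for t, c in followers.items()}
    return probs


def score_sentence(tagged_sentence, key_terms, probs):
    """Average the learned transition probabilities into and out of each key term.

    Returns 0.0 when the sentence has no key term or no usable bigram.
    """
    values = []
    for i, (word, tag) in enumerate(tagged_sentence):
        if word not in key_terms:
            continue
        if i > 0:  # transition from the previous tag into the key term's tag
            prev_tag = tagged_sentence[i - 1][1]
            values.append(probs.get(prev_tag, {}).get(tag, 0.0))
        if i + 1 < len(tagged_sentence):  # transition out of the key term's tag
            next_tag = tagged_sentence[i + 1][1]
            values.append(probs.get(tag, {}).get(next_tag, 0.0))
    return sum(values) / len(values) if values else 0.0


def score_stream(tagged_sentences, key_terms, probs):
    """Average the per-sentence scores to get one value for the whole stream."""
    if not tagged_sentences:
        return 0.0
    scores = [score_sentence(s, key_terms, probs) for s in tagged_sentences]
    return sum(scores) / len(scores)
```

A stream would then be flagged when `score_stream` exceeds a threshold chosen on held-out data. The summary does not state how such a threshold is set, so that step is left open here.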