preview

Is Text Mining Different Than Data Mining?

Better Essays

2. (10 pts) How is text mining different than data mining?
Text mining is a process which collects information and knowledge from large amounts of unstructured data sources. When I say unstructured data sources, I am talking about Pdf files, Word documents, XML files, text excerpts etc… Text mining collects information from text. Text mining is different than data mining because data mining is a process which collects information and knowledge from large amounts of structured data sources. Structured data sources means that data are classify by categorical, ordinal, or continuous variables, and the goal of data mining is to transform data into model or understandable structure after collecting information from data. However they are …show more content…

But by doing that, NLP met some challenges in achieving true NLP capabilities.
First of all, it is difficult to mark up terms in a text as corresponding to a particular part of speech, means that some part of speech such as nouns, verbs, adjectives, adverbs, etc. depends not only on the definitions of the terms but depend also on the context in which it is used.
Secondly, some words have different meaning and choosing the good meaning which will match with the sentence or context is a real challenge.
Thirdly some written language used words boundaries which is difficult for the text-parsing task to identify them. As example we have Japanese language, Chinese language etc. However it is a challenge also for analyzing spoken language.
Fourthly, the grammar also presents some ambiguity and it is difficult to choose the good structure.
Fifthly grammatical error, accent and vocal impediments in speech present a difficult task for the language processing. And finally the speech acts can be a challenge if the sentence does not contain enough information.
4. (10 pts) a. How is web mining different than text mining?
The definition of web mining from the book is the process to found useful information from web data, which are expressed in the form of textual, linkage, or usage information. These data that web mining collects can be beneficial for enterprise because information or data that web mining

Get Access