WordNet In (Bouras and Tsogkas, 2012), the importance of WordNet hypernymy relationships is highlighted in enhancing K-means clustering algorithm. Similar to the procedure prior to clustering process, an aggregate hypernym graph is generated to label a resulting cluster. The effect of other relationships, on the clustering performance, is not studied. Another Word-Net-based clustering method is presented in (Fodeh et al., 2011), where the role of nouns, especially polysemous and synonymous nouns in document clustering is investigated. A subset of core semantic features is chosen from disambiguated nouns through an unsupervised information gain measure. These core semantic features lead to admissible clustering results. The effect of …show more content…
(Motazedi et al., 2009) and (Lesk, 1986) introduce a bilingual translation machine called PEnTrans. A novel WSD method is proposed based on Lesk algorithm (Sarrafzadeh et al., 2011). In order to English to Persian translation, gloss, synset and ancestors in the radius of two hypernyms are extracted from WordNet, for each word’s sense. Also the POS and WSD tags are included (extracted from extended WordNet). The authors developed a bilingual dictionary by translation WordNet senses into Persian. For Persian to English translation a combination of knowledge, rule and corpus based approaches are utilized and also grammatical roles of words are considered in the WSD. 5. SEMANTIC ANALYSIS USING FARSNET 5.1 FarsNet Lexical Ontology The ontology is an abstract model of real world that demonstrates the concepts and the relations among them in a specific domain. This conceptual knowledge base has vital applications in semantic web, search engines, natural language processing, information retrieval, etc. The ontologies can be produced manually or in a semi-automatic manner by the ontology engineering tools and knowledge acquisition methods (Darrudi et al., 2004). FarsNet is the first Persian WordNet (Shamsfard et al., 2010) which has been produced in NLP laboratory of Shahid Beheshti University, Iran. The first version of FarsNet includes 18000 Persian words organized in about 10000 synsets. The words are in three syntactic
I have created a table to help better represent the lexicon difference which is seen
Another problem in the process of translation of political concepts from English into Persian is the ambiguities lying within the word. Therefore, the real challenge for the translator is how to deal with them and finally how to transfer the same meaning without decreasing the influence and impact hidden behind the
The knowledge base consists of information regarding the user behavior and ADL that include self-care tasks, household duties, and personal management actions. It specifies the task to be carried out and the actions to be performed. The relational database presents a natural association between the two elements of the decision support system, and the use of the database to additionally represent a novel approach to knowledge engineering (KE) for planning.
If we search for an answer to a question in a typical search engines such as Google, Bing, or Yahoo, it usually gives us relevant pages based on the key words in the query. We may need to follow several links or pages to reach a document providing a relevant answer. If we can store such search pathways to an answer for a given user query and reuse it for future searches it may speed up this process. Our question answering system motivated by reuse of prior web search pathways to yield an answer a user query. We represent queries and search pathways in a semi-structured format that contains query terms and referenced classes within a realm based ontology. First part of my research is to build a system that can automatically tag the terms in a user query to relevant classes from a domain-based ontology. The other part is to rank the prior searches (contains user queries, assigned classes, and search pathways) stored in the database based on the
According to Sowa (2005), the concept of lexicon forms the connection between language and knowledge. In other words, the specific words used to express knowledge are collectively known as the lexicon of that language. In this way, different languages have different lexicons. However, the connection among all languages is that there is a central lexicon with the same function throughout the languages to create mutual understanding and knowledge sharing among speakers of that language.
Made out of Web locales interconnected by hyperlinks, the World Wide Web can be seen as an enormous yet tumultuous wellspring of data. For choice making numerous business applications need to rely on upon web keeping in mind the end goal to total data from various sites. Programmed information extraction assumes an essential part in preparing results gave via internet searchers in the wake of presenting the question by client. presently days "site" has begun keeping more significance to our life. without which it is hard to oblige even one day .so it has turned into the need that the site ought to be more enlightening and alluring . be that as it may, the sites are created and just grew purposely or unwittingly
A word having only one meaning is called monosemantic, for example, hydrogen, and molecule. Such words are few in number, while polysemy is the case when two related words happen to share the same written form. In spite of the clearness of the differences between these two concepts, there are many examples where it is not clear whether a word should be analyzed as polysemous or monosemous, and no absolute criteria have ever been proposed which will successfully differentiate between them. The analysis of a word as monosemous or polysemous may well need to be relativized to a specific level of lexical abstraction, for each such level there are only two logical possibilities: either the word’s meaning can be adequately represented by a single gloss, in which case it must be considered monosemous, or it cannot, in which case it is polysemous. The divergence between monosemy and polysemy is therefore not a false one, since monosemy and polysemy name the only two logical possibilities for the structure of a lexical category on a given level of lexical
Two approaches were used in the study: the first approach is closed vocabulary or word-category lexica which was used in many previous studies. The second approach is the open- vocabulary or differential language analysis (DLA). It is the technique generated by the scientists of this study.
The results of the study showed that none of the translators have done perfectly in conveying the concepts mentioned above. However, generally speaking, it can be said that the second translator, Mohammad Najafi, has done a better job compared to other translators in conveying the language of the story. It may be possible that the two other translators have done a better job compared to Mohammad Najafi in conveying the content of the story rather than its language. However, 15 examples of each linguistic features are not enough to make a conclusion concerning which translator has done a great job in conveying a special linguistic features. Finally, it suggested that further research should be done to investigate in other foreign languages considering the linguistic features mentioned by Costello to provide a general assessment of the translation of the study in a particular
The lexical study is used in linguistics to refer to the vocabulary of a language. It encompasses the words and their formation, the type of words used, and the deviation from the norms of standard variety. It would be studied what type of words – subject specific or lexical sets, context matching or deviated, simple or complex – have been used. Both Aravind Adiga and Mohsin Hamid use a number of linguistic deviations for various purposes; to maintain readers’ attention in con (text); to express cultural emotions; to capture sociolinguistic reality; to present the things on the pages; and to create something new, honest and attractive. But one thing is clear that these linguistic moves have made their descriptive and narrative lively, easy,
The rest of the chapter is organized as follows. The related works based on tree data model and digraph data model are reviewed in Section 2.2 and 2.3 respectively. Subsequently, the works done on the result ranking are reviewed in Section 2.4. Also, other related works in XML keyword search are reviewed in Section 2.5. In Section 2.6, the approaches utilizing statistics of
The present tendency for developing an ontology-based data management system (DMS) is to take advantage of on attempts made to design a preceding well-established DMS (a reference system). The method aggregates to bring out from the mention of DMS a section of schema applicable to the new application requirements – a module –perhaps personalizing it with additional-conditions w.r.t. the application under building, and then directing a dataset using the resulting schema. In this project, we expand the current denotations of modules and we inaugurate novel effects of robustness that furnish means for examine easily that a robust module-based DMS develops safely w.r.t. both the
The million dollar question is: what is the Semantic Web? The Semantic Web is not a completely new web that is going to replace the current web, it is simply and extension from the present web. The idea of Web 3.0 is for the information in the web to be understood and recognized by the computer. Today the web is a big global storage that stores files for humans to read, not computers, humans. This in what the Semantic Web is going to change by allowing computers to read and understand the information and data stored in the web.
Ontology contains a set of concepts and relationship between concepts, and can be applied into information retrieval to deal with user queries.
Arabic Text applications use stemming as a preprocessing stage. The problems raised with stemming are introduced in this paper. Moreover, different stemming applications is mentioned. Additionally, stemming techniques are presented with discussion about each of them.