
Wikipedia Content Analysis


Wikipedia is a free online encyclopedia whose interface lets users edit almost all of its content. It is currently considered one of the most popular websites and is credited as the most popular general reference work on the web (Ref.3&5 of web). It was launched on January 15, 2001 by Jimmy Wales and Larry Sanger (Wiki ref). Although it consisted only of articles written in English in its early days, it now covers almost 292 languages, with editions that differ in article content and editing practices. For example, Wikipedia currently has more than 5,260,000 English, 111,000 Hindi, 1,801,000 French, and 1,306,000 Italian articles, and many more (approximately 40 million in 250 languages) …

The webpage lists a panel of featured content, including lists, pictures, portals and topics. It also has a column on today's featured article (TFA), with information on this month's queue, recent, current and potential TFA requests, oddities, the most viewed articles, and articles yet to appear. Featured articles serve as examples for writing other content. They are selected by a panel of Wikipedia editors. These editors are volunteers with rights ranging from basic editing checks to more complex tasks such as removing vandalism, resolving disputes and correcting content. Before an article is confirmed as featured, candidates are reviewed for accuracy, neutrality, completeness and style against the prescribed featured article criteria. As of October 2016 there are 4,854 featured articles out of 5,267,869 English Wikipedia articles. That is approximately 0.1%, or roughly 1 in every 1,085 articles. Of note, a star in the right corner of an article page marks a featured article. Additionally, if the current article is featured in another language, a corresponding star appears in the language list …
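The featured-article share quoted above can be checked directly from the two counts. The following is a minimal sketch in Python; the variable names are illustrative, and small differences from the rounded figures in the text are expected.

```python
# Counts quoted in the text (English Wikipedia, October 2016).
featured_articles = 4854
total_articles = 5_267_869

# Share of articles carrying the featured tag, as a percentage.
share_percent = 100 * featured_articles / total_articles

# Equivalent "1 in every N articles" figure.
one_in = total_articles / featured_articles

print(f"{share_percent:.3f}% of articles are featured")
print(f"about 1 in every {one_in:.0f} articles")
```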

This method significantly outperforms more complex methods for article quality assessment. In brief, the word count discrimination rule classifies articles with more than 2000 words as featured and articles with fewer as non-featured. This method yielded an accuracy of 0.96 on an unbalanced corpus. However, the accuracy achieved varied across subject areas: it was lower for articles in the biological sciences and higher for history. A study by Stvilia measures information quality dynamics at both macro and micro levels (ref). The authors postulated seven IQ metrics that can easily be tested on representative Wikipedia content. They further added statistical characterization, content construction, process metadata and social context of Wikipedia articles. The parameters include authority/reputation, completeness, complexity, informativeness, consistency, currency and
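The word count discrimination rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: words are counted by splitting on whitespace, an article with exactly 2000 words is treated as non-featured (the text does not say), and the function names and toy corpus are hypothetical rather than taken from the cited study.

```python
# Threshold quoted in the text; everything else here is an assumption.
FEATURED_WORD_THRESHOLD = 2000


def word_count(text: str) -> int:
    """Count words by splitting on whitespace (a simplifying assumption)."""
    return len(text.split())


def classify_by_word_count(text: str, threshold: int = FEATURED_WORD_THRESHOLD) -> str:
    """Label an article 'featured' if it has more words than the threshold."""
    return "featured" if word_count(text) > threshold else "non-featured"


def accuracy(texts, labels, threshold: int = FEATURED_WORD_THRESHOLD) -> float:
    """Fraction of articles whose predicted label matches the true label."""
    correct = sum(
        classify_by_word_count(text, threshold) == label
        for text, label in zip(texts, labels)
    )
    return correct / len(labels)


# Toy corpus (made up for illustration): two long and two short articles,
# one of which is deliberately mislabeled by the rule.
toy_texts = ["word " * 3000, "word " * 2500, "word " * 500, "word " * 2100]
toy_labels = ["featured", "featured", "non-featured", "non-featured"]

toy_accuracy = accuracy(toy_texts, toy_labels)
print(f"accuracy on toy corpus: {toy_accuracy:.2f}")
```

Passing a different `threshold` makes it possible to explore how the cut-off might be tuned per subject area, which is one way to read the variation between biological sciences and history noted above.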
