Total Pageviews

Tuesday, March 22, 2022

LTRC, IIIT-Hyderabad के कार्पोरा

  LTRC, IIIT-Hyderabad के कार्पोरा 

LTRC, IIIT-Hyderabad द्वारा विकसित कार्पस संबधी विवरण को इस लिंक

https://ltrc.iiit.ac.in/corpus/corpus.html#1 

विवरण कुछ इस प्रकार प्राप्त होता है-

Welcome to the Corpus Search Engine

  1. Running the Corpus Search Engine
  2. Description of the Corpus

Running the Corpus Search Engine (Running the Corpus Search Engine)

This web page is the first step in using the corpus. You can look at the statistics, word frequencies and perform word search for any of the corpora. (Running the Corpus Search Engine)click
For more details see below

II Description of the Corpus

II.1 What is a Corpus

Corpus is a collection of large number of texts in a language. The texts in the corpus of a language are usually chosen from a diverse set of fields so that they are representative of the language. We have corpora of the following 10 languages on this site,
  1. Assamese
  2. Bengali
  3. Hindi
  4. Kannada
  5. Malayalam
  6. Marathi
  7. Oriya
  8. Punjabi
  9. Tamil
  10. Telugu
Size of each corpus is about 3 million words. Texts in each corpus are categorized broadly under aesthetics, mass media, social science, natural science, commerce and translated materials which are further divided into sub-categories. The texts themselves typically consist of a few pages randomly chosen from publications during 1980-1990 in each of these categories. The above corpora were prepared by several oraganizations under funding from MoIT (Ministry of Information Technology formerly Department of Electronics), Government of India.

II.2 Uses of Corpus (TOP)


The major use of corpus is in language analysis and research which can be useful in many applications.
For Example :
  • From the hindi corpus we can find the list of frequently used words in hindi. This can be useful in preparing children's books graded according to vocabulary.
  • In the preparation of a dictionary or a translation system different meanings of a word can be seen and studied in their contexts.
संपूर्ण विवरण के लिए इस लिंक पर जाएँ- 
https://ltrc.iiit.ac.in/corpus/corpus.html#1 

No comments:

Post a Comment