Total Pageviews

Saturday, January 4, 2020

हिंदी-उर्दू प्रापबैंक

हिंदी-उर्दू प्रापबैंक
लिंक-http://verbs.colorado.edu/propbank/framesets-hindi/A-v.html
"आना' क्रिया का उदाहरण-

Predicate: A

Roleset id: A.01 , to come

A.01: intransitive

Roles:

        Arg1entity in motion/ 'comer'
        Arg2-goldestination

Example:

        साल भर यहाँ श्रद्धालू आते रहते हैं

        Argm-tmp: साल भर (drel: k7t)
        Argm-loc: यहाँ (drel: k7p)
        Arg1: श्रद्धालू (drel: k1)
        Rel: आते रहते हैं

Example:

        from corpus
        पाकिस्तान अधिकृत कश्मीर में ८ अक्तूबर को RELPRO आए भूकंप से मची तबाही के बाद ...

        Argm-loc: पाकिस्तान अधिकृत कश्मीर में (drel: k7p)
        Argm-tmp: ८ अक्तूबर को (drel: k7t)
        Arg1: RELPRO-> भूकंप
        Rel: आए
        Arg1: भूकंप (drel: k1)

Roleset id: A.03 , arrive

A.03: intransitive

Roles:

        Arg0entity that arrives
        Arg2-goldestination
        Arg2-sousource (coming from)

Example:

        from corpus
        किशोरी हरिद्वार से दिल्ली आई थी

        Arg0: किशोरी (drel: k1)
        Arg2-sou: हरिद्वार से (drel: k5)
        Arg2-gol: दिल्ली (drel: k2p)
        Rel: आई थी


Predicate: A_LV

Roleset id: A.02 ,

A.02: light verb

Roles:

        Argmpredicating noun
पूरी सूची के लिए यहाँ जाएँ-
http://verbs.colorado.edu/propbank/framesets-hindi/
स्रोत वेबसाइट विवरण-
http://verbs.colorado.edu/propbank/framesets-hindi/A-v.html

The Hindi-Urdu PropBank Project


The development of computational resources will provide a major boost to Natural Language Processing applications for Hindi-Urdu. The Hindi-Urdu PropBank project is part of an effort to create a large scale resource that can be used for training machine learning algorithms and for carrying out corpus linguistics studies. This is a collaborative effort to build a 'Multi-representational and multi-layered Treebank for Hindi-Urdu'. This is a pioneering project in a language that has been resource-poor from the computational point of view.
The PropBank project will result in an annotated corpus of 400K words for Hindi and 200K words for Urdu. The Predicate-argument structure will be annotated for each verb in the corpus and a detailed lexicon will also be created for Hindi verb frames. The annotation of predicate-argument structure will be preceded by a thorough analysis of linguistic phenomena such as complex predicates. The guidelines that we will develop will merge a theoretical understanding of Hindi grammar with computational requirements.


Description of the project:

  • Task: Building a PropBanked Hindi-Urdu corpus. The data consists of Hindi newswire articles that have been treebanked using dependency relations. The Hindi-Urdu PropBank project is part of a larger effort to build a multi-layered and multi-representational resource for Hindi.
  • Project status: The project is currently working on creating PropBank annotation for 150K words from the Hindi Dependency Treebank.

Project Wiki:

The Hindi PropBank project shares a wiki page with the other research teams from University of Washington, Columbia University, University of Massachusetts at Amherst and IIT-Hyderabad, India.

Frameset Files:

The Hindi PropBank frameset files can be found here.

PropBank Team

Principal Investigators:
Martha Palmer, University of Colorado, Boulder
Bhuvana Narasimhan, University of Colorado, Boulder

Graduate Students:
Ashwini Vaidya, University of Colorado, Boulder

Consulting Postdoctoral Fellow:
Archna Bhatia, PhD, University of Illinois, Urbana-Champaign

Programming Support:
Jinho D. Choi, University of Colorado, Boulder
Jena Hwang, University of Colorado, Boulder

Annotators:
Nithin Singh Mohan
Sanmati Suresh
Omkar Rao
Smitha Kamat
Mrigendra Singh
Neeraj Sharma

Computational Language and Education ResearchUniversity of Colorado Boulder

No comments:

Post a Comment