Hindi-Urdu PropBank
Link: http://verbs.colorado.edu/propbank/framesets-hindi/A-v.html
Example for the verb "आना" (to come):
Predicate: A
Roleset id: A.01, to come
A.01: intransitive
Roles:
Arg1: entity in motion / 'comer'
Arg2-gol: destination
Example:
साल भर यहाँ श्रद्धालू आते रहते हैं ("Devotees keep coming here all year round")
Argm-tmp: साल भर (drel: k7t)
Argm-loc: यहाँ (drel: k7p)
Arg1: श्रद्धालू (drel: k1)
Rel: आते रहते हैं
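Example (from corpus):
पाकिस्तान अधिकृत कश्मीर में ८ अक्तूबर को RELPRO आए भूकंप से मची तबाही के बाद ... ("After the devastation caused by the earthquake that struck Pakistan-administered Kashmir on 8 October ...")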
Argm-loc: पाकिस्तान अधिकृत कश्मीर में (drel: k7p)
Argm-tmp: ८ अक्तूबर को (drel: k7t)
Arg1: RELPRO-> भूकंप
Rel: आए
Arg1: भूकंप (drel: k1)
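The annotation lines in the examples above follow a regular "label: span (drel: relation)" pattern, so they can be read into structured records. The following is only a minimal sketch: the names `Annotation`, `LINE_RE` and `parse_annotation` are hypothetical, and the line format is inferred from the examples on this page, not from an official PropBank file specification.

```python
import re
from typing import NamedTuple, Optional


class Annotation(NamedTuple):
    label: str            # PropBank label, e.g. "Arg1", "Argm-tmp", "Rel"
    text: str             # the annotated span from the sentence
    drel: Optional[str]   # dependency relation from the treebank, if listed


# "Label: text" with an optional trailing "(drel: xyz)".
# This pattern is an assumption based on the examples shown on this page.
LINE_RE = re.compile(
    r"^(?P<label>[^:]+):\s*(?P<text>.*?)\s*(?:\(drel:\s*(?P<drel>[^)]+)\))?$"
)


def parse_annotation(line: str) -> Annotation:
    """Split one annotation line into label, span and dependency relation."""
    match = LINE_RE.match(line.strip())
    if match is None:
        raise ValueError(f"not an annotation line: {line!r}")
    drel = match["drel"].strip() if match["drel"] else None
    return Annotation(match["label"].strip(), match["text"], drel)


if __name__ == "__main__":
    example = [
        "Argm-tmp: साल भर (drel: k7t)",
        "Argm-loc: यहाँ (drel: k7p)",
        "Arg1: श्रद्धालू (drel: k1)",
        "Rel: आते रहते हैं",
    ]
    for line in example:
        print(parse_annotation(line))
```

Lines without a dependency relation, such as the `Rel` line, come back with `drel` set to `None`.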
Roleset id: A.03, arrive
A.03: intransitive
Roles:
Arg0: entity that arrives
Arg2-gol: destination
Arg2-sou: source (coming from)
Example (from corpus):
किशोरी हरिद्वार से दिल्ली आई थी ("Kishori had come to Delhi from Haridwar")
Arg0: किशोरी (drel: k1)
Arg2-sou: हरिद्वार से (drel: k5)
Arg2-gol: दिल्ली (drel: k2p)
Rel: आई थी
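The two intransitive rolesets above can also be viewed as a small data structure: each roleset has an id, a gloss and a set of numbered roles. The sketch below is illustrative only. The `Roleset` class and its `unlicensed` method are hypothetical names, and treating every `Argm-*` modifier and `Rel` as always permitted is an assumption made for the example, not a rule stated by the project.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Roleset:
    roleset_id: str                 # e.g. "A.01"
    gloss: str                      # e.g. "to come"
    frame_type: str                 # e.g. "intransitive"
    roles: Dict[str, str] = field(default_factory=dict)

    def unlicensed(self, labels: List[str]) -> List[str]:
        """Return labels that are not core roles of this roleset.

        Assumption for this sketch: modifiers (Argm-*) and the predicate
        itself (Rel) are always allowed.
        """
        return [
            label for label in labels
            if label not in self.roles
            and not label.startswith("Argm")
            and label != "Rel"
        ]


# Role descriptions copied from the frameset entries quoted above.
A_01 = Roleset("A.01", "to come", "intransitive", {
    "Arg1": "entity in motion / 'comer'",
    "Arg2-gol": "destination",
})
A_03 = Roleset("A.03", "arrive", "intransitive", {
    "Arg0": "entity that arrives",
    "Arg2-gol": "destination",
    "Arg2-sou": "source (coming from)",
})

if __name__ == "__main__":
    # Labels used in the A.03 corpus example.
    print(A_03.unlicensed(["Arg0", "Arg2-sou", "Arg2-gol", "Rel"]))
    # The same labels checked against A.01, which has no Arg0 or Arg2-sou.
    print(A_01.unlicensed(["Arg0", "Arg2-sou", "Arg2-gol", "Rel"]))
```

A check like this shows why the sense distinction matters: the labels in the third example fit A.03 but not A.01.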
Predicate: A_LV
Roleset id: A.02
A.02: light verb
Roles:
Argm: predicating noun
For the full list, go here:
http://verbs.colorado.edu/propbank/framesets-hindi/
Source website details:
http://verbs.colorado.edu/propbank/framesets-hindi/A-v.html
The Hindi-Urdu PropBank Project
The development of computational resources will provide a major boost to Natural Language Processing applications for Hindi-Urdu. The Hindi-Urdu PropBank project is part of an effort to create a large-scale resource that can be used for training machine learning algorithms and for carrying out corpus linguistics studies. This is a collaborative effort to build a 'Multi-representational and multi-layered Treebank for Hindi-Urdu'. It is a pioneering project for a language that has been resource-poor from the computational point of view.
The PropBank project will result in an annotated corpus of 400K words for Hindi and 200K words for Urdu. Predicate-argument structure will be annotated for each verb in the corpus, and a detailed lexicon will also be created for Hindi verb frames. The annotation of predicate-argument structure will be preceded by a thorough analysis of linguistic phenomena such as complex predicates. The guidelines that we develop will merge a theoretical understanding of Hindi grammar with computational requirements.
Description of the project:
- Task: Building a PropBanked Hindi-Urdu corpus. The data consists of Hindi newswire articles that have been treebanked using dependency relations. The Hindi-Urdu PropBank project is part of a larger effort to build a multi-layered and multi-representational resource for Hindi.
- Project status: The project is currently working on creating PropBank annotation for 150K words from the Hindi Dependency Treebank.
Project Wiki:
The Hindi PropBank project shares a wiki page with the other research teams from University of Washington, Columbia University, University of Massachusetts at Amherst and IIT-Hyderabad, India.
Frameset Files:
The Hindi PropBank frameset files can be found here.
PropBank Team
Principal Investigators:
Martha Palmer, University of Colorado, Boulder
Bhuvana Narasimhan, University of Colorado, Boulder
Graduate Students:
Ashwini Vaidya, University of Colorado, Boulder
Consulting Postdoctoral Fellow:
Archna Bhatia, PhD, University of Illinois, Urbana-Champaign
Programming Support:
Jinho D. Choi, University of Colorado, Boulder
Jena Hwang, University of Colorado, Boulder
Annotators:
Nithin Singh Mohan
Sanmati Suresh
Omkar Rao
Smitha Kamat
Mrigendra Singh
Neeraj Sharma
Computational Language and Education Research, University of Colorado Boulder