This paper is based on research carried out within our project on the General Internet Corpus of Russian (Geekrya). The need for large-scale corpora collected automatically from the Web was first recognized in computational linguistics. More recently, the scarcity of data in "manually built" corpora has led traditional linguistic research to recognize the importance of Web-derived corpora as well.