国产精品麻豆欧美日韩ww_欧美日高清视频_亚洲精品成人久久久_久久精品国产清自在天天线

打印本文 打印本文  關閉窗口 關閉窗口  
LOB語料庫
作者:admin  文章來源:本站原創  點擊數  更新時間:2012-11-26  文章錄入:admin  責任編輯:admin



LOB語料庫

 

創建時間:1970年代初

創建單位:英國Lancaster大學和挪威Oslo大學以及Bergen大學

規模層級:100萬詞次

基本情況:研究當代英國英語,與美國英語對比,使用了TAGIT系統,以統計方式建立換算幾率矩陣,提高標注正確率。

The Lancaster-Oslo/Bergen Corpus (LOB) was compiled by researchers in Lancaster, Oslo and Bergen. It consists of one million words of British English texts from 1961. The texts for the corpus were sampled from 15 different text categories. Each text is just over 2,000 words long (longer texts have been cut at the first sentence boundary after 2,000 words) and the number of texts in each category varies (see table below). Further information about the texts can be found in the LOB manual (external link).

This corpus is the British counterpart of the Brown Corpus of American English, which contains texts printed in the same year so that comparison between both varieties could be made.

 

查詢地址:http://icame.uib.no/lob/lob-dir.htm 

 

 

打印本文 打印本文  關閉窗口 關閉窗口