开放平行文本
Thread poster: henryhu
henryhu
henryhu
Local time: 22:06
English to Chinese
Jan 20, 2014

最近在寻找双语平行文本,似乎资源不是很多。是不是平行文本多是个人自己积累的?如果创建一个平行语料库,译员共享那些可以公开的双语文本,是一件很有意义的事情。

 
James_xia
James_xia  Identity Verified
China
English to Chinese
+ ...
受限 Jan 20, 2014

henryhu wrote:

最近在寻找双语平行文本,似乎资源不是很多。是不是平行文本多是个人自己积累的?如果创建一个平行语料库,译员共享那些可以公开的双语文本,是一件很有意义的事情。


想法挺好,但文本的保密性需要是所有外包商都比较在乎的事~:)

[Edited at 2014-01-20 14:23 GMT]


 
Shirley Lao
Shirley Lao  Identity Verified
Taiwan
Local time: 22:06
Member (2007)
English to Chinese
+ ...
Try some public bilingual and multilingual thesauri, glossaries and terminology databases Jan 20, 2014

Bilingual parallel-aligned texts are not commonly found between Chinese and English, except for a few bilingual databases like Lawbank at http://db.lawbank.com.tw/FLAW/
However, you have to align the Chinese and the English texts manually.

There are some bilingual or multilingual manuals or guidelines (like the ICD) available in some international organizations but you also have to alig
... See more
Bilingual parallel-aligned texts are not commonly found between Chinese and English, except for a few bilingual databases like Lawbank at http://db.lawbank.com.tw/FLAW/
However, you have to align the Chinese and the English texts manually.

There are some bilingual or multilingual manuals or guidelines (like the ICD) available in some international organizations but you also have to align the Chinese and the English texts manually.

It is easier to find public bilingual and multilingual thesauri, glossaries and terminology databases from some international organizations like the WHO, the UN, the IMF, and so forth.

Examples are:

United Nations Multilingual Terminology Database (in six languages)
http://unterm.un.org/

UNESCO Thesaurus
http://termweb.unesco.org/
http://databases.unesco.org/thesaurus/other.html

International Monetary Fund Multilingual Glossaries (in nine languages)
http://www.imf.org/external/np/term/eng/index.htm


henryhu wrote:

最近在寻找双语平行文本,似乎资源不是很多。是不是平行文本多是个人自己积累的?如果创建一个平行语料库,译员共享那些可以公开的双语文本,是一件很有意义的事情。
Collapse


 
Phil Hand
Phil Hand  Identity Verified
China
Local time: 22:06
Chinese to English
不少 Jan 21, 2014

https://www.chinadialogue.net/

法律文本,网上有很多(不算可靠,但不能说没有)

很多政客的发言稿也可以在网上搜索得到

像财新网每月都有英文报道,与中文报道基本一致

书呢,经典书都有:ctext.org

你说的平行具体是什么意思?


 
Jinhang Wang
Jinhang Wang  Identity Verified
China
Local time: 22:06
English to Chinese
+ ...
parallel Jan 21, 2014

Phil Hand wrote:

https://www.chinadialogue.net/

法律文本,网上有很多(不算可靠,但不能说没有)

很多政客的发言稿也可以在网上搜索得到

像财新网每月都有英文报道,与中文报道基本一致

书呢,经典书都有:ctext.org

你说的平行具体是什么意思?


用“对照”更好些。


 
pkchan
pkchan  Identity Verified
United States
Local time: 10:06
Member (2006)
English to Chinese
+ ...
Linguistic Data Consortium=双语平行语料库 Jan 21, 2014

Linguistic Data Consortium: Home
www.ldc.upenn.edu/‎翻譯這個網頁
The Linguistic Data Consortium (LDC) creates, collects and distributes speech and text databases, annotated corpora, treebanks, lexicons and other linguistic ...
ChineseLDC.Org
www.chinip.csdb.cn/‎轉為繁體網頁
三语
... See more
Linguistic Data Consortium: Home
www.ldc.upenn.edu/‎翻譯這個網頁
The Linguistic Data Consortium (LDC) creates, collects and distributes speech and text databases, annotated corpora, treebanks, lexicons and other linguistic ...
ChineseLDC.Org
www.chinip.csdb.cn/‎轉為繁體網頁
三语语料库(汉英部分)和面向奥运 的中英日三语语料库进行系统训练。 ... 中文语言资源联盟,英文译名Chinese Linguistic Data Consortium,缩写为CLDC。 CLDC ...
Linguistic Data Consortium--LDC - 语料库语言学在线
www.corpus4u.org › ... › 语料库语言学理论前沿‎轉為繁體網頁
2005年6月19日 - 1 篇文章 - ‎1 位作者
Linguistic Data Consortium The Linguistic Data Consortium is an open consortium of universities, companies and government research ...
Collapse


 
henryhu
henryhu
Local time: 22:06
English to Chinese
TOPIC STARTER
平行语料应做到双语逐句对应 Jan 22, 2014

谢谢指点,我正逐个体验各位给出的资源。我理解的双语平行语料应主要由逐句对应的双语构成,这与通常的直译习惯一样,也是构成翻译记忆(TM)的基本单位。下面的链接就是一个例子,查看双语文章时是逐段对应,但实际翻译和保存时,都是双语句-句对应:

http://wohub.org/Jack/translation/俄罗斯爆炸案引发冬奥会安保忧虑

好用的双语库,不但要方便检索,而且要方便使用,成为能够在翻译时直接使用的记忆库。就算有很多商业翻译都是不能公开的,也还有很多可以公开的双语语料,我打算用这些可以公开获取的资源建立初步的语料库。最好的方式是形成一个社区,翻译者、语言学习者、研究者都可以公开自己的资源,个人维护语料而同时又共享资源。


 
Phil Hand
Phil Hand  Identity Verified
China
Local time: 22:06
Chinese to English
不太相信 Jan 22, 2014

henryhu wrote:

好用的双语库...成为能够在翻译时直接使用的记忆库。


这是白日梦吧!第一:很多已有的资源质量很低。第二:就算质量合格,也不一定适用于新的文本,一句话根据上下文的不同可以有很不一样的意思。

是我的话,觉得引进别人的/公开的翻译库只能成为一个干扰因素,起到反作用!一般来说,只有专业性很强的翻译库才有用,比如欧盟各语种的官方法律文本。其他的没有太大的意义。


 
Shirley Lao
Shirley Lao  Identity Verified
Taiwan
Local time: 22:06
Member (2007)
English to Chinese
+ ...
The Terminology Coordination Unit of the European Parliament Jan 22, 2014

You can also refer to some terminology resources at http://termcoord.wordpress.com/



henryhu wrote:

最近在寻找双语平行文本,似乎资源不是很多。是不是平行文本多是个人自己积累的?如果创建一个平行语料库,译员共享那些可以公开的双语文本,是一件很有意义的事情。


 
Shirley Lao
Shirley Lao  Identity Verified
Taiwan
Local time: 22:06
Member (2007)
English to Chinese
+ ...
The new website for the TermCoord Jan 22, 2014

http://termcoord.eu/




Shirley Lao wrote:

You can also refer to some terminology resources at http://termcoord.wordpress.com/



henryhu wrote:

最近在寻找双语平行文本,似乎资源不是很多。是不是平行文本多是个人自己积累的?如果创建一个平行语料库,译员共享那些可以公开的双语文本,是一件很有意义的事情。


 


To report site rules violations or get help, contact a site moderator:


You can also contact site staff by submitting a support request »

开放平行文本






Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

Buy now! »
CafeTran Espresso
You've never met a CAT tool this clever!

Translate faster & easier, using a sophisticated CAT tool built by a translator / developer. Accept jobs from clients who use Trados, MemoQ, Wordfast & major CAT tools. Download and start using CafeTran Espresso -- for free

Buy now! »