
Parallel texts extraction from the web : Master's thesis in Information Technology / Lê, Quang Hùng; Lê, Anh Cường
Author : Lê, Quang Hùng; Lê, Anh Cường
Publisher : Trường Đại học Công nghệ
Year of publication : 2010
Physical description : 48 p. + CD-ROM
Subjects : 1. Computer networks. 2. Text copying. 3. Bilingual texts. 4. Website. 5. Thesis.
Detailed information
Abstract : | Master's thesis in Information Technology -- Trường Đại học Công nghệ. Đại học Quốc gia Hà Nội, 2010. Electronic Resources. The first chapter introduces parallel corpora and their role in NLP applications, then presents current studies, the objectives of the thesis, and its contributions, and closes with a brief description of the thesis structure. Chapter 2 - Related works: introduces the studies most closely related to this work. Chapter 3 - The proposed approach: presents the proposed model, including its general architecture and how the structural and content-based features are designed and estimated. Chapter 4 - Experiment: evaluates the effectiveness of the proposed method for extracting parallel texts from the Web and reports the performance of the proposed method and the baseline. Chapter 5 - Conclusion and Future works: presents final conclusions about the work as a whole and an evaluation of the results in particular, followed by suggestions for possible future work. Finally, the references list research closely related to this work. |
Source data information
| Library | Call number | Source data |
|---|---|---|
| | | https://repository.vnu.edu.vn/handle/VNU_123/41961 |