後設資料

從資料特性思考傳播內容Metadata之建置

序號 7
刊名 2
年份 2006
出版月份 9月
卷期 Vol.10 No.2
作者 郭良文;林素甘;曾欣怡
作者任職單位 交通大學傳播研究所、傳播與科技學系教授;淡江大學資訊與圖書館學系助理教授;數位典藏國家型計畫聯合目錄計畫助理
摘要

本文針對傳播資料的概念特性與結構特質,探討純文字、動態影像、靜態影像與聲音等四大類型傳播資料 metadata 之建置。一般常使用的都柏林核心集(Dublin Core),所運用的十五項 metadata 欄位,乃針對廣泛適用的資料所設計,雖然涵蓋項目多、流通廣泛,但其欄位卻無法滿足動態性與複雜性高的傳播媒體資料內涵。本文針對傳播媒體資料特性,提出建置 metadata 的考量因素。就基本概念的部分而言,本文說明傳播資料的複雜性、聚合性、即時性、連續性、互文性、超文本特性與多義性。其次,本文亦從資料結構特質的角度,針對傳播領域中的報紙新聞(純文字)、電影紀錄片(動態影像)、報紙新聞版面(靜態影像)與廣播(聲音)等 metadata 的建置,援引案例進行討論、並提出不同類型傳播資料可考慮建置之欄位,以捕捉傳播資料的特質。

關鍵字 傳播資料後設資料數位典藏詮釋資料
頁碼 122-141
全文 全文下載
DOI
Review
Title Rethinking Metadata Format from the Data of Media and Communication
Author Liang-wen Kuo; Su-kan Lin; Chin-i Tseung
Author's title Professor, Graduate Institute of Communication Studies, National Chiao Tung University; Assistant Professor, Department of Information and Library Science, Tamkang University; Research Assistant, Union Catalog of National Digital Archives Program
Abstract

This paper discusses the fundamental concepts and structures of communication data in terms of the metadata format of the following four types: text, motion images, still images and sound. Due to Dublin Core's lack of account on the dynamic aspects of media and communication data, an analysis of the nature of communication data is proposed. Seven characteristics of fundamental concepts are discussed: complexity, convergence, spontaneity, continuity, intertextuality, hypertextuality and polygamy. Accompanied by examples, each of the above mentioned data type is also explained in terms of the considering factors of metadata format of each of media types.

Keywords Communication datadigital archivesMedia metadataMetadata
fulltext 全文下載
DOI

臺灣地區中文網頁自動辨別日期之研究

序號 7
刊名 大學圖書館
年份 2011
出版月份 3月
卷期 Vol.15 No.1
作者 邰文暉;吳政叡
作者任職單位 輔仁大學圖書資訊研究所研究生;輔仁大學圖書資訊系專任教授
摘要

隨著網際網路的日益普及,線上資源也越來越豐富,要精準的為讀者找出有用的資訊,前提是必須能夠精準的分析網頁內容。日期是網頁Metadata中的重要欄位,由於臺灣在日期格式的書寫習慣,使得中文網頁的日期形式較為複雜,因而增加了自動著錄網頁創造(或修改)日期時的困難。本研究的主要目的是針對網頁日期部分做深入的分析研究,以便能夠更精確的利用中文網頁中的日期欄位進行檢索利用。 本研究以隨機抽樣方式來抓取繁體中文網頁,分析及統計樣本網頁中出現的日期格式,並使用正規表示式來自動抓取正確的網頁日期,最後計算出正確率。透過此研究可以了解在進行中文網頁日期欄位自動辨識時可能會遭遇到的困難,並評估自動擷取繁體中文網頁日期欄位的可行性。實驗結果顯示,有日期資料網頁的正確率約為61%,沒有日期資料網頁的部分約為62%。有日期資料網頁的平均誤差年約為0.62年,且83.4%的網頁能精準預測其年份(即誤差年為0),因此雖然本研究的成果尚未能完全取代人工,但若應用得宜仍然可以提高網

關鍵字 元資料後設資料日期格式網頁日期自動日期辨別詮釋資料
頁碼 132-143
全文 全文下載
DOI 10.6146/univj.2011.15-1.07
Review
Title A Study of Auto Extraction of Dates from Chinese Web Pages in Taiwan Area
Author Wen-Hui Tai;Cheng-Juei Wu
Author's title Graduate Student, Department of Library and Information Science, Fu-Jen University; Professor, Department of Library and Information Science, Fu-Jen University
Abstract

Online resources have become more plentiful nowadays, thanks to the popularization of Internet services. In order to achieve accurate search results for the users, it is necessary to analyze web pages precisely. ‘Date’ is one of the most important fields of metadata in web pages. Due to the special date displaying formats using in Taiwan, it has made the automatic cataloging on date for webpage more difficult. The major purpose of this research is to thoroughly analyze different types of date displaying formats applied to Chinese web pages. These findings will be used to increase the precision on the date auto extraction of web pages. The procedures of experiment are as follows. Firstly, samples were randomly selected from Internet. Secondly, the statistic analysis on the date displaying format of each web pages was conducted. Lastly, Regular Expression was used to abstract the dates of each web page, while the accuracy ratio was also calculated. The difficulties and feasibility of auto date extraction are discussed in the end of this work. The results of the experiment suggest the accuracy ratio of web pages with date information is 61%. On the other hand, the accuracy ratio of web pages without date information is 62%. The average error of those web pages with date information is 0.62 year. The results of this research suggest that the auto date extraction mechanism can be used to improve the efficiency on webpage information retrieval.

Keywords Auto date extractionDate formatMetadataWebpage date
fulltext 全文下載
DOI 10.6146/univj.2011.15-1.07
訂閱文章