JOURNAL ARTICLE

Data Intensive Infrastructure

Abstract

Summary form only given, as follows. The complete presentation was not made available for publication as part of the conference proceedings. Modern research intensive organisations face challenges storing and preserving the increasing amounts of data generated by scientific instruments and high performance computers. Data must be delivered in a variety of modes depending on the end use, ranging from Web portals through to supercomputers. Building infrastructure to meet this need is complex and expensive. There is a need for mechanisms that support both managed and unmanaged data in a coherent and scalable way, often over a physically distributed multi-campus environment. In this talk I will discuss the ways we are delivering such infrastructure at the University of Queensland. Long term hierarchical storage, and many of the computing systems, are housed in a commercial Tier 3 data centre 20 kms from the main campus in St Lucia. Some high performance machines and desktops, and all scientific instruments, are housed on campus. University researchers work with local, national and international collaborators, requiring the need to share data securely and efficiently across a variety of scales. Our COTS based "MeDiCI data fabric" provides seamless access to data in such an environment. In order to improve standards of management, curation and preservation of data, a locally developed meta-data management service called RDM provides a single point of access for storage requests. Recent work on the CAMERA environment links unmanaged collections to managed repositories in a flexible and efficient manner. Finally, the fabric delivers data to a range of commodity and novel computing platforms such as the FlashLite data intensive cluster and the Wiener GPU supercomputer.

Keywords:
Computer science Variety (cybernetics) Data management Scalability World Wide Web RDM Data access Service (business) Data science Work (physics) Database Engineering Business

Metrics

1
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.23
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Research Data Management Practices
Physical Sciences →  Computer Science →  Information Systems
Scientific Computing and Data Management
Social Sciences →  Decision Sciences →  Information Systems and Management
Advanced Data Storage Technologies
Physical Sciences →  Computer Science →  Computer Networks and Communications

Related Documents

JOURNAL ARTICLE

Special section on data-intensive cloud infrastructure

Ashraf AboulnagaBeng Chin OoiPatrick Valduriez

Journal:   The VLDB Journal Year: 2014 Vol: 23 (6)Pages: 843-843
JOURNAL ARTICLE

Data Grids: a new computational infrastructure for data-intensive science

Paul Avery

Journal:   Philosophical Transactions of the Royal Society A Mathematical Physical and Engineering Sciences Year: 2002 Vol: 360 (1795)Pages: 1191-1209
© 2026 ScienceGate Book Chapters — All rights reserved.