Category Archives: Data Grids

SAGA Hadoop

Introduction: Hadoop has become a fixture of big data work, but it has been difficult to use in HPC and HTC cluster environments. This is increasingly unfortunate, as a growing number of new algorithms assume Hadoop is available. I … Continue reading

Posted in Compute Grids, Data Grids, Hadoop, High Throughput Computing (HTC), High Throughput Parallel Computing (HTPC) | 1 Comment

On the Open Science Grid Trail

The Open Science Grid (OSG) is a distributed, heterogeneous network of computing clusters. Its infrastructure and protocols allow members to submit high-throughput compute jobs for remote execution. All use is authenticated and authorized via a public key infrastructure (PKI), which associates jobs … Continue reading


iRODS in the Duke, NERSC, RENCI Collaboration

Research Data and Computation Scenario: Here are some basic characteristics of the problem space we’re addressing in our collaboration with Duke Physics and NERSC. The Duke research team will run OSG jobs manipulating terabytes of data. The team will … Continue reading

Posted in Compute Grids, Data Grids, Globus, grid, iRODS, OSG | Leave a comment