The development of Canada’s largest grid computing network is not only contributing to a worldwide particle physics project, it’s teaching high-performance users how to share their resources. In production since the fall of last year, GridX1 consists of eight clusters operating out of a series of post-secondary schools, including the University of Victoria, the Centre for Subatomic Research at the University of Alberta, the WestGrid cluster at the University of British Columbia, and the Research Computing Support Group at the National Research Centre in Ottawa. These facilities are pooling their computer cycles to share data and applications through a technique known as grid computing. The project has been underway for several years.Running on open source software using Intel-based servers, Grid X1 is linked to an experiment running out of the European Organization for Nuclear Research (CERN) in Geneva. The experiment, called ATLAS, will simulate the way protons collide in order to learn more about matter and prove the existence of a particle, the Higgs boson, that scientists hope will explain how the universe was formed.
Randall Sobie, a UVic professor and GridX1 research scientist, said when ATLAS begins to record data in 2007 or 2008 it is expected to generate a petabyte or two of data per year. “The challenge of analyzing all that data made us realize we couldn’t do it in one site. The idea was to exploit all these facilities that are distributed around the world,” he said. “You have the resources, you have the manpower, why not build a facility to do this? The grid seemed to make sense.”
There is no individual funding for GridX1, nor is there an over-arching body that controls the way the grid operates. Although the grid contains about 3,000 CPUs, Sobie said only about 10 to 15 per cent of the resources tend to be used.
Sobie compared GridX1 to the Seti@home project, which allows home users to pool their PC resources during idle periods as part of an experiment to search for extra-terrestrial life.
“In a way it’s like that kind of concept but on a bigger scale. Instead of a desktop resource it’s a network resource,” he said.
Roger Impey, group leader for the National Research Council, described GridX1 as an altruistic network where control over local infrastructure is maintained. “It’s a very Canadian kind of system, in that we’re all sort of trying to get along,” he said. “Only certain applications really lend themselves to this kind of environment, because of the way the thing is structured. You have to live with this sort of situation where you don’t really know how many resources you’re going to get at any given point. You certainly can’t turn around and demand more.”
That spirit of co-operation is helping others understand how to run distributed high-performance computing environments more effectively, according to other GridX1 participants.

Share on LinkedIn Share with Google+