Performing intricate calculations using high-performance computing

A cluster of Ontario computers with a funny name, SHARCNET is considered the model for how university computer-based research can be organized to be most effective. And researchers from all areas are currently taking advantage of the network's high-performance computing grid's capacities.

In September of 2008, a cluster of Ontario computers known as SHARCNET broke into the world's news consciousness when, after 16 days of relentless calculating and recalculating, it verified that yes, 243,112,609-1 was the largest number human beings had ever found that could be divided only by 1 and itself; that is, a prime number.

To give you a metaphorical idea of how truly massive this number was, Jeff Gilchrist, the Carleton University doctoral student who ran the calculation through SHARCNET, computed that if you printed all 12.9 million digits of 243,112,609-1 in a book, that book would have to be 3,200 pages long. If stretched out in 12-point type, the number would have been 50 kilometres long.

While readers of the planet's blogosphere "oohed" and "wowed" and marvelled over the number and its implications for computer encryption, what wasn't making the headlines were some things almost as amazing that SHARCNET had done without any publicity. When the computers spread over 17 institutions weren't verifying prime numbers, they were helping Matt Davison at the University of Western Ontario develop financial computer models that may tell investors the best time to buy and sell their stocks.

They were assisting Vesselin Stoilov of the University of Windsor in his quest for the perfect frictionless surface. This surface would allow car manufacturers, among others, to do away completely with the need for lubricants. It's a search that required simulations of surfaces from the atomic level up. So without SHARCNET's number-crunching power, "We might as well be looking in the dark," says Stoilov.

And let's not forget the work of Chris Bauch, a mathematics professor at the University of Guelph who has been evaluating the cost-effectiveness of giving every child in the country a hepatitis A vaccination.

But perhaps the biggest contribution that SHARCNET - the acronym stands for Shared Hierarchical Academic Research Computing Network - has made to the world is the vision that gave it birth. It is a made-in-Ontario, perfected-in-Canada, adopted-worldwide model for how university computer-based research can be organized to be most effective.

Let's go back to a moment in 1999 when five Ontario universities decided that they wanted to ramp up their computer research capacities. At that time, they found themselves scraping up against the prevailing organizational paradigm.

The buzzword of the day was "supercomputing centres." This was a model in which a massive amount of computing power was placed in a central location, in what Michael Bauer, the Western computer science professor who is now associate director of SHARCNET, calls "a great big barn." At that time, scientists arrived at the supercomputing centres to work on their problems or sent their data there to be processed.

What Guelph, Western, Windsor, Laurier and McMaster scientists decided they wanted instead was a place that was in a sense everywhere and nowhere at the same time.

ORION (Ontario Research and Innovation Optical Network) had been established a few years earlier to provide the high speed connectivity, but it was SHARCNET that was going to provide the real computer power.

"Rather than building a single large facility, we chose to build facilities at multiple institutions and connect them together through a high-speed network," remarks Bauer. "We were 'The Cloud' before 'The Cloud' existed."

The Cloud he is referring to is a 21st-century vision of the future of computing that sees all computers hooked together not through individual servers, but through a kind of Internet-linked cloud of anonymous machine locations.

In retrospect, SHARCNET's version of The Cloud seems to be a totally logical linkage between the emerging high-speed Internet and the relentlessly faster, more powerful and smaller modern computer. But at the time, there was much skepticism about the plan, primarily because there was no existing model for SHARCNET to imitate.

"We had an expert review panel, numbers of whom worked at supercomputing centres, and they asked, 'Why don't you just put all the equipment in one place? Why are you doing this?'" says Bauer.

The psychology of the user was one answer.

"You need to have a local presence" is how Cindy Munro, SHARCNET's director of finance and administration, bluntly puts it.

The local presence was designed to give researchers a sense that SHARCNET belonged to them in a way that the supercomputing centres never did. Turning supercomputing into something that was "ours and not theirs" was particularly important, says Bauer, when it came to convincing non-traditional users - such as researchers working outside of the data-intensive areas of physics and engineering - that they too should reconfigure their research agendas to presume they could deconstruct data mountains.

Not to mention that if scientists bought into the notion of a cloud-like network, they had a real inducement to train their students to feel comfortable using the high-performance computing grid's almost effortless-to-access capacities.

Davison's stock-buying conundrum is a classic example of how this all works.

Trying to figure the best time to buy one of 10 specified stocks over the period of a year is enormously complex and "ends up looking a bit like the problem of heat flow on a 10-dimensional object," he says. "Although the math can be written down in a concise way, it is impossible to find an analytic expression for the problem's solution even when just one asset is in the mix."

So Davison and his students have turned to SHARCNET to simulate a number of market conditions and conduct what you might call a multiplicity of stock-buying experiments.

It was also necessary for the founding universities to understand that while they might have to compete to have access to various pieces of the cloud-like supercomputing system, they also had to buy into the notion of collaboration. "We've developed a new word to describe this," says Munro. "We call it 'coop-petition' to signify that you both cooperate and compete together."

Coop-petition might also describe how the computers themselves are configured. They are not the same machines simply installed in different places, but rather, something else. "We have different people with different needs and we can put together different packages of computers for them to access," says Bauman. "As we have grown, what we have come to realize is that you need differences almost as much as sameness."

In a larger sense, the SHARCNET cloud-like model has become a prototype for other collaborative efforts. The West- Grid Consortium, which links four Western provinces, and the Atlantic Computational Excellence Network (ACEnet), which links together seven universities in the Atlantic provinces, are examples.

Perhaps nothing says more about the integration of high-speed computing with the practice of modern science than the fact that there are two other high-performance computing consortia in Ontario that have thrived with a different organizational model. You might describe them as "The Node" rather than "The Cloud" because they are massive, centrally located centres participants can access over the Internet.

The University of Toronto's SciNet consortium, which services U of T and its associated research hospitals, is such a node. SciNet has just announced that, in conjunction with IBM, it will build Canada's most powerful supercomputer - one that can do jaw-dropping a 360 trillion calculations per second. What could you do with that, you might wonder. How about sift through the Alps of particle data coming from the brand new Large Hadron Collider based in Geneva, searching for a sign of the as yet undiscovered particle that theory says should give all things in the universe mass.

In a similar node vein, Carleton, the University of Ottawa, Queen's, the Royal Military College, Ryerson University, Seneca and Loyalist College are linked via the High Performance Computing Virtual Laboratory, based at Queen's.

Doug Mewhort, a professor of psychology at Queen's, is working with graduate student Mike Jones on developing a computer program that can appreciate linguistic variance with such refinement that Mewhort says, "It can tell the difference between a good writer and a bad writer." Even better for a Google universe, the program may be able to accurately parse the context in which a word has been used and thus narrow the often impossibly large results of a keyword search.

Which approach is better, The Cloud or The Node, you might well ask.

"Our applications are memory hogs, but don't need that many processors," says Mewhort. "If I tried to do our analysis on SHARCNET machines I might fail because I can't get enough memory. Other applications, like many used in physics, require more processors than memory. So they need to farm out applications to many slave machines like SHARCNET uses."

And thus, maybe the ultimate Ontario contribution to how high-performance university research computing should be organized expresses itself as a mantra of contradiction.

Success means you are ever flexible in your approach and, accordingly, how you best configure your computers should be written as - with emphasis - it depends.

Q In general, how do you see Canada doing in the translation of research into application?
A This is an area where we typically do the Canadian thing: beat ourselves up about our supposed failings. I don't think that we're doing any better or worse than other countries because, in general, translating research findings into clinical practice or policy or into products is a tough, frustrating, often contradictory process no matter where you are on the planet.
Read full Q A session
The Government of Ontario has invested more than $3.2 billion in research over the past 10 years (1997-2007).
View chart (PDF)