Quantcast
PC World: Technology Advice You Can Trust
Find a Review
Free Newsletters
Receive the latest reviews, how-to's, news, and more.
Consumer Advocate
Tech-Savvy Business
Daily Technology News
WiFi Finder
Locate wireless services by a specific address, city, state, country, airport, or zip code.
RSS Feeds
Get our latest content via convenient RSS feeds.
Latest News
Today @ PC World
Become a PCW Member
Join the community and start enjoying the benefits:
  • Get tech advice from thousands of PC World Members
  • Rate and recommend the latest tech products
  • Share your thoughts in blog and article comments
  • Get free excerpts and exclusive discounts on Super Guides
Read More About: Internet

Keeping AOL's Files in Order

Lessons from AOL's operations architect on managing huge volumes of data files.

Deni Connor, Network World

Sunday, April 15, 2007 6:00 PM PDT
Recommend this story?

Dan Pollack of AOL had a big data problem -- 8 petabytes of unstructured data to be exact -- data that his clients had to access and manage, and he had to make sure was always available.

Pollack, operations architect for the internet service provider in Dulles, Va., manages a traditional storage-area network (SAN), which has about 3,000 connected servers and uses 10,000 Fibre Channel storage ports, to store the transactional data generated by line-of-business applications. But it is the 8 petabytes of unstructured data stored on 1,000 file servers across his enterprise that was causing him fits.

Client applications need to access this data continually, whether it is for archival or sharing purposes. It was necessary for Pollack to have some way to manage the data and ensure its accessibility during upgrades, patches and maintenance activity. He also wanted to maintain this data on lower-cost storage than his SAN, because while it is important data, it doesn't have the business-criticality of data stored on the SAN.

"While we found that [a SAN] is good for a lot of critical applications, there's another class of applications that require very high capacity but very low-cost storage in order to not go broke trying to provide services to your customers," Pollack says.

Pollack started to investigate storage systems that would give him high capacity, less-expensive storage and the uptime he required. He found a number of cluster-based network-attached storage (NAS) systems from companies the likes of Panasas, Isilon, PolyServe (now HP ) and IBRIX. After careful consideration, Pollack chose IBRIX's Fusion.

"One of the things we liked about IBRIX over those other options is that they've re-implemented the interface as a file system as opposed to levering a traditional NAS protocol like NFS [the Network File System], CIFS [the Common Internet File System] or iSCSI," Pollack says. "What that does is makes it extremely understandable for our clients and easy to access."

He also chose IBRIX over NAS systems because he considered it to be more available.

"The large-scale NAS deployments are not as available as they ought to be," Pollack says. "We have a long, checkered experience with large NAS deployments and very large systems that support customer-facing applications."

"In order to avoid this availability problem, we wanted to go with something that was cluster-based and non-coupled," he says.

IBRIX's Fusion is a software-only system, which installs on each of the 1,000 file servers in Pollack's cluster. With Fusion, Pollack can use any storage media he wants, unlike packages such as Panasas, which require use of their own storage.

"With IBRIX we can select our own hardware at the lowest cost possible," says Pollack, who is using direct-attached storage consisting of 12 700GB ATA drives per server.

As for uptime, when Pollack wants to do upgrades or maintenance, he can take down a single server in the cluster, perform the upgrade and then bring the node back up without disrupting the operations of the rest of the cluster.

"There may be a minimal data unavailability on whatever is offline but every other server is available," Pollack says. "Any unavailability is a small part of the total complex and it doesn't affect the actual behavior of the rest of the complex."

When Pollack's cluster grows, he can add new nodes in without taking down the cluster. Data is balanced across the cluster as it is added.

However, there is one thing missing though from Pollack's wish list: the ability for the cluster to automatically redistribute data to other nodes in the cluster when a failure occurs.

"It's a requirement we gave to the IBRIX guys that they should have delivered in the next couple of months," Pollack says.


For more information about enterprise networking, go to NetworkWorld. Story copyright 2008 Network World Inc. All rights reserved.


Recommend this story?
Related Searches: file management aol operations

Comments
Latest News
The iPhone's reach expanded again Friday, with Orange announcing plans to sell the phone in Europe, the Middle East and... 16-May-2008
A new train simulator codeveloped by Fujitsu offers unparalleled realism thanks to high-definition video shot on actual train... 16-May-2008
Samsung Electronics will unveil this weekend the first prototype of a new LCD (liquid crystal display) technology that won't... 16-May-2008
With all the time spent on the road, most drivers consider their cars to be their second homes. Reaching their primary home... 16-May-2008
Internet users in China have begun expressing solidarity with the victims of Monday's earthquake via their instant messaging... 15-May-2008
Sony has promoted a senior executive at its U.S. games studio to lead its global studios, it said Friday. 15-May-2008
Fujitsu has developed a prototype electronic paper screen that tackles one of the technology's biggest weaknesses: the amount... 15-May-2008
The One Laptop Per Child Project and Microsoft plan to make both Windows and Linux available on a version of the project's XO... 15-May-2008
Yahoo has responded to investor Carl Icahn's threat to take control of Yahoo's board and force it back to the negotiating... 15-May-2008
Billionaire investor Carl Icahn's proxy fight for Yahoo is aimed at reigniting merger talks between the Internet company and... 15-May-2008

PC World's Marketplace

PC World's Free Whitepapers

Name City
Address 1 State Zip
Address 2 E-mail (optional)