Imagine a simple interface for data search across an organization's local and cloud storage

"The search would return relevant data types, their location, and automatically extracted metadata. From there, advanced analytics could be performed in a serverless environment, and scale seamlessly to the cloud as needed. Results files would be presented in an interactive, configurable, and shareable format. Large raw data files could be transferred to collaborators in an efficient, parallel format over high speed, low latency connections.

While this visionary solution sounds like an incredible way to advance research and take advantage of diverse datasets, such a solution does not exist. When it comes to managing petascale datasets, most organizations don't know where to start..."

See all Archived IT - HPC articles See all articles from this issue