Data Analysis Services Group - April 2012

News and Accomplishments

VAPOR Project

Project information is available at: http://www.vapor.ucar.edu

TG GIG PY6 Award:

Wes compiled all of the data he collected as well as the methods he used to run the benchmarks and gathered the data for the wiki. The analysis uncovered some minor performance discrepancies in PIOVDC, PIO and PNetCDF run times.  The first time the benchmarks were run  PNetCDF was much faster than PIO even though the PIO run is just writing uncompressed data out to PNetCDF. The second set of times (a few days later)  PIO was much faster. Looking at individual runs we do see there are cases where they are basically the same (what we expect to see) but there are times when they are very different (this is what makes the averaged results look so bad). One of the things Wes looked at with hopes of getting all the times closer was the compression ratio for the PIOVDC compressed version. Changing compression didn't impact in run time.
While looking through everything for the wiki it was discovered that the IOR results I had were not very good. Wes had been using an example script and left in the -z flag which is random access, when removed the IOR runs saw an increase in speed. Another thing Wes changed in IOR was the -t flag up to 4096k.

Wes and Yannick were also looking into the memory issues we were seeing when running on Lynx and Janus. Si pointed Wes to a script he had that basically checks the stat file in /proc/pid and continually updates to vmpeak as well as a few other memory stats. This didn't seem to produce results that could be trusted. Wes spent some more time looking at Tau but ran into some issues with Lynx and Janus. On Janus the system was so loaded Wes couldn't access the scratch directory and people were running jobs on login nodes because the time it took to test something in the regular debug queue was taking about an hour.

Wes also wrote some updated MPI code in fortran to better gather timing data (getting the average runtimes across all processes as well as the max and min in each category). For one of the bench marks Wes had to create a new communicator because the only way it works on the systems is to run it with double the process allocation (memory issues).

XD Vis Award:

Depth peeling support was enabled in the probe.

KISTI Award:

Negotiations on year two of the  KISTI award were completed and the final contract is expected to be signed in May.

Development:

Alan made numerous changes to support the new RegularGrid class, so that we can now properly handle missing values throughout the VAPOR application, except in Python (which will be dealt with soon). John had to restructure portions of the DataMgr to support the Python calculation engine working with the RegularGrid class.

The RegularGrid changes broke the VAPOR support for moving extents.  We came up with a new plan for supporting moving extents.  These changes require a lot of work because they permeated the entire application; Alan has now implemented most of these changes.

Alan set up the Windows 64-bit environment.  We are waiting for Unidata to release a Windows 64-bit version of NetCDF, before we can release 64-bit VAPOR on Windows.

John cleaned up the volume rendering code to reduce the number of unnecessary data loads and improve overall performance.

We redid the VAPOR visualizations for Hank Child’s new book based on reviewers’ comments.

Administrative:

The vapor section of the FY13 POP was written.

The VAPOR team had continued working on their two-year plan, which is expected to be completed in early May. The intent of the plan is to look beyond incremental feature additions to the VAPOR suite of applications.

Education and Outreach:

John and Alan completed revisions for their  chapter on VAPOR for the upcoming book, High Performance Visualization. The completed manuscript has been accepted for publication.

Alan was Interviewed by a reporter from Casper Star Tribune regarding the proposed work for our Siparcs intern Ashish Dhital.

Alan and John attended DOECGF in Albuquerque, NM.  Alan presented an update on the status of the VAPOR project.

Alan met with Rick Brownrigg,  helped him prepare a demo of VAPOR for the EGU Visualization workshop.

Consulting:

Alan has been helping Janice Coen on a new wildfire visualization.

Alan met with Mel Shapiro, helped him construct a new visualization using backwards  unsteady advection.

The VAPOR team, along with VETS, will be supporting the ASD campaign again this year. The primary goal is to produce visualizations that would be showcased at the NWSC grand opening in October. The event will be attended by various dignitaries, including the director of NSF and the governor of Wyoming. The VAPOR team has reviewed the ASD proposals and is making arrangements to work with three of the awardees (VETS will support the remaining two).

Software Research Projects

The NREL LDRD pre-proposal submitted in January was accepted and a full proposal was invited. The full proposal has been submitted and expected to be reviewed next month. If awarded NREL would extend VAPOR's VDC library to support varying compression rates across the spatial domain. This is a $0 proposal for the VAPOR team, who would serve only as consultants.

Several brain storming sessions were held in consideration of responding to the NSF 12-499 big data solicitation. We are now exploring partners for a collaboration that would build upon our wavelet-enabled lossy compression and progressive data access work.

Systems Projects

Storage Research

NWSC Planning & Installation

System Support

GLADE Storage Cluster

Data Transfer Cluster

Other