F=faculty; GS=graduate student; US=undergraduate student; PD=post-doc; I=industrial collaborator; O=other
Click on images to enlarge in a new window
Develop a Pilot System for web-based dissemination of linked geotechnical database archives, develop and implementation plan, and plan and conduct a workshop to review and obtain input and consensus of the geotechnical community.
This project supports PEER Lifelines research goals. It is focused on linking distributed geotechnical databases and providing an optimum structure for web dissemination of data from the linked providers through a virtual data dissemination center.
The project methodology to establish baseline current geo-professional practice for generating, archiving, and disseminating geotechnical data and information. Results of the survey are being used to:
Year-6 was devoted to completing the back-end harvesting architecture, XML schema, data provider specific data translators, and a user-friendly interface.
In order for the database providers to maintain complete downloadable records, which can be efficiently retrieved by the GVDC, translators were written for each individual database provider’s data sets. The translators export the data from providers’ databases into XML files, which are stored on a web-accessible directory on the providers’ production servers. The translators map attributes according to the GVDC data dictionary standard. A GIS-based map front-end user interface and an attribute based (text) querying user interface were developed, both of which will query the Database metadata only. Following a query the user is provided a list of applicable records and is able to retrieve the complete XML data records stored on a web accessible directory on the database providers’ production servers in real-time using URLs to those files. This supercedes the initial design which used an attribute-based (only, no GIS-based interface) query system to view applicable lists of data within the Harvester Database data repository, and download complete data records from there.
The Harvester Database repository now only contains a limited number of attributes as search page query parameters. These are metadata parameters common to the linked database providers. Each database provider has an additional translator written for their OS on top of the OAIB, which exports their complete data records for dissemination in the GVDC XML schema.
The front-end search page allows users to search the GVDC Database for metadata parameters only. The parameters are delivered as a list of records organized by data types determined by the GVDC data dictionary categories of tables. The user can query by selecting data types of interest, record name, boring name, or other data identifiers. Java servlets send individual request/queries in sequence to individual database providers’ XML data sets, which reside on the database providers’ servers. The servlets retrieve data record(s), according to the XMLschema, and deliver them to the user in an Excel Spreadsheet (priority 1); or in downloadable XML (ASCII) format, where the XMLschema may also be provided (i.e. as a link) (priority 2).
The commercial code, ArcIMS, is being programmed into the GVDC architecture. ArcIMS applications can be ported to multiple OS’s, including Linux and Microsoft. ArcIMS will be utilized to provide an interactive map of California, utilizing the most recent available free map layers available from ESRI of cities, roads and highways, and county boundaries and allow the user to draw a box that defines coordinates that are transferred from the map to Lat Lon fields in a query page. This string is passed to the Java Search Page using JSP (Java Server Pages) and to a page listing the results. The user can then continue with the Java Search Page, choosing/entering parameters and processing a query to the Harvester Database repository using a Submit button. A separate code processes the query to the Harvester Database, as ArcIMS out-of-the-box doesn’t support MySQL. The Harvester Database repository structure is designed to be relatively simple, extensible, and adaptable to such proprietary databases in the future). The data retrieval will proceed as described above.
Work related to elements of the pilot virtual geotechnical data center is being done by a number of groups:
Detailed descriptions of these activities are provided in Swift, et al., “Invited Workshop on Archiving and Web Dissemination of Geotechnical Data,” Proceedings of a COSMOS/PEER Workshop, October 4 & 5, 2001, COSMOS Publication No. CP-2001/03.
These single databases are or will be web-accessible. The 2L02 project will develop a pilot system to link multiple databases in a virtual system that can be accessed simultaneously. By accessing the system with a search vector, the user will be able to search all databases of linked data providers.
The project is scheduled to be completed in Year-7.
The pilot system is not available for use at this time.