Document Formats:
Although most marine data is archived and published in purely data formats, there are cases where the data are only available in formats usually concerned with documents, including proprietary formats or elaborately formatted ASCII text. In most cases programs can easily be written to extract or convert ASCII, but proprietary formats (e.g. MS Word) can be extremely difficult to work with, unless regular blocks of material can be copied and pasted into an ASCII editor. Very large resources in proprietary formats simply wouldn't be possible to treat in this way. Another problem might be that the proprietary format has become obsolete. To the maximum practicable extent, data contained in proprietary formats should be exported or stripped out to ASCII tables, documented, and stored separately for safety.
Just about every possible way that data can be expressed visually on the page has been used at some time. Great effort sometimes went into creating tables with row and column headings, and attractive headers. These were usually published with little or no thought given to eventual use in a computer.
This CD-ROM contained ASCII text documents of data, intended to look good when printed. Notice that it even has vertical labels along the left side. To actually use these data required a program that went into each line and extracted the desired numbers. There were literally thousands of these tables in a few huge files.
NODC SURFACE CURRENT (SHIPS DRIFT) LONG SUMMARY 10-DEGREE SQUARE 1400 1-DEGREE SQUARE 35 MONTH 7 RESULTANT DIRECTION 0 TOTAL OBSERVATIONS 1 NORTH COMPONENT .0 RESULTANT SPEED .0 EAST COMPONENT .0 DISTRIBUTION OF INDIVIDUAL OBSERVATIONS PER- KNOTS(CM/SEC) N NE E SE S SW W NW SUM CENT CALM 1 100.0 0.1 (5) 0 0 0 0 0 0 0 0 0 .0 S 0.3 (15) 0 0 0 0 0 0 0 0 0 .0 P 0.5 (26) 0 0 0 0 0 0 0 0 0 .0 E 0.7 (36) 0 0 0 0 0 0 0 0 0 .0 E 0.9 (46) 0 0 0 0 0 0 0 0 0 .0 D 1.1 (57) 0 0 0 0 0 0 0 0 0 .0 1.3 (67) 0 0 0 0 0 0 0 0 0 .0 C 1.5 (77) 0 0 0 0 0 0 0 0 0 .0 L 1.7 (88) 0 0 0 0 0 0 0 0 0 .0 A 1.9 (98) 0 0 0 0 0 0 0 0 0 .0 S 2.5 (129) 0 0 0 0 0 0 0 0 0 .0 S 3.0 (154) 0 0 0 0 0 0 0 0 0 .0 3.5 (180) 0 0 0 0 0 0 0 0 0 .0 4.0 (206) 0 0 0 0 0 0 0 0 0 .0 >4.0(>206) 0 0 0 0 0 0 0 0 0 .0 SUM OF OBS. 0 0 0 0 0 0 0 0 PERCENT OBS. .0 .0 .0 .0 .0 .0 .0 .0 100.0 MEAN SPEED .0 .0 .0 .0 .0 .0 .0 .0 MAX. SPEED .0 .0 .0 .0 .0 .0 .0 .0 STD DEVIATION .0 .0 .0 .0 .0 .0 .0 .0 10-DEGREE SQUARE 1400 1-DEGREE SQUARE 42 MONTH 9 RESULTANT DIRECTION 224 TOTAL OBSERVATIONS 1 NORTH COMPONENT -.1 RESULTANT SPEED .2 EAST COMPONENT -.1
The National Virtual Ocean Data System (NVODS), an OPeNDAP application using the interface, can deliver several different data formats that are crafted to have a good appearance when printed, as shown by this example. Note that the columns and rows are neatly labeled with their geographic coordinates. The use of -1.E+34 as the blank value is a common choice. Note that the link in the third row of the header may not be correct or active at this time.
VARIABLE : ZONAL SURFACE CURRENT (CM/S) FILENAME : shipdrift_moncl.nc FILEPATH : http://ferret.pmel.noaa.gov/thredds/dodsC/data/PMEL/ BAD FLAG : -9.9999998E+33 SUBSET : 11 by 12 points (LONGITUDE-LATITUDE) TIME : 16-JAN 06:00 6.5E 7.5E 8.5E 9.5E 10.5E 11.5E 12.5E 13.5E 14.5E 15.5E 16.5E 29.5S -13.07 -10.97 -8.455 -9.023 -17.87 11.69 -12.21 -10.4 -7.275 -24.7 29 28.5S -0.45 -25.24 -11.38 -12.99 -5.944 -5.1 -4.437 -4.928 -3.55 -15.55 -1.E+34 27.5S -20.25 -10.60 -11.98 -15.17 -11.33 -12.34 -6.765 -8.083 2.33 -10.6 -1.E+34 26.5S -10.63 -19.46 -8.239 -19.46 -13 -8.787 -5.7 -10.7 -35.4 -1.E+34 -1.E+34 25.5S -17.78 -13.42 -10.89 -10.81 -5.036 -9.609 23.1 -4.4 -3.6 -1.E+34 -1.E+34 24.5S -20.48 -16.05 -7.433 -5.867 -4.063 -1.E+34 -11.96 -1.E+34 2.817 -1.E+34 -1.E+34 23.5S -12.56 2.614 -10.04 1.785 5.525 0 6.6 1.7 -0.567 -1.E+34 -1.E+34 22.5S -12.96 -6.531 -12.12 4.4 13.75 -12.8 -14.83 -2.467 -5.333 -1.E+34 -1.E+34 21.5S -9.536 -1.171 7.97 59.9 -11.15 4.4 -22.68 -1.E+34 -1.E+34 -1.E+34 -1.E+34 20.5S -7.088 -6.9 53.1 -24.05 -7.825 -13.63 -4.243 -1.E+34 -1.E+34 -1.E+34 -1.E+34 19.5S -19.71 -16.25 18 -23.1 3.5 2.1 -8.7 -1.E+34 -1.E+34 -1.E+34 -1.E+34 18.5S -13.7 -4.4 1.667 6.425 -15.1 13.2 -1.E+34 -1.E+34 -1.E+34 -1.E+34 -1.E+34
Very widely used format, native to the MS Word program.
Highly valuable and widely used for scanning hard-copy documents