Proposed SEGY (SEGZ) changes
January 7, 2008
January 7, 2008 - Updated the spreadsheets containing a comparison between different interpretations of the segy standard, just follow this linkApril 4, 2007. Times have changed. We need to move to a more modern way to describe a Seismic Data Exchange Format. We need to support:
After years of looking at SEGY files, the only agreement we can all come to is:
- 64 bit numbers
- Variable length headers, there just is not enough room
- Floating point numbers in the Line and Trace Headers
Geophysicists are demanding flexibility in how we describe our data. Instead of standardizing our data, all we have to do is standardize how we describe our data. Follow this link to see how this is possible.
- A Descriptive text Header
- A Line Header to store numbers and strings in specific byte locations
- A Trace header that stores numbers (fold, static etc), trace numbers and XY cooridinates
- Trace data
Recommendation for a Proposed SEGY rev(2). This simplified SEGY definition will contain only those items that can be agreed upon:
In order to give every company the flexibility they desire, it is proposed we use two signature bytes (399,400 of binary header) to uniquely identify the specified format. Further details are contained here for SEGY rev(2)
- The byte structure, 3200 bytes for Textual Description, 400 bytes for line based information and 240 bytes for traced based information.
- The Sample Interval, Bytes 17-18, Binary16
- The Number of samples, Bytes 21-22, Binary16
EnCana's proposed Workstation SEGY standard consists of three parts, The file name definition, the definition of the Workstation segy and an example of the EBCDIC header. What makes this standard work, is the validation script called check.pl. It needs to run before you transfer the data! The rest of this document outlines the how and the why this document has evolved.
For those of you who want to program to this standard, you may find it easier to examine the different parts, 2D, 3D, Gathers and Shots with the following spread sheets.
The Excel spreadsheets above show the differences from various versions of different segy formats. They also contain a list of names. The idea is that the header entries are no longer to be defined just by the byte-location and description, but instead by a name. This is important since with the ability (or necessity) to define the byte positions where each trace header name is located in the headers, the byte locations can no longer be relied upon to define which header entry we are referring to, so the fixed set of names may well become essential.
- 400 byte line (binary) header Spread sheet (updated Jan 7, 2008)
- 240 byte trace header Spread sheet (updated Jan 7, 2008)
These data files make a good reference.
Properly implemented, data in this format needs only to be attached to users of SeisX and SeisWare after the file name has been updated with eg: .MIG.0.sgy. The perl script encana2segy.pl has been provided to rename (and update SeisX fields) for those of you who do not want to code up the SeisX specific stuff.
Data is to be provided to either on an ftp site, CD, DVD, hard drive, apple ipod, usb dongle etc. A second copy of the data is required for any data not sent directly to the EnCana Data Management Group.
Workstation SEGY Proposal Updates (changes)
- July 19, 2006 - Updated the EnCana Workstation segy definition. Dave Morgan (Divestco) pointed out that there may be confusion with the EnCana signature byte number 100. This number could be miss interpreted as a time in ms to the first sample (CSEG 1994 SEGY definition). The EnCana signature number has now been changed from 100 to the number 101. The Length of the Line Name field has now also been changed to match SeisX/SeisWare and is now 32 characters long.
- July 18, 2006 - Proposal for SEGY rev(2), just those items we can agree upon. Add signature bytes so any company can have their own standard. Follow this link.
- June 29, 2006 - Update a Work Flow HOWTO encapsulate all seismic data into a dual archive file (A data file and an index file). This HOWTO is specifically written for EnCana but can readily be adapted for other use. To read about why this was developed, follow this link. I am sure there is more information here than what you want to read!
- February 13, 2006 - Updated the binary (line) header spreadsheet above and highlighted the Mandatory five digit EPSG code that combines Spheroid, Datum and Projection. A few of these codes are:
- 26710 NAD27 / UTM zone 10N
- 26711 NAD27 / UTM zone 11N
- 26712 NAD27 / UTM zone 12N
- 26910 NAD83 / UTM zone 10N
- 26911 NAD83 / UTM zone 11N
- 26912 NAD83 / UTM zone 12N
- February 10, 2006 - Here's our current look at the proposed SeisCap encapsulation format, it's the next generation of what is known as the Kelman KQ or Seitel's suof encapsulation format. This format will encapsulate all file types.
- January 30, 2006 - Met with a group to discuss Workstation SEGY standards for Survey grids, datums and projections. It was agreed that the EPSG code would be stored in bytes 161-164 of the binary header. Note: EPSG code encapsulates datum and projection into one field and can be used world wide. Please refer to www.epsg.org for more info on these codes. The spreadsheets above have been updated.
- January 15, 2006 - I talked with the Engineering Manager of ARAM, a Division of Geo-X Systems Ltd. He recommended that we declare byte locations for number of extended binary and trace headers. Byte 85 in the binary header and Byte 120 in the trace header has been selected for this purpose. This will give the ability for the recording of field data to provide additional space for information from the field similar to SEGD extended block structure. Note: The Vibroseis specific stuff in bytes 119-155 in the trace header has been removed.
- October 21, 2005 - Updated some examples of the text files, now you can see what a velocity looks like.
- August 5, 2005 - Fixed up the access to Workstation segy data examples that contains 2D, 3D and gathers. They are not very large (most of the data has been stripped off)
- August 4, 2005 - Wrote a perl script to update the binary header with datum, replacement velocity and the total number of traces can be calculated.
- August 2, 2005 - Updated the binary and trace header excel spread sheets. They will now plot out in colour. Send these to a large plotter and you will have a good summary of the different segy standards!
- July 24, 2005 - Why do we need a different segy standard? Why can't we use the cseg format? What's wrong with the SEG Rev(1)format? Why hasn't it become more accepted in the industry? This chapter will show how to use additional stuff in the trace headers.(sp locations, statics, elevation and fold).
- July 13, 2005 - The gather format has actually been used and verified! The floating datum field was essential and EnCana had gathers moved from CGG to Veritas in the Workstation format.
- April 6, 2005 - Corrected references to dates expressed as year then month, ie this month is 200504
- March 14, 2005 - Cleaned up the document, you will now see LineID (AEC) and Reference Number (PCP) used to represent the same thing. Think of it as a unique bar code number to describe the data! Here is a proposed list of digital products (Archival Standards) to be provided by the processing contractor. This list has been approved by the GAG (Geophysical Advisory Group) in April 2005.
- EnCana segy stacks
- Structure stack, unfiltered and not scaled
- Migration stack, unfiltered without post stack spectral balance
- RMS velocity field as a segy stack
- Optional - Filtered, scaled with or without spectral balance
- EnCana TXT files.
- A digital text file describing all acquisition and detailed processing parameters for each version of each line. Mute's and RMS velocity field to be provided in card image format
- Survey coordinates after processor modifications. If the survey has changed by more than 10m horizontal or 5m vertical a new survey disk and listing of the differences is required and to be provided to the Data Resources group.
- EnCana segy Gathers. One set of pre-stack processed gathers - CDP/SHOT gathers, with decon & static with all geometry in the headers, un-muted with (or without)NMO applied (interpreter preference).
- Optional Extras
- Angle/offset limited stacks
- Fold Cube
- Decon QC Cube
- Tiff images of final COFF or velocity semblance plot
- Any processor or interpreter notes accumulated during the course of the processing project. File documenting key findings, tests performed and conclusions made. Format (pdf, doc or ppt)
- November 23, 2004 - Still working on finalizing the gather format. We have created two SeisX, SeisWare templates that you can use to check data for both the binary and trace headers:
We have added and will be starting to check the number of traces in the file (byte 61) and the trace sort code in byte 29. Example data sets, 2D, 3D and gathers are located here. The headers contain real data but the time data values have been removed. If you utilize SiesX/SeisWare and these format templates here's what you will see.
- 2D-3D stack data encana_v100_stack.fmt
- 2D-3D gathers encana_v100_gathers.fmt
- 2D stack data examples Binary/EBCDIC header and Trace header
- 3D stack data examples Binary/EBCDIC header and Trace header
- 3D gather data examples Binary/EBCDIC header and Trace header
- November 2, 2004 - Kirk and I have put some effort to "fix" the standard. The two spread sheets above reflect our current changes. They have resulted as we ensure the format will cover 2D, 3D, gathers and shots. Here are some of the changes for the Binary Header that affect stack data:
Changes to the Trace Header that effect stack data:
- Byte 1 - Reference number is now 12 characters. Devon needs more characters for their reference number. (there was always room for this one)
- Byte 29 - Trace sort code, you now need to fill this out, stack =4, cdp's = 8 and shots = 5. (The check program is going to use this information)
- Byte 61 - Number of traces in the file, this field is now I4 so we can now handle more than 32k traces. (A good check field, do we have all our traces?)
- Byte 53 - Floating datum at each bin location. (Important for foothills processing)
- Byte 99 - Source static applied. (0 if the bin is not on a SP) This field will be used as a flag to tell where shot points have been recorded
- Byte 103 - Total static applied
- October 28, 2004 - Kirk Vensel of CGG points out a few errors in the format definition, especially when it comes to gathers.
These have been fixed and the latest excel spread sheet for the different data types for the trace header is located here
- Binary header, number of traces has been expanded to I4 format in bytes 61-64, We need to handle data with more than 32,767 traces! The Station interval (I2) has been moved to byte position 83, previously unused.
- The word Floating has been added to the Datum elevation for the receiver and source in bytes 53 and 57 of the trace header. If you have gathers (bins), place the floating datum gather in the receiver position.
- Typos in some of the External Name description have been corrected. Dates are stored as Year and then month, ie 200403. This makes it easier to sort your data fields(user request for change)
- All field descriptions in Capitals are part of the SeisX definition. For example, AXIS_TRACE, AXIS_LINE, and AXIS_TIME are assigned the values 1, 2 and 3.
- August 23, 2004 - Updated description for max ensemble.
- June 21, 2004 - Updated the trace sort code, added 1 for shots and 2 for cdp. I'll be testing the Workstation standard for cdp's and shot's over the next few months. We are also considering adding the email address of the geophysicist and processor as the first line in the txt descriptive file to better identify the data! Added some notes requesting that ASCII strings in the Binary header are padded with nulls (ascii 0) and not underscores.
- May 11, 2004 - Updated the check.pl script to check for the 2D or 3D prefix in the user line name field. Removed the number of fields check in encana2seisx.pl script. (Data is checked first)
- May 10, 2004 - Surface shot point added to byte 25 of trace header. This field will be used to identify live shot point locations (enables us to label where the real shot points exist on a line).
- May 6, 2004 - Oooops, we made a mistake. The date in the file name refers to the Processing Date, not the acquisition date. This field is used to identify the most recent processing.
- April 22, 2004 - Are you having problems putting names longer than 32 characters on a CD or DVD? The suggestion is now for data on a CD, if any names are longer than 32 characters, the contractor is to provide a separate batch file to properly rename the files once they have been copied to the hard disk. (It's the CD and DVD format that at's fault) I think this is a workable solution. This has the advantage that you can name the files what the geophysicist wants for his project and provide what the data management group in order to properly archive the data!
- March - 23, 2004 - Updated the Prefix codes to the reference number. For our PC based processors you may find it advantageous to install the following linux shell in order to run the perl scripts.
- February - 23, 2004 - The Workstation segy format becomes official. Earth Signal now joins GeoX in being able to create the Workstation SeisX standard. A few problems have been observed and need to be addressed.
- The Last 130 bytes of the EBCDIC header need to be written as ASCII. This field contains the SeisX Wellid and Description. Our suggestion is to place the processor's name and company in the Wellid field and a short description of the processing flow in the Description field. Remember, it has to be written in Ascii. It probably makes sense to write the complete EBCDIC description as ASCII so we can view the full EBCDIC header at one time.
- Remember, when you specify Longitude in Canada it has a negative sign.
- I've placed some examples of 2D and 3D files located right here that you can take apart to see how others have interpreted the Workstation format.
- I have added a lot more details to the check program and have thrown in a perl script to show you how to write SeisX files in both 2D and 3D. No guarantees that they work in all cases. Let me know if it doesn't for you and I will fix it. Just follow this link to see how to set up the check.pl program.
- I have updated the Workstation binary header documentation to better reflect what the SeisX codes actually mean here. to see the details.
- Note that SeisWare is able to convert statics or elevations directly from the trace headers and create horizons. (I have a perl script with SeisX that does the same thing, send me a note if you would like to use it)
- February - 2, 2004 - Disk file naming standard EDM reference number, EDM line name, user line name, processing flow, date, processor.sgy. Note this reference is the latest document!
- Jan 22, 2004 - GAG Draft for Workstation segy - Microsoft Word documents
- Workstation segy, yellow essential for all data, pink - strongly recommended, green - recommended for SeisX or subsequent processing and blue - optional.
- EBCDIC header example Meaningful description
- July 21, 2003 - The SeisX WELLID and DESC fields are defined as ascii in the EBCDIC header. WELLID isused to identify the source (and contact) of the data.
- June 15, 2003 - Dave Cooper suggests we call this segy format the Workstation format. This explains why:
It was also recommended that we keep the EBCDIC header as EBCDIC, at least until our loading tools (Panther) can read a ASCII header for Landmark users. The SeisX/SeisWare users may want to use ASCII, this enables the complete textual to be read at one time. (SeisX writes to the last 60 bytes in ASCII)
- IEEE floating point numbers are used
- ASCII names and floating point numbers are used in the Binary Header
- It's not possible to use the SEG Rev 1 SEGY standard.
- May 27, 2003 - I'm having loading problems, Panther's SDL doesn't work. What should I do? Here is some documentation on a perl script that will massage the Workstation SEGY standard to get around SDL's current limitation.
- May 21, 2003 - Make sure XY's populate the SOURCE_X_LOCATION and SOURCE_Y_LOCATION at all traces. Kingdom defaults to these byte locations. Populate the Latitude and Longitude locations only where SPID's exist, otherwise use the number 0.
- May 6, 2003 - 3D - Added the closest shot point for byte 17, 0 if the bin is located between shot points.
- May 1, 2003 - There is a debate in our shop, should we use a 5 or a 6 for IEEE floating point. One camp wants to support the SEG REV 1 standard (and Landmark) that has declared the number 5. However, Kingdom, Paradigm and SeisWare use a 5 for 8 bit integer. Geoquest uses 5 for VAX floating point. Others have used a 5 for 36bit floating point (some obscure hardware). The other camp wants to support the CSEG that in 1994 declared the number 6 for IEEE floating point. Paradigm, SeisWare, Winpicks, Geoquest and Hampson Russell have been using a 6 for a number of years. I guess all I can say is it will either be a five or a six. Keep posted!
- April 13, 2003 - We are now using total source static (trace header, bytes 99,100) to determine where shot points exist. Added the perl script encana_dump.pl to dump out the EBCDIC (ASCII) descriptive header and binary header contents. Useful for the data loaders and as a check for the seismic processors.
- April 3, 2003 - Arcis points out that we need a flag if we don't know the datum or replacement velocity. We have decided to use -9999 as the flag. Check.pl has been updated to reflect this flag and to allow more stack codes to pass. Now all you require is at least one valid code instead of all valid codes.
- March 21, 2003 - Just the file naming convention
- March 20, 2003 - Multi volume data sets defined (eg 3D's larger than 2gig)
- March 20, 2003 - The list of Processor names has been updated, it's now twice as large and you now have the description as well as the code.
- March 20, 2003 - The Shot_point_surface location in byte 197 has been changed from IEEE four byte float to I4. This number will not contain a fraction.
- March 18, 2003 - A format code of -3 means IEEE float and an list of files transferred is now required.
- March 4, 2003 - Follow this link (Perl Check scripts) for details on how to use the Workstation check.pl program.
- Here is the list of valid stack names.
- January 20, 2003 - Dave Morgan (Geo-X) starts to code our Mandatory fields, points out a few minor problems explained in the minutes.
- Oct 28, 2002 - Comments from Alan Faichney, Chairman, SEG Technical Standards Committee
- Oct 12, 2002 - Incorporate some ideas from SEG Y rev1 Data Exchange format. Here are a few comments why we can't use the SEG Y rev1 Data Exchange Format, May 2002 standard.
- September 25, 2002 - Incorporated comments from Veritas, Big endian, use IEEE instead of decimeters
- September 23, 2002 - Proposed modifications primarily to the trace header to handle the marine situation by Ann O'Byrne.
Background
In 1975 Barry, Cavers and Kneale proposed a set of standards for digital exchange of Seismic Stack Data. Today, practically all that is left of the original 27 year old standard is the format of the data layout. Segy data contains a 3200 byte descriptive header, a 400 byte binary (line specific data) a 240 trace header followed by variable length data. The original standard pertained to 9 track 1/2 tape and IBM computers. SEGY Rev0 covered IBM floating point.
![]()
In June 1994 a group a Standards Committee was formed by the C.S.E.G. in Calgary. They re examined the standard and made their recommendations here. In October 1999, Doug Bath published an article in the CSEG recorder "This article outlines a GENERIC SEGY format for stacked data, providing essential information and workstation data formats while maintaining the original SEGY specifications as much as possible". This previous format recommended using integers and scalers of 1000. Our new format is now recommending using IEEE floating points where appropriate in the headers as well as the data samples. We agree with the CSEG that the number 6 should be used for IEEE float, not the number 5 as defined by Landmark and the SEG. This may cause you grief if you try to load data into Landmark without first changing the format code. Remember, Photon (SeisX) uses the number 5 to signify 8bit integer.
The data were originally stored as integer and IBM Floating point for numbers and EBCDIC for text descriptions. Today the standards are disk, CD, DVD, DLT etc with numbers stored as integer or IEEE float and ASCII for text. Even IBM has made the switch to the IEEE standard.
Any data on disk can be read more efficiently in IEEE than in IBM floating point!
Both the former AEC and PCP had their own standards. In fact, PCP had two standards, one for 2D and another for 3D. AEC had one standard for both and included many (but not all) fields as defined by SeisX. The following proposal will define a new single standard for segy at EnCana.
This new proposal includes all of the fields defined by the 2002 SEG Rev 1 standard except for a few minor fields:
Adherence to the specifications outlined below will permit both 2D and 3D stacked seismic lines to be directly attached to a SeisX project thus removing one of the time consuming SeisX project management steps.
- Job ID, Reference number, minutes and seconds of a record are not defined
- The Extended Textual File Header is not supported, a separate text files must be used. This is a very much simplier solution. I don't know of any service companies that can actually read these data!
- The proposed standard uses IEEE floating point numbers in the Binary and Trace headers (vs integers and scale factors)
- Format codes are slightly different. We follow the CSEG 1994 standard, the number 5 represents 8bit and the number 6 represents IEEE float. Please note, this is different than Landmark where a 6 is 8bit!
Proposal
Naming Convention
It is proposed that EnCana adopt the former PCP naming convention for the following reasons:The following naming convention be used:
- PCP currently has ~210,000 segy files on line that follow this standard
- The AEC standard applies only to CD. The information contained in the directories does not extend to the ~50,000 segy files on the system (under /auto)
- The PCP naming convention is more complete than AEC
Note that the EDM line line is assigned by EnCana's operation group. It will represent the original historical line name. 2D Examples:
- 2D : Line_ID.EDM_line_name.2D-Line_name.stack_type.year_month of generation.processing company name.sgy
- 3D : Line_ID.EDM_line_name.3D-Area_name.stack_type.year_month of generation.processing company name.sgy
F73379.ECA-VER-993D-02P.3D-verger.mig.200004.abc.sgy A44267.MKD-5.2D-MKD-5.f-fk-mig-100.200302.gox.sgy P172939.02-SBL-2.2D-SBL-2.f-fk-mig-94.200302.gox.sgy A43572.01-KUG-13.2D-KUG-13.f-fk-mig.200302.gox.sgy A44263.MKD-2N.2D.f-fk-str.200302.gox.sgyThe external line names must adhere to unix naming conventions. Spaces, dots, brackets, special characters, are not to be used! Date is now specified as year and then month, this makes more sense when the data are sorted by this field.If the data are brute stack, the descriptor temp should be included in the stack type field. These data will automatically be removed from out system after a period of time.
The common name is be added to a 3D data set. Here is a 3D Example:
P173286.JENSEN3D.3D-JENSENMERGE.test-istk.200302.gox.sgyIf a 3D has to be broken into separate parts it should be named as follows:P81848.PCP-MERG-003DM-08.3D-JENSEN.f-ma-mig-1of5.200203.gox.sgy P81848.PCP-MERG-003DM-08.3D-JENSEN.f-ma-mig-2of5.200203.gox.sgy P81848.PCP-MERG-003DM-08.3D-JENSEN.f-ma-mig-2of5.200203.gox.sgy P81848.PCP-MERG-003DM-08.3D-JENSEN.f-ma-mig-2of5.200203.gox.sgy P81848.PCP-MERG-003DM-08.3D-JENSEN.f-ma-mig-2of5.200203.gox.sgyNote the 3D prefix, this will permit the 3D's to sort together in a list.
Inventory List
In order to identify data on our ftp data server and to verify that we have the complete file, an ascii text file is required containing email addresses for the processor, geophysicist, file name and file size. This file can be emailed to the Geophysicist and should exist with the data. The format for the file is date.processor_email.txt and an example file will look like:
more 04mar30.obriend@geo-x.com.txt obriend@geo-x.com eric.keyser@encana.com P12345.MODEL1-876.2D.mig.200303.gox.sgy 1424000 obriend@geo-x.com eric.keyser@encana.com P12345.MODEL1-876.2D.mig.200303.gox.txt 14415 obriend@geo-x.com eric.keyser@encana.com P12345.MODEL1-876.2D.ro.200303.gox.sgy 1424000 obriend@geo-x.com eric.keyser@encana.com P12345.MODEL1-876.2D.ro.200303.gox.txt 14415This file will be treated as space delimited to simplify validation scripts.Prefix Letter
Changes to the reference numbers will be required for EnCana Workstation. We have decided to use a series of prefix characters to preserve data integrity. The new prefix codes are:
Prefix Line_ID (Reference number) A XAEC Alberta Energy Company P XPCE Pan Canadian F EnCana Workstation (International) D EnCana Workstation (Domestic) Blank Don't know 3200 byte Descriptive Header
It is recommended that the Workstation SEGY format adopt the somewhat free format for the Descriptive (formally EBCDIC) header as defined by AEC. The PCP standard was very rigid and sometimes PCP's processing partners found there was no room to record important descriptive data. The 3200 byte ASCII Descriptive header serves two very important roles, it provides:This header is used to store Job Identification Header (Record 1, 3200 bytes, ASCII coded block) with the following information:
- General line information to assist with data loading to an interpretation workstation
- Information conventionally written on the processing side label on traditional paper sections a llowing the interpreter to browse the Descriptive header for information about a line or 3D volume. By switching to ASCII format this header can easily be viewed. SeisX already uses the last two lines to store descriptive data in ASCII format. (Actually only the last 130 bytes). Be sure to leave these fields blank or loading into SeisX will trash them out.
Here is an example of an EnCana EBCDIC header filled out by Mark Baxter of C&C.
- Client and Project Names, Area, EnCana Reference Number (Line ID), Line Name (if applicable), Tape Number, Display Des cription
- Field Parameters, Acquisition / Processing Company and Instrumentation
- Processing Sequence and Description
- datum elevation and replacement velocity
- processing center and date
- time of first sample
- survey datum (i.e. NAD 27, NAD 83)
- UTM zone or central meridian
- Trace header byte locations for important data loading parameters (shot point, CDP, inline and crossline num bers), Additional information for 3d
- NE corner: Trace count, XUTM, YUTM
- NW corner: Trace count, XUTM, YUTM
- SE corner: Trace count, XUTM, YUTM
- SW corner: Trace count, XUTM, YUTM
See how he has left the last two lines blank (SeisX will trash these locations) In order to complete the SeisX definition, the last 130 bytes of the EBCDIC header must be filled out as ASCII. Blanks are ok but the following is recommended:
- Byte 3070 for 50 bytes, ascii SeisX WELLID - Source Identification plus email, eg ARCIS,dclark@arcis.com This information will help us better identify data
- 3120 for 80 bytes, ascii SeisX DESC - External file name, this field is displayed on the top of each seismic display window.
Binary and Trace Header definition
It is recommended that following guidelines be used for a new segy stack standard:
- Start off with the SeisX segy_v3.fmt definition. All floating point numbers are stored as IEEE, today's computer standard. Note: the code -3 in the format descriptions refers to IEEE Big Endian four byte floating point (ie a float on the SUN). SeisX format files have the following advantages:
- Format is well understood and many companies already generate this format
- Data loading is eliminated for users of SeisX/SeisWare. Data are attached not loaded
- Data loading is simplified for Landmark users with only one format defined
- Check programs already exist for that format
- Data can be previewed directly with SeisX before they are loaded
- Adopt the SEG Y rev1 data exchange standard, using reals instead of scaled integers
- Adopt the AEC standard for those fields not specified by SeisX or SEG Y rev1.
- The proposed new Workstation format will look like:
- 400 bytes of the Binary (line constant) header as defined here.
- 240 bytes of trace header definition stored here.
- Variable length data with floating point numbers now stored as IEEE (change from IBM float). A job constant scaler is to be applied scaling the RMS for the line to the number 4000. This will enable multiple versions of the same data to be compared. Why the number 4000? Data can be converted directly to 16bit with minimum clipping.
Validation -- Check Program
A validation program is essential to ensure EnCana's standards are met. Seismic data can easily be loaded if key fields are filled out in a consistant format. Feed back from processing contractors have indicated that our prior standards (especially the EBCDIC header) were a bit much. There are two levels of checks, Lite and Heavy. The Lite check will be used for data received by thire parties and the heavy check for data from our preferred seismic partners. The Lite checks are:The Workstation heavy checks are:
- Valid processor name
- Valid stack type (Only the first - separated field will be checked)
- Valid date and month
- External line name
- Check segy format code in the Binary Header
The exact details of the fields that will be checked are contained in the Reference section below and are labelled as column Key. Items labelled with a one will be checked and it is expect the vendor will correct his data to this standard. EnCana Workstation has provided the check.pl program to validate your data. You have the source code available and can determine exactly what the checks are!
- Valid Seismic Datum and Replacement Velocity (check the constants in check.pl)
- Reference Number and Line Name from the Binary header agree with the external line name
- Data are flagged with the geometry flag to signify 2D or 3D data
- The elevation and the total statics are within acceptable ranges (check the constants in check.pl)
- For 2D data the difference between shot point fields are the same (ie .5 or 1)
- For 3D data the difference between the cross lines are not zero
- Company signature byte in the Binary Header, EnCana = 100 (the letter d )
The signature byte will enable software to automatically reformat to any future standard.
Reference
So what are we missing? This proposal contains 100% of the SeisX definition. Click here to see what is missing from our former standards.The support documents for the AEC standard.
The support documents for the PCP segy standard. Comments from interested parties are located here.
- Archival Requirements: General Descriptions
- CD Layout Standard
- SeisX Format Requirements for 2D Seismic Data
- EBCDIC Header Example
- Tape Label Examples for 3D and 2D Seismic
- Photon ASCII Velocity Table File Format Description
- General Velocity Table Format Description
Site Owner: Eric Keyser