Location, location, location

A quiz: how many pieces of information do you need to accurately and unambiguously locate a spot on the earth?

It depends a bit if we're talking about locations on a globe, in which case we can use latitude and longitude, or locations on a map, in which case we will need coordinates and a projection too. Since maps are flat, we need a transformation from the curved globe into flatland — a projection

So how many pieces of information do we need?

The answer is surprising to many people. Unless you deal with spatial data a lot, you may not realize that latitude and longitude are not enough to locate you on the earth. Likewise for a map, an easting (or x coordinate) and northing (y) are insufficient, even if you also give the projection, such as the Universal Transverse Mercator zone (20T for Nova Scotia). In each case, the missing information is the datum. 

Why do we need a datum? It's similar to the problem of measuring elevation. Where will you measure it from? You can use 'sea-level', but the sea moves up and down in complicated tidal rhythms that vary geographically and temporally. So we concoct synthetic datums like Mean Sea Level, or Mean High Water, or Mean Higher High Water, or... there are 17 to choose from! To try to simplify things, there are standards like the North American Vertical Datum of 1988, but it's important to recognize that these are human constructs: sea-level is simply not static, spatially or temporally.

To give coordinates faithfully, we need a standard grid. Cartesian coordinates plotted on a piece of paper are straightforward: the paper is flat and smooth. But the earth's sphere is not flat or smooth at any scale. So we construct a reference ellipsoid, and then locate that ellipsoid on the earth. Together, these references make a geodetic datum. When we give coordinates, whether it's geographic lat–long or cartographic xy, we must also give the datum. Without it, the coordinates are ambiguous. 

How ambiguous are they? It depends how much accuracy you need! If you're trying to locate a city, the differences are small — two important datums, NAD27 and NAD83, are different by up to about 80 m for most of North America. But 80 m is a long way when you're shooting seismic or drilling a well.

What are these datums then? In North America, especially in the energy business, we need to know three:

NAD27 — North American Datum of 1927, Based on the Clarke 1866 ellipsoid and fixed on Meades Ranch, Kansas. This datum is very commonly used in the oilfield, even today. The complexity and cost of moving to NAD83 is very large, and will probably happen v e r y  s l o w l y. In case you need it, here's an awesome tool for converting between datums. 

NAD83 — North American Datum of 1983, based on the GRS 80 ellipsoid and fixed using a gravity field model. This datum is also commonly seen in modern survey data — watch out if the rest of your project is NAD27! Since most people don't know the datum is important and therefore don't report it, you may never know the datum for some of your data. 

WGS84 — World Geodetic System of 1984, based on the 1996 Earth Gravitational Model. It's the only global datum, and the current standard in most geospatial contexts. The Global Positioning System uses this datum, and coordinates you find in places like Wikipedia and Google Earth use it. It is very, very close to NAD83, with less than 2 m difference in most of North America; but it gets a little worse every year, thanks to plate tectonics!

OK, that's enough about datums. To sum up: always ask for the datum. If you're generating geospatial information, always give the datum. You might not care too much about it today, but Evan and I have spent the better part of two days trying to unravel the locations of wells in Nova Scotia so trust me when I say that one day, you will care!

Disclaimer: we are not geodesy specialists, we just happen to be neck-deep in it at the moment. If you think we've got something wrong, please tell us! Map licensed CC-BY by Wikipedia user Alexrk2 — thank you! Public domain image of Earth from Apollo 17. 

The spectrum of the spectrum

A few weeks ago, I wrote about the notches we see in the spectrums of thin beds, and how they lead to the mysterious quefrency domain. Today I want to delve a bit deeper, borrowing from an article I wrote in 2006.

Why the funny name?

During the Cold War, the United States government was quite concerned with knowing when and where nuclear tests were happening. One method they used was seismic monitoring. To discriminate between detonations and earthquakes, a group of mathematicians from Bell Labs proposed detecting and timing echoes in the seismic recordings. These echoes gave rise to periodic but cryptic notches in the spectrum, the spacing of which was inversely proportional to the timing of the echoes. This is exactly analogous to the seismic response of a thin-bed.

To measure notch spacing, Bogert, Healy and Tukey (1963) invented the cepstrum (an anagram of spectrum and therefore usually pronounced kepstrum). The cepstrum is defined as the Fourier transform of the natural logarithm of the Fourier transform of the signal: in essence, the spectrum of the spectrum. To distinguish this new domain from time, to which is it dimensionally equivalent, they coined several new terms. For example, frequency is transformed to quefrency, phase to saphe, filtering to liftering, even analysis to alanysis.

Today, cepstral analysis is employed extensively in linguistic analysis, especially in connection with voice synthesis. This is because, as I wrote about last time, voiced human speech (consisting of vowel-type sounds that use the vocal chords) has a very different time–frequency signature from unvoiced speech; the difference is easy to quantify with the cepstrum.

What is the cepstrum?

To describe the key properties of the cepstrum, we must look at two fundamental consequences of Fourier theory:

  1. convolution in time is equivalent to multiplication in frequency
  2. the spectrum of an echo contains periodic peaks and notches

Let us look at these in turn. A noise-free seismic trace s can be represented in the time t domain by the convolution of a wavelet w and reflectivity series r thus

convolutional model

Then, in the frequency f domain

In other words, convolution in time becomes multiplication in frequency. The cepstrum is defined as the Fourier transform of the log of the spectrum. Thus, taking logs of the complex moduli

Since the Fourier transform F is a linear operation, the cepstrum is

We can see that the spectrums of the wavelet and reflectivity series are additively combined in the cepstrum. I have tried to show this relationship graphically below. The rows are domains. The columns are the components w, r, and s. Clearly, these thin beds are resolved by this wavelet, but they might not be in the presence of low frequencies and noise. Spectral and cepstral analysis—and alanysis—can help us cut through the seismic and get at the geology. 

Time series (top), spectra (middle), and cepstra (bottom) for a wavelet (left), a reflectivity series containing three 10-ms thin-beds (middle), and the corresponding synthetic trace (right). The band-limited wavelet has a featureless cepstrum, whereas the reflectivity series clearly shows two sets of harmonic peaks, corresponding to the thin- beds (each 10 ms thick) and the thicker composite package.

References

Bogert, B, Healy, M and Tukey, J (1963). The quefrency alanysis of time series for echoes: cepstrum, pseudo-autocovariance, cross- cepstrum, and saphe-cracking. Proceedings of the Symposium on Time Series Analysis, Wiley, 1963.

Hall, M (2006). Predicting stratigraphy with cepstral decomposition. The Leading Edge 25 (2), February 2006 (Special issue on spectral decomposition). doi:10.1190/1.2172313

Greenhouse George image is public domain.

J is for Journal

I'm aware of a few round-ups of journals for geologists, but none for those of us with more geophysical leanings. So here's a list of some of the publications that used to be on my reading list back when I used to actually read things. I've tried to categorize them a bit, but this turned out to be trickier than I thought it would be; I hope my buckets make some sense.

Journals with mirrored content at GeoScienceWorld are indicated by GSW

Peer-reviewed journals

Technical magazines

  • First Break — indispensible news from EAGE and the global petroleum scene, and a beautifully produced periodical to boot. No RSS feed, though. Boo. Subscription only.
  • The Leading EdgeGSWRSS — SEG's classic monthly that You Must Read. But... subscription only.
  • Recorder is brilliant value for money, even if it doesn't have an RSS feed. It is also publicly accessible after three months, which is rare to see in our field. Yay, CSEG!

Other petroleum geoscience readables

  • SPE Journal of Petroleum Technology — all the news you need from SPE. It's all online if you can bear the e-reader interface. Mostly manages to tread the marketing-as-article line that some other magazines transgress more often (none of those here; you know what they are).
  • CWLS InSite — openly accessible and often has excellent articles, though it only comes out twice a year now. Its sister organisation, SPWLA, allegedly has a journal called Petrophysics, but I've never seen it and can't find it online. Anyone?
  • Elsevier publish a number of excellent journals, but as you may know, a large part of the scientific community is pressuring the Dutch publishing giant to adopt a less exclusive distribution and pricing model for its content. So I am not reading them any more, or linking to them today. This might seem churlish, but consider that it's not uncommon to be asked for $40 per article, even if the research was publicly funded.

General interest magazines

  • IEEE SpectrumRSS — a terrific monthly from 'the world's largest association for the advancement of technology'. They also publish some awesome niche titles like the unbelievably geeky Signal Processing — RSS. You can subscribe to print issues of Spectrum without joining IEEE, and it's free to read online. My favourite.
  • Royal Statistical Society SignificanceRSS (seems to be empty) — another fantastic cross-disciplinary read. [Updated: You don't have to join the society to get it, and you can read everything online for free]. I've happily paid for this for many years.

How do I read all this stuff?

The easiest way is to grab the RSS feed addresses (right-click and Copy Link Address, or words to that effect) and put them in a feed reader like Google Reader. (Confused? What the heck is RSS?). If you prefer to get things in your email inbox, you can send RSS feeds to email.

If you read other publications that help you stay informed and inspired as an exploration geophysicist — or as any kind of subsurface scientist — let us know what's in your mailbox or RSS feed!

The cover images are copyright of CSEG, CWLS and IEEE. I'm claiming 'fair use' for these low-res images. More A to Z posts...

Shooting into the dark

Part of what makes uncertainty such a slippery subject is that it conflates several concepts that are better kept apart: precision, accuracy, and repeatability. People often mention the first two, less often the third.

It's clear that precision and accuracy are different things. If someone's shooting at you, for instance, it's better that they are inaccurate but precise so that every bullet whizzes exactly 1 metre over your head. But, though the idea of one-off repeatability is built in to the concept of multiple 'readings', scientists often repeat experiments and this wholesale repeatability also needs to be captured. Hence the third drawing. 

One of the things I really like in Peter Copeland's book Communicating Rocks is the accuracy-precision-repeatability figure (here's my review). He captured this concept very nicely, and gives a good description too. There are two weaknesses though, I think, in these classic target figures. First, they portray two dimensions (spatial, in this case), when really each measurement we make is on a single axis. So I tried re-drawing the figure, but on one axis:

The second thing that bothers me is that there is an implied 'correct answer'—the middle of the target. This seems reasonable: we are trying to measure some external reality, after all. The problem is that when we make our measurements, we do not know where the middle of the target is. We are blind.

If we don't know where the bullseye is, we cannot tell the difference between precise and imprecise. But if we don't know the size of the bullseye, we also do not know how accurate we are, or how repeatable our experiments are. Both of these things are entirely relative to the nature of the target. 

What can we do? Sound statistical methods can help us, but most of us don't know what we're doing with statistics (be honest). Do we just need more data? No. More expensive analysis equipment? No.

No, none of this will help. You cannot beat uncertainty. You just have to deal with it.

This is based on an article of mine in the February issue of the CSEG Recorder. Rather woolly, even for me, it's the beginning of a thought experiment about doing a better job dealing with uncertainty. See Hall, M (2012). Do you know what you think you know? CSEG Recorder, February 2012. Online in May. Figures are here. 

News of the month

News of the week was maybe a little ambitious, so we're going to scale back to a monthly post. The same sort of news — technology with subsurface application. Whatever catches our beady eyes, really. Seen something cool? Tip us off.

First, a quick plug. Matt's writing course is on offer again at the CSPG-CSEG-CWLS GeoConvention in Calgary in May. It's a technical writing course, but it's not really about technical writing—it's about get more people writing more stuff. For fun, for science, for whatever. See the conspicuous ad (right) for more info. 

OK, two quick plugs. Dropbox just updated their web interface. If you're not a Dropbox user already, you are missing out on an amazing file storage and transfer tool. Files are accessible from anywhere, and can be shared with a simple web link. We use it every single day for personal and project stuff. Get an account here or click on the illusion.

The technology is coming

A few weeks ago we posted a video of a new augmented reality monocle. Now, news is growing that Google's mysterious X lab is developing some similar-sounding glasses. The general idea is that they connect to your Android phone for communications services, and sit on your face labeling things in the real world, in real time. Labeling with ads, presumably.

As the new iPad now totes a screen with more pixels than the monitor you’re looking at, it’s clear that mobile devices are changing everything there is to change about computing. 

Another SGI ICE, NASA's Pleiades is one of the top ten clusters in the world at 1.4 Pflops. It has a staggering 191TB of memory. Image: NASA.

Not a total flop

Remember SGI? You know, giant blue refrigeratory thing with 12GB of RAM in the back of the viz room, cost about $1M? Completely wiped out by the Linux PC about 10 years ago? Well, not completely: SGI just sold to  Total E&P a giant computer. Much bigger than a refrigerator, and much more expensive than $1M. At 2.3 petaflops (quadrillion floating-point operations per second) this new ICE X machine will be easily one of the most powerful computers in the world.

If the press release is anything to go by, and it probably isn't, Total seems to have reservoir modeling in mind, not just seismic processing. I wonder if they have a mixing board yet? 

Nova Scotia deepwater on fire

Not literally, but there's a small new flame at any rate. Shell Canada went large in January's bid round on four deepwater blocks off Nova Scotia, committing to almost $1B in exploration expenditures over the next five years. They won parcels 1 to 4 for $1.8M, $303M, $235M, $430M respectively, totalling $970M. This is terrific news for Nova Scotia, and for Canada.

This regular news feature is for information only. We aren't connected with any of these organizations, and don't necessarily endorse their products or services. SGI and ICE X are registered trademarks of Silicon Graphics International Corp. The psychobox illusion is a trademark of Dropbox.com. Offshore Nova Scotia map modified from CNSOPB.

A mixing board for the seismic symphony

Seismic processing is busy chasing its tail. OK, maybe an over-generalization, but researchers in the field are very skilled at finding incremental—and sometimes great—improvements in imaging algorithms, geometric corrections, and fidelity. But I don't want any of these things. Or, to be more precise: I don't need any more. 

Reflection seismic data are infested with filters. We don't know what most of these filters look like, and we've trained ourselves to accept and ignore them. We filter out the filters with our intuition. And you know where intuition gets us.

Mixing boardIf I don't want reverse-time, curved-ray migration, or 7-dimensional interpolation, what do I want? Easy: I want to see the filters. I want them perturbed and examined and exposed. Instead of soaking up whatever is left of Moore's Law with cluster-hogging precision, I would prefer to see more of the imprecise stuff. I think we've pushed the precision envelope to somewhere beyond the net uncertainty of our subsurface data, so that quality and sharpness of the seismic image is not, in most cases, the weak point of an integrated interpretation.

So I don't want any more processing products. I want a mixing board for seismic data.

To fully appreciate my point of view, you need to have experienced a large seismic processing project. It's hard enough to process seismic, but if there is enough at stake—traces, deadlines, decisions, or just money—then it is almost impossible to iterate the solution. This is rather ironic, and unfortunate. Every decision, from migration aperture to anisotropic parameters, is considered, tested, and made... and then left behind, never to be revisited.

Linear seismic processing flow

But this linear model, in which each decision is cemented onto the ones before it, seems unlikely to land on the optimal solution. Our fateful string of choices may lead us to a lovely spot, with a picnic area and clean toilets, but the chances that it is the global maximum, which might lie in a distant corner of the solution space, seem slim. What if the spherical divergence was off? Perhaps we should have interpolated to a regularized geometry. Did we leave some ground roll in the data? 

Seismic processing mixing boardLook, I don't know the answer. But I know what it would look like. Instead of spending three months generating the best-ever migration, we'd spend three months (maybe less) generating a universe of good-enough migrations. Then I could sit at my desk and—at least with first order precision—change the spherical divergence, or see if less aggressive noise attenuation helps. A different migration algorithm, perhaps. Maybe my multiples weren't gone after all: more radon!

Instead of looking along the tunnel of the processing flow, I want the bird's eye view of all the possiblities. 

If this sounds impossible, that's because it is impossible, with today's approach: process in full, then view. Why not just do this swath? Ray trace on the graphics card. Do everything in memory and make me buy 256GB of RAM. The Magic Earth mentality of 2001—remember that?

Am I wrong? Maybe we're not even close to good-enough, and we should continue honing, at all costs. But what if the gains to be made in exploring the solution space are bigger than whatever is left for image quality?

I think I can see another local maximum just over there...

Mixing board image: iStockphoto.