3. Time resolution of data set
been completed)
| One row of several dozen in the NCDC archive of hard copy paper holdings from around the world |
| Microfilm holdings arising from Europe over the second world war |
| Example search results for American Samoa |
![]() |
| Figure 1 Station locations of the four benchmark regions. Blue stations are in all worlds. Red stations only appear in worlds 2 and 3. |
![]() |
| Figure 1 Overview the ISTI comprehensive benchmarking system for assessing performance of homogenisation algorithms. (Fig. 3 of Willett et al., 2014) |
ccc-gistemp is Climate Code Foundation‘s rewrite of the NASA GISS Surface Temperature Analysis GISTEMP. It produces exactly the same result, but is written in clear Python.
I’ve recently modified ccc-gistemp so that it can use the dataset recently released by the International Surface Temperature Initiative. Normally ccc-gistemp uses GHCN-M, but the ISTI dataset is much larger. Since ISTI publish the Stage 3 dataset in the same format as GHCN-M v3 the required changes were relatively minor, and Climate Code Foundation appreciates the fact that ISTI is published in several formats, including GHCN-M v3.
The ISTI dataset is not quality controlled, so, after re-reading section 3.3 of Lawrimore et al 2011, I implemented an extremely simple quality control scheme, MADQC. In MADQC a data value is rejected if its distance from the median (for its station’s named month) exceeds 5 times the median absolute deviation (MAD, hence MADQC); any series with fewer than 20 values (for each named month) is rejected.
So far I’ve found MADQC to be reasonable at rejecting the grossest non climatic errors.
Let’s compare the ccc-gistemp analysis using the ISTI Stage 3 dataset versus using the GHCN-M QCU dataset. The analysis for each hemisphere:
For both hemispheres the agreement is generally good and certainly within the published error bounds.
Zooming in on the recent period:
Now we can see the agreement in the northern hemisphere is excellent. In the southern hemisphere agreement is very good. The trend is slightly higher for the ISTI dataset.
The additional data that ISTI has gathered is most welcome, and this analysis shows that the warming trend in both hemispheres was not due to choosing a particular set of stations for GHCN-M. The much more comprehensive station network of ISTI shows the same trends.
![]() |
| Location of all stations in the recommended version of the Stage Three component of the databank. The color corresponds to the number of years of data available for each station. |
![]() |
| Station count of the recommended merge by year from 1850-2010. Databank stations in red compared to GHCN-M, version 3 in black. |
Temperature
is one of the main quantities measured in meteorology and plays a key
role in weather forecasts and climate determination. The instrumental
temperature recordings now spans well over a century, with some records
extending back to the 17th century, and represents an invaluable tool in
evaluating historic climatic trends. However, ensuring the quality of
the data records is challenging, with issues arising from the wide range
of sensors used, how the sensors were calibrated, and how the data was
recorded and written down. In particular, the very definition of the
temperature scales have evolved. While they have always been based on
calibration of instruments via a series of material phase transitions
(fixed points), the evolution of sensors, measuring techniques and
revisions of the fixed points used has introduced differences that may
lead to difficulties when studying historic temperature records. The
conversion program here presented deals with this issue for 20th century
data by implementing a proposed mathematical model to allow the
conversion from historical scales to the currently adopted International
Temperature Scale of 1990 (ITS-90). This program can convert large
files of historical records to the current international temperature
scale, a feature which is intended to help in the harmonisation
processes of long historic series. This work is part of the project
“MeteoMet” funded by the EURAMET, the European association of National
Institutes of Metrology, and is part of a major general effort in
identifying the several sources of uncertainty in climate and
meteorological records.
Assuming that calibration procedures immediately spread throughout the world – homogenisation algorithms might conceivably see adjustments in 1968, with smaller adjustments in 1990.If undetected, the effect would be to create a bias in the temperature record. This is difficult to calculate since the bias is temperature dependent, but if the mean land-surface temperature is ~10°C and if temperature excursions are typically ±10 °C then one might expect that the effect to be that records prior to 1968 were systematically overestimated by about 0.005 °C, and records between 1968 and 1990 by about 0.003 °C.