
Using PPCA to Estimate EOFs in the Presence of Missing Values

One of the problems encountered when using satellite-derived sea surface temperature (SST) data is the impossibility of retrieving data where the ocean surface is obscured by cloud. Empirical orthogonal function (EOF) analysis cannot be carried out easily when there are missing values within the dat...

Bibliographic Details
Published in: Journal of atmospheric and oceanic technology 2004-09, Vol.21 (9), p.1471-1480
Main Authors: Houseago-Stokes, Richenda E, Challenor, Peter G
Format: Article
Language: English
cited_by
cites cdi_FETCH-LOGICAL-c371t-c82e43de8697391610520660de1aefe873c098901d94df22b77bcde0bb7fe753
container_end_page 1480
container_issue 9
container_start_page 1471
container_title Journal of atmospheric and oceanic technology
container_volume 21
creator Houseago-Stokes, Richenda E
Challenor, Peter G
description One of the problems encountered when using satellite-derived sea surface temperature (SST) data is the impossibility of retrieving data where the ocean surface is obscured by cloud. Empirical orthogonal function (EOF) analysis cannot be carried out easily when there are missing values within the dataset. One possible solution is to interpolate using the existing data. In this paper an alternative technique is investigated, probabilistic principal component analysis (PPCA), and applied to calculate the principal EOFs of North Atlantic SSTs. This analysis uses results obtained from interpolating the SST data using a simplified Kalman filter, with data randomly removed to simulate missing values, and then reconstructs the data using PPCA, obtaining the principal EOFs. The calculation of the EOFs was quicker than traditional EOF analysis, as the covariance matrix was estimated rather than calculated. The replacement of missing values was also computationally more efficient than using the Kalman filter, taking a fraction of the time. The expectation-maximization (EM) algorithm produced similar results to those produced through standard procedures. However, the choice of the number of EOFs to be retained had a significant effect on the accuracy of the interpolated dataset, with more EOFs reducing the accuracy of the reconstructed dataset.
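To make the idea in the abstract concrete, here is a minimal Python sketch of estimating EOFs when some values are missing. It is not the authors' implementation and it is not full PPCA: it uses a simplified EM-style iteration (repeated low-rank SVD reconstruction, which corresponds to PPCA only in the zero-noise limit). The function name `em_pca_impute`, its parameters, and the (time, space) array layout with NaN marking gaps are all assumptions made for illustration.

```python
import numpy as np


def em_pca_impute(X, n_eofs, n_iter=200, tol=1e-8):
    """Fill NaN gaps in X (time x space) and return the leading EOFs.

    Simplified EM-style scheme (hypothetical helper, not from the paper):
    alternately fit a rank-n_eofs model to the filled data and overwrite
    only the missing entries with the model reconstruction.
    Assumes every column has at least one observed value.
    """
    X = np.asarray(X, dtype=float)
    missing = np.isnan(X)

    # Start by filling each gap with the mean of the observed values
    # in the same column (i.e. at the same spatial point).
    filled = np.where(missing, np.nanmean(X, axis=0), X)

    for _ in range(n_iter):
        # Fit: EOFs are the right singular vectors of the centred data.
        mean = filled.mean(axis=0)
        U, s, Vt = np.linalg.svd(filled - mean, full_matrices=False)

        # Reconstruct from the leading n_eofs modes only.
        recon = (U[:, :n_eofs] * s[:n_eofs]) @ Vt[:n_eofs] + mean

        # Update: replace missing entries, keep observed ones untouched.
        new_filled = np.where(missing, recon, X)
        change = np.max(np.abs(new_filled - filled))
        filled = new_filled
        if change < tol:
            break

    return filled, Vt[:n_eofs]


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    # Synthetic rank-2 field standing in for SST data (illustrative only).
    truth = rng.normal(size=(50, 2)) @ rng.normal(size=(2, 20))
    gaps = rng.random(truth.shape) < 0.1  # remove about 10% at random
    data = np.where(gaps, np.nan, truth)

    filled, eofs = em_pca_impute(data, n_eofs=2)
    print("mean abs error at gaps:", np.abs(filled - truth)[gaps].mean())
```

The `n_eofs` argument plays the role of the number of retained EOFs, which the abstract reports as having a significant effect on the accuracy of the reconstructed dataset; in this sketch it has to be chosen by the caller.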
doi_str_mv 10.1175/1520-0426(2004)021<1471:UPTEEI>2.0.CO;2
format article
publisher Boston: American Meteorological Society
rights Copyright American Meteorological Society Sep 2004
fulltext fulltext
identifier ISSN: 0739-0572
ispartof Journal of atmospheric and oceanic technology, 2004-09, Vol.21 (9), p.1471-1480
issn 0739-0572
1520-0426
language eng
recordid cdi_proquest_miscellaneous_36228717
source Freely Accessible Science Journals
subjects Analysis
Oceanography
Oceans
Principal components analysis
Satellite systems
Sea surface temperature
Surface water
Temperature
title Using PPCA to Estimate EOFs in the Presence of Missing Values
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T06%3A51%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Using%20PPCA%20to%20Estimate%20EOFs%20in%20the%20Presence%20of%20Missing%20Values&rft.jtitle=Journal%20of%20atmospheric%20and%20oceanic%20technology&rft.au=Houseago-Stokes,%20Richenda%20E&rft.date=2004-09-01&rft.volume=21&rft.issue=9&rft.spage=1471&rft.epage=1480&rft.pages=1471-1480&rft.issn=0739-0572&rft.eissn=1520-0426&rft_id=info:doi/10.1175/1520-0426(2004)021%3C1471:UPTEEI%3E2.0.CO;2&rft_dat=%3Cproquest_cross%3E21189175%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c371t-c82e43de8697391610520660de1aefe873c098901d94df22b77bcde0bb7fe753%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=222525447&rft_id=info:pmid/&rfr_iscdi=true