Loading…

Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms

In this review, we compare methods for temporal sequence learning (TSL) across the disciplines machine-control, classical conditioning, neuronal models for TSL as well as spike-timing-dependent plasticity (STDP). This review introduces the most influential models and focuses on two questions: To wha...

Full description

Saved in:
Bibliographic Details
Published in:Neural computation 2005-02, Vol.17 (2), p.245-319
Main Authors: Wörgötter, Florentin, Porr, Bernd
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
cited_by cdi_FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3
cites cdi_FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3
container_end_page 319
container_issue 2
container_start_page 245
container_title Neural computation
container_volume 17
creator Wörgötter, Florentin
Porr, Bernd
description In this review, we compare methods for temporal sequence learning (TSL) across the disciplines machine-control, classical conditioning, neuronal models for TSL as well as spike-timing-dependent plasticity (STDP). This review introduces the most influential models and focuses on two questions: To what degree are reward-based (e.g., TD learning) and correlation-based (Hebbian) learning related? and How do the different models correspond to possibly underlying biological mechanisms of synaptic plasticity? We first compare the different models in an open-loop condition, where behavioral feedback does not alter the learning. Here we observe that reward-based and correlation-based learning are indeed very similar. Machine control is then used to introduce the problem of closed-loop control (e.g., actor-critic architectures). Here the problem of evaluative (rewards) versus nonevaluative (correlations) feedback from the environment will be discussed, showing that both learning approaches are fundamentally different in the closed-loop condition. In trying to answer the second question, we compare neuronal versions of the different learning architectures to the anatomy of the involved brain structures (basal-ganglia, thalamus, and cortex) and the molecular biophysics of glutamatergic and dopaminergic synapses. Finally, we discuss the different algorithms used to model STDP and compare them to reward-based learning rules. Certain similarities are found in spite of the strongly different timescales. Here we focus on the biophysics of the different calcium-release mechanisms known to be involved in STDP.
doi_str_mv 10.1162/0899766053011555
format article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_211232171</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>793936591</sourcerecordid><originalsourceid>FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3</originalsourceid><addsrcrecordid>eNqFkc1vEzEQxS0EoqFw54QsJDhlwbPrj11uJZSClAoEQeK2cuxx62rXDvYGxKH_O04TqagS4mTL83sz8_wIeQrsFYCsX7O265SUTDQMQAhxj8yg3Ku2bb_fJ7NduSp1dUQe5XzFGJPAxENyBELVTCk2I9crHDcx6YF-xR9bDAbpEnUKPlzM6eeE1pvJxzCnOli6iGFKcXhDT-gX_OnxF42OvvPOYcIw0fNoccg35OoSfSrQoHdqOkX61schXnhTJp2judTB5zE_Jg-cHjI-OZzH5Nv709XiQ7X8dPZxcbKsDJd8qrjj2jq2FmuQhtWdg64RWiPX3HStsGis0LI165q1DkCVh0Za4NzWVrQMm2Pyct93k2Jxmad-9NngMOiAcZt7qTjnrFX_BUEpqbqaF_D5HfAqblMoJvoaoG5qUFAgtodMijkndP0m-VGn3z2wfhdgfzfAInl26Ltdj2hvBYfECvDiAOhcPtMlHYzPt5wUUnC5czLfc6P_a7d_zv0D6UuvDA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>211232171</pqid></control><display><type>article</type><title>Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms</title><source>MIT Press Journals</source><creator>Wörgötter, Florentin ; Porr, Bernd</creator><creatorcontrib>Wörgötter, Florentin ; Porr, Bernd</creatorcontrib><description>In this review, we compare methods for temporal sequence learning (TSL) across the disciplines machine-control, classical conditioning, neuronal models for TSL as well as spike-timing-dependent plasticity (STDP). This review introduces the most influential models and focuses on two questions: To what degree are reward-based (e.g., TD learning) and correlation-based (Hebbian) learning related? and How do the different models correspond to possibly underlying biological mechanisms of synaptic plasticity? We first compare the different models in an open-loop condition, where behavioral feedback does not alter the learning. Here we observe that reward-based and correlation-based learning are indeed very similar. Machine control is then used to introduce the problem of closed-loop control (e.g., actor-critic architectures). Here the problem of evaluative (rewards) versus nonevaluative (correlations) feedback from the environment will be discussed, showing that both learning approaches are fundamentally different in the closed-loop condition. In trying to answer the second question, we compare neuronal versions of the different learning architectures to the anatomy of the involved brain structures (basal-ganglia, thalamus, and cortex) and the molecular biophysics of glutamatergic and dopaminergic synapses. Finally, we discuss the different algorithms used to model STDP and compare them to reward-based learning rules. Certain similarities are found in spite of the strongly different timescales. Here we focus on the biophysics of the different calcium-release mechanisms known to be involved in STDP.</description><identifier>ISSN: 0899-7667</identifier><identifier>EISSN: 1530-888X</identifier><identifier>DOI: 10.1162/0899766053011555</identifier><identifier>PMID: 15720770</identifier><identifier>CODEN: NEUCEB</identifier><language>eng</language><publisher>One Rogers Street, Cambridge, MA 02142-1209, USA: MIT Press</publisher><subject>Algorithms ; Applied sciences ; Artificial intelligence ; Biological and medical sciences ; Brain ; Computer science; control theory; systems ; Exact sciences and technology ; Forecasting ; Fundamental and applied biological sciences. Psychology ; General aspects ; Learning ; Learning and adaptive systems ; Mathematics ; Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) ; Neural networks ; Neural Networks (Computer) ; Neurology ; Probability and statistics ; Probability theory and stochastic processes ; Review ; Sciences and techniques of general use ; Special processes (renewal theory, markov renewal processes, semi-markov processes, statistical mechanics type models, applications) ; Time Factors</subject><ispartof>Neural computation, 2005-02, Vol.17 (2), p.245-319</ispartof><rights>2005 INIST-CNRS</rights><rights>Copyright MIT Press Journals Feb 2005</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3</citedby><cites>FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://direct.mit.edu/neco/article/doi/10.1162/0899766053011555$$EHTML$$P50$$Gmit$$H</linktohtml><link.rule.ids>314,776,780,27903,27904,53987,53988</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=16565467$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/15720770$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Wörgötter, Florentin</creatorcontrib><creatorcontrib>Porr, Bernd</creatorcontrib><title>Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms</title><title>Neural computation</title><addtitle>Neural Comput</addtitle><description>In this review, we compare methods for temporal sequence learning (TSL) across the disciplines machine-control, classical conditioning, neuronal models for TSL as well as spike-timing-dependent plasticity (STDP). This review introduces the most influential models and focuses on two questions: To what degree are reward-based (e.g., TD learning) and correlation-based (Hebbian) learning related? and How do the different models correspond to possibly underlying biological mechanisms of synaptic plasticity? We first compare the different models in an open-loop condition, where behavioral feedback does not alter the learning. Here we observe that reward-based and correlation-based learning are indeed very similar. Machine control is then used to introduce the problem of closed-loop control (e.g., actor-critic architectures). Here the problem of evaluative (rewards) versus nonevaluative (correlations) feedback from the environment will be discussed, showing that both learning approaches are fundamentally different in the closed-loop condition. In trying to answer the second question, we compare neuronal versions of the different learning architectures to the anatomy of the involved brain structures (basal-ganglia, thalamus, and cortex) and the molecular biophysics of glutamatergic and dopaminergic synapses. Finally, we discuss the different algorithms used to model STDP and compare them to reward-based learning rules. Certain similarities are found in spite of the strongly different timescales. Here we focus on the biophysics of the different calcium-release mechanisms known to be involved in STDP.</description><subject>Algorithms</subject><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>Biological and medical sciences</subject><subject>Brain</subject><subject>Computer science; control theory; systems</subject><subject>Exact sciences and technology</subject><subject>Forecasting</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>General aspects</subject><subject>Learning</subject><subject>Learning and adaptive systems</subject><subject>Mathematics</subject><subject>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</subject><subject>Neural networks</subject><subject>Neural Networks (Computer)</subject><subject>Neurology</subject><subject>Probability and statistics</subject><subject>Probability theory and stochastic processes</subject><subject>Review</subject><subject>Sciences and techniques of general use</subject><subject>Special processes (renewal theory, markov renewal processes, semi-markov processes, statistical mechanics type models, applications)</subject><subject>Time Factors</subject><issn>0899-7667</issn><issn>1530-888X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2005</creationdate><recordtype>article</recordtype><recordid>eNqFkc1vEzEQxS0EoqFw54QsJDhlwbPrj11uJZSClAoEQeK2cuxx62rXDvYGxKH_O04TqagS4mTL83sz8_wIeQrsFYCsX7O265SUTDQMQAhxj8yg3Ku2bb_fJ7NduSp1dUQe5XzFGJPAxENyBELVTCk2I9crHDcx6YF-xR9bDAbpEnUKPlzM6eeE1pvJxzCnOli6iGFKcXhDT-gX_OnxF42OvvPOYcIw0fNoccg35OoSfSrQoHdqOkX61schXnhTJp2judTB5zE_Jg-cHjI-OZzH5Nv709XiQ7X8dPZxcbKsDJd8qrjj2jq2FmuQhtWdg64RWiPX3HStsGis0LI165q1DkCVh0Za4NzWVrQMm2Pyct93k2Jxmad-9NngMOiAcZt7qTjnrFX_BUEpqbqaF_D5HfAqblMoJvoaoG5qUFAgtodMijkndP0m-VGn3z2wfhdgfzfAInl26Ltdj2hvBYfECvDiAOhcPtMlHYzPt5wUUnC5czLfc6P_a7d_zv0D6UuvDA</recordid><startdate>20050201</startdate><enddate>20050201</enddate><creator>Wörgötter, Florentin</creator><creator>Porr, Bernd</creator><general>MIT Press</general><general>MIT Press Journals, The</general><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7QO</scope><scope>7TK</scope><scope>FR3</scope><scope>P64</scope><scope>7X8</scope></search><sort><creationdate>20050201</creationdate><title>Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms</title><author>Wörgötter, Florentin ; Porr, Bernd</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2005</creationdate><topic>Algorithms</topic><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>Biological and medical sciences</topic><topic>Brain</topic><topic>Computer science; control theory; systems</topic><topic>Exact sciences and technology</topic><topic>Forecasting</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>General aspects</topic><topic>Learning</topic><topic>Learning and adaptive systems</topic><topic>Mathematics</topic><topic>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</topic><topic>Neural networks</topic><topic>Neural Networks (Computer)</topic><topic>Neurology</topic><topic>Probability and statistics</topic><topic>Probability theory and stochastic processes</topic><topic>Review</topic><topic>Sciences and techniques of general use</topic><topic>Special processes (renewal theory, markov renewal processes, semi-markov processes, statistical mechanics type models, applications)</topic><topic>Time Factors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wörgötter, Florentin</creatorcontrib><creatorcontrib>Porr, Bernd</creatorcontrib><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Biotechnology Research Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Neural computation</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wörgötter, Florentin</au><au>Porr, Bernd</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms</atitle><jtitle>Neural computation</jtitle><addtitle>Neural Comput</addtitle><date>2005-02-01</date><risdate>2005</risdate><volume>17</volume><issue>2</issue><spage>245</spage><epage>319</epage><pages>245-319</pages><issn>0899-7667</issn><eissn>1530-888X</eissn><coden>NEUCEB</coden><abstract>In this review, we compare methods for temporal sequence learning (TSL) across the disciplines machine-control, classical conditioning, neuronal models for TSL as well as spike-timing-dependent plasticity (STDP). This review introduces the most influential models and focuses on two questions: To what degree are reward-based (e.g., TD learning) and correlation-based (Hebbian) learning related? and How do the different models correspond to possibly underlying biological mechanisms of synaptic plasticity? We first compare the different models in an open-loop condition, where behavioral feedback does not alter the learning. Here we observe that reward-based and correlation-based learning are indeed very similar. Machine control is then used to introduce the problem of closed-loop control (e.g., actor-critic architectures). Here the problem of evaluative (rewards) versus nonevaluative (correlations) feedback from the environment will be discussed, showing that both learning approaches are fundamentally different in the closed-loop condition. In trying to answer the second question, we compare neuronal versions of the different learning architectures to the anatomy of the involved brain structures (basal-ganglia, thalamus, and cortex) and the molecular biophysics of glutamatergic and dopaminergic synapses. Finally, we discuss the different algorithms used to model STDP and compare them to reward-based learning rules. Certain similarities are found in spite of the strongly different timescales. Here we focus on the biophysics of the different calcium-release mechanisms known to be involved in STDP.</abstract><cop>One Rogers Street, Cambridge, MA 02142-1209, USA</cop><pub>MIT Press</pub><pmid>15720770</pmid><doi>10.1162/0899766053011555</doi><tpages>75</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0899-7667
ispartof Neural computation, 2005-02, Vol.17 (2), p.245-319
issn 0899-7667
1530-888X
language eng
recordid cdi_proquest_journals_211232171
source MIT Press Journals
subjects Algorithms
Applied sciences
Artificial intelligence
Biological and medical sciences
Brain
Computer science
control theory
systems
Exact sciences and technology
Forecasting
Fundamental and applied biological sciences. Psychology
General aspects
Learning
Learning and adaptive systems
Mathematics
Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)
Neural networks
Neural Networks (Computer)
Neurology
Probability and statistics
Probability theory and stochastic processes
Review
Sciences and techniques of general use
Special processes (renewal theory, markov renewal processes, semi-markov processes, statistical mechanics type models, applications)
Time Factors
title Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms
url http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T10%3A45%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Temporal%20Sequence%20Learning,%20Prediction,%20and%20Control:%20A%20Review%20of%20Different%20Models%20and%20Their%20Relation%20to%20Biological%20Mechanisms&rft.jtitle=Neural%20computation&rft.au=W%C3%B6rg%C3%B6tter,%20Florentin&rft.date=2005-02-01&rft.volume=17&rft.issue=2&rft.spage=245&rft.epage=319&rft.pages=245-319&rft.issn=0899-7667&rft.eissn=1530-888X&rft.coden=NEUCEB&rft_id=info:doi/10.1162/0899766053011555&rft_dat=%3Cproquest_cross%3E793936591%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=211232171&rft_id=info:pmid/15720770&rfr_iscdi=true