Loading…
Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms
In this review, we compare methods for temporal sequence learning (TSL) across the disciplines machine-control, classical conditioning, neuronal models for TSL as well as spike-timing-dependent plasticity (STDP). This review introduces the most influential models and focuses on two questions: To wha...
Saved in:
Published in: | Neural computation 2005-02, Vol.17 (2), p.245-319 |
---|---|
Main Authors: | , |
Format: | Article |
Language: | English |
Subjects: | |
Citations: | Items that this one cites Items that cite this one |
Online Access: | Get full text |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
cited_by | cdi_FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3 |
---|---|
cites | cdi_FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3 |
container_end_page | 319 |
container_issue | 2 |
container_start_page | 245 |
container_title | Neural computation |
container_volume | 17 |
creator | Wörgötter, Florentin Porr, Bernd |
description | In this review, we compare methods for temporal sequence learning (TSL) across the disciplines machine-control, classical conditioning, neuronal models for TSL as well as spike-timing-dependent plasticity (STDP). This review introduces the most influential models and focuses on two questions: To what degree are reward-based (e.g., TD learning) and correlation-based (Hebbian) learning related? and How do the different models correspond to possibly underlying biological mechanisms of synaptic plasticity? We first compare the different models in an open-loop condition, where behavioral feedback does not alter the learning. Here we observe that reward-based and correlation-based learning are indeed very similar. Machine control is then used to introduce the problem of closed-loop control (e.g., actor-critic architectures). Here the problem of evaluative (rewards) versus nonevaluative (correlations) feedback from the environment will be discussed, showing that both learning approaches are fundamentally different in the closed-loop condition. In trying to answer the second question, we compare neuronal versions of the different learning architectures to the anatomy of the involved brain structures (basal-ganglia, thalamus, and cortex) and the molecular biophysics of glutamatergic and dopaminergic synapses. Finally, we discuss the different algorithms used to model STDP and compare them to reward-based learning rules. Certain similarities are found in spite of the strongly different timescales. Here we focus on the biophysics of the different calcium-release mechanisms known to be involved in STDP. |
doi_str_mv | 10.1162/0899766053011555 |
format | article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_211232171</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>793936591</sourcerecordid><originalsourceid>FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3</originalsourceid><addsrcrecordid>eNqFkc1vEzEQxS0EoqFw54QsJDhlwbPrj11uJZSClAoEQeK2cuxx62rXDvYGxKH_O04TqagS4mTL83sz8_wIeQrsFYCsX7O265SUTDQMQAhxj8yg3Ku2bb_fJ7NduSp1dUQe5XzFGJPAxENyBELVTCk2I9crHDcx6YF-xR9bDAbpEnUKPlzM6eeE1pvJxzCnOli6iGFKcXhDT-gX_OnxF42OvvPOYcIw0fNoccg35OoSfSrQoHdqOkX61schXnhTJp2judTB5zE_Jg-cHjI-OZzH5Nv709XiQ7X8dPZxcbKsDJd8qrjj2jq2FmuQhtWdg64RWiPX3HStsGis0LI165q1DkCVh0Za4NzWVrQMm2Pyct93k2Jxmad-9NngMOiAcZt7qTjnrFX_BUEpqbqaF_D5HfAqblMoJvoaoG5qUFAgtodMijkndP0m-VGn3z2wfhdgfzfAInl26Ltdj2hvBYfECvDiAOhcPtMlHYzPt5wUUnC5czLfc6P_a7d_zv0D6UuvDA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>211232171</pqid></control><display><type>article</type><title>Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms</title><source>MIT Press Journals</source><creator>Wörgötter, Florentin ; Porr, Bernd</creator><creatorcontrib>Wörgötter, Florentin ; Porr, Bernd</creatorcontrib><description>In this review, we compare methods for temporal sequence learning (TSL) across the disciplines machine-control, classical conditioning, neuronal models for TSL as well as spike-timing-dependent plasticity (STDP). This review introduces the most influential models and focuses on two questions: To what degree are reward-based (e.g., TD learning) and correlation-based (Hebbian) learning related? and How do the different models correspond to possibly underlying biological mechanisms of synaptic plasticity? We first compare the different models in an open-loop condition, where behavioral feedback does not alter the learning. Here we observe that reward-based and correlation-based learning are indeed very similar. Machine control is then used to introduce the problem of closed-loop control (e.g., actor-critic architectures). Here the problem of evaluative (rewards) versus nonevaluative (correlations) feedback from the environment will be discussed, showing that both learning approaches are fundamentally different in the closed-loop condition. In trying to answer the second question, we compare neuronal versions of the different learning architectures to the anatomy of the involved brain structures (basal-ganglia, thalamus, and cortex) and the molecular biophysics of glutamatergic and dopaminergic synapses. Finally, we discuss the different algorithms used to model STDP and compare them to reward-based learning rules. Certain similarities are found in spite of the strongly different timescales. Here we focus on the biophysics of the different calcium-release mechanisms known to be involved in STDP.</description><identifier>ISSN: 0899-7667</identifier><identifier>EISSN: 1530-888X</identifier><identifier>DOI: 10.1162/0899766053011555</identifier><identifier>PMID: 15720770</identifier><identifier>CODEN: NEUCEB</identifier><language>eng</language><publisher>One Rogers Street, Cambridge, MA 02142-1209, USA: MIT Press</publisher><subject>Algorithms ; Applied sciences ; Artificial intelligence ; Biological and medical sciences ; Brain ; Computer science; control theory; systems ; Exact sciences and technology ; Forecasting ; Fundamental and applied biological sciences. Psychology ; General aspects ; Learning ; Learning and adaptive systems ; Mathematics ; Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) ; Neural networks ; Neural Networks (Computer) ; Neurology ; Probability and statistics ; Probability theory and stochastic processes ; Review ; Sciences and techniques of general use ; Special processes (renewal theory, markov renewal processes, semi-markov processes, statistical mechanics type models, applications) ; Time Factors</subject><ispartof>Neural computation, 2005-02, Vol.17 (2), p.245-319</ispartof><rights>2005 INIST-CNRS</rights><rights>Copyright MIT Press Journals Feb 2005</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3</citedby><cites>FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://direct.mit.edu/neco/article/doi/10.1162/0899766053011555$$EHTML$$P50$$Gmit$$H</linktohtml><link.rule.ids>314,776,780,27903,27904,53987,53988</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=16565467$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/15720770$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Wörgötter, Florentin</creatorcontrib><creatorcontrib>Porr, Bernd</creatorcontrib><title>Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms</title><title>Neural computation</title><addtitle>Neural Comput</addtitle><description>In this review, we compare methods for temporal sequence learning (TSL) across the disciplines machine-control, classical conditioning, neuronal models for TSL as well as spike-timing-dependent plasticity (STDP). This review introduces the most influential models and focuses on two questions: To what degree are reward-based (e.g., TD learning) and correlation-based (Hebbian) learning related? and How do the different models correspond to possibly underlying biological mechanisms of synaptic plasticity? We first compare the different models in an open-loop condition, where behavioral feedback does not alter the learning. Here we observe that reward-based and correlation-based learning are indeed very similar. Machine control is then used to introduce the problem of closed-loop control (e.g., actor-critic architectures). Here the problem of evaluative (rewards) versus nonevaluative (correlations) feedback from the environment will be discussed, showing that both learning approaches are fundamentally different in the closed-loop condition. In trying to answer the second question, we compare neuronal versions of the different learning architectures to the anatomy of the involved brain structures (basal-ganglia, thalamus, and cortex) and the molecular biophysics of glutamatergic and dopaminergic synapses. Finally, we discuss the different algorithms used to model STDP and compare them to reward-based learning rules. Certain similarities are found in spite of the strongly different timescales. Here we focus on the biophysics of the different calcium-release mechanisms known to be involved in STDP.</description><subject>Algorithms</subject><subject>Applied sciences</subject><subject>Artificial intelligence</subject><subject>Biological and medical sciences</subject><subject>Brain</subject><subject>Computer science; control theory; systems</subject><subject>Exact sciences and technology</subject><subject>Forecasting</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>General aspects</subject><subject>Learning</subject><subject>Learning and adaptive systems</subject><subject>Mathematics</subject><subject>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</subject><subject>Neural networks</subject><subject>Neural Networks (Computer)</subject><subject>Neurology</subject><subject>Probability and statistics</subject><subject>Probability theory and stochastic processes</subject><subject>Review</subject><subject>Sciences and techniques of general use</subject><subject>Special processes (renewal theory, markov renewal processes, semi-markov processes, statistical mechanics type models, applications)</subject><subject>Time Factors</subject><issn>0899-7667</issn><issn>1530-888X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2005</creationdate><recordtype>article</recordtype><recordid>eNqFkc1vEzEQxS0EoqFw54QsJDhlwbPrj11uJZSClAoEQeK2cuxx62rXDvYGxKH_O04TqagS4mTL83sz8_wIeQrsFYCsX7O265SUTDQMQAhxj8yg3Ku2bb_fJ7NduSp1dUQe5XzFGJPAxENyBELVTCk2I9crHDcx6YF-xR9bDAbpEnUKPlzM6eeE1pvJxzCnOli6iGFKcXhDT-gX_OnxF42OvvPOYcIw0fNoccg35OoSfSrQoHdqOkX61schXnhTJp2judTB5zE_Jg-cHjI-OZzH5Nv709XiQ7X8dPZxcbKsDJd8qrjj2jq2FmuQhtWdg64RWiPX3HStsGis0LI165q1DkCVh0Za4NzWVrQMm2Pyct93k2Jxmad-9NngMOiAcZt7qTjnrFX_BUEpqbqaF_D5HfAqblMoJvoaoG5qUFAgtodMijkndP0m-VGn3z2wfhdgfzfAInl26Ltdj2hvBYfECvDiAOhcPtMlHYzPt5wUUnC5czLfc6P_a7d_zv0D6UuvDA</recordid><startdate>20050201</startdate><enddate>20050201</enddate><creator>Wörgötter, Florentin</creator><creator>Porr, Bernd</creator><general>MIT Press</general><general>MIT Press Journals, The</general><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7QO</scope><scope>7TK</scope><scope>FR3</scope><scope>P64</scope><scope>7X8</scope></search><sort><creationdate>20050201</creationdate><title>Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms</title><author>Wörgötter, Florentin ; Porr, Bernd</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2005</creationdate><topic>Algorithms</topic><topic>Applied sciences</topic><topic>Artificial intelligence</topic><topic>Biological and medical sciences</topic><topic>Brain</topic><topic>Computer science; control theory; systems</topic><topic>Exact sciences and technology</topic><topic>Forecasting</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>General aspects</topic><topic>Learning</topic><topic>Learning and adaptive systems</topic><topic>Mathematics</topic><topic>Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects)</topic><topic>Neural networks</topic><topic>Neural Networks (Computer)</topic><topic>Neurology</topic><topic>Probability and statistics</topic><topic>Probability theory and stochastic processes</topic><topic>Review</topic><topic>Sciences and techniques of general use</topic><topic>Special processes (renewal theory, markov renewal processes, semi-markov processes, statistical mechanics type models, applications)</topic><topic>Time Factors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wörgötter, Florentin</creatorcontrib><creatorcontrib>Porr, Bernd</creatorcontrib><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Biotechnology Research Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Neural computation</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wörgötter, Florentin</au><au>Porr, Bernd</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms</atitle><jtitle>Neural computation</jtitle><addtitle>Neural Comput</addtitle><date>2005-02-01</date><risdate>2005</risdate><volume>17</volume><issue>2</issue><spage>245</spage><epage>319</epage><pages>245-319</pages><issn>0899-7667</issn><eissn>1530-888X</eissn><coden>NEUCEB</coden><abstract>In this review, we compare methods for temporal sequence learning (TSL) across the disciplines machine-control, classical conditioning, neuronal models for TSL as well as spike-timing-dependent plasticity (STDP). This review introduces the most influential models and focuses on two questions: To what degree are reward-based (e.g., TD learning) and correlation-based (Hebbian) learning related? and How do the different models correspond to possibly underlying biological mechanisms of synaptic plasticity? We first compare the different models in an open-loop condition, where behavioral feedback does not alter the learning. Here we observe that reward-based and correlation-based learning are indeed very similar. Machine control is then used to introduce the problem of closed-loop control (e.g., actor-critic architectures). Here the problem of evaluative (rewards) versus nonevaluative (correlations) feedback from the environment will be discussed, showing that both learning approaches are fundamentally different in the closed-loop condition. In trying to answer the second question, we compare neuronal versions of the different learning architectures to the anatomy of the involved brain structures (basal-ganglia, thalamus, and cortex) and the molecular biophysics of glutamatergic and dopaminergic synapses. Finally, we discuss the different algorithms used to model STDP and compare them to reward-based learning rules. Certain similarities are found in spite of the strongly different timescales. Here we focus on the biophysics of the different calcium-release mechanisms known to be involved in STDP.</abstract><cop>One Rogers Street, Cambridge, MA 02142-1209, USA</cop><pub>MIT Press</pub><pmid>15720770</pmid><doi>10.1162/0899766053011555</doi><tpages>75</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0899-7667 |
ispartof | Neural computation, 2005-02, Vol.17 (2), p.245-319 |
issn | 0899-7667 1530-888X |
language | eng |
recordid | cdi_proquest_journals_211232171 |
source | MIT Press Journals |
subjects | Algorithms Applied sciences Artificial intelligence Biological and medical sciences Brain Computer science control theory systems Exact sciences and technology Forecasting Fundamental and applied biological sciences. Psychology General aspects Learning Learning and adaptive systems Mathematics Mathematics in biology. Statistical analysis. Models. Metrology. Data processing in biology (general aspects) Neural networks Neural Networks (Computer) Neurology Probability and statistics Probability theory and stochastic processes Review Sciences and techniques of general use Special processes (renewal theory, markov renewal processes, semi-markov processes, statistical mechanics type models, applications) Time Factors |
title | Temporal Sequence Learning, Prediction, and Control: A Review of Different Models and Their Relation to Biological Mechanisms |
url | http://sfxeu10.hosted.exlibrisgroup.com/loughborough?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T10%3A45%3A26IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Temporal%20Sequence%20Learning,%20Prediction,%20and%20Control:%20A%20Review%20of%20Different%20Models%20and%20Their%20Relation%20to%20Biological%20Mechanisms&rft.jtitle=Neural%20computation&rft.au=W%C3%B6rg%C3%B6tter,%20Florentin&rft.date=2005-02-01&rft.volume=17&rft.issue=2&rft.spage=245&rft.epage=319&rft.pages=245-319&rft.issn=0899-7667&rft.eissn=1530-888X&rft.coden=NEUCEB&rft_id=info:doi/10.1162/0899766053011555&rft_dat=%3Cproquest_cross%3E793936591%3C/proquest_cross%3E%3Cgrp_id%3Ecdi_FETCH-LOGICAL-c464t-4f4adf0b5b16c029f1935aae4a4c985decd5a68cb208f117dec36d144d2d580e3%3C/grp_id%3E%3Coa%3E%3C/oa%3E%3Curl%3E%3C/url%3E&rft_id=info:oai/&rft_pqid=211232171&rft_id=info:pmid/15720770&rfr_iscdi=true |