Pascal and Francis Bibliographic Databases

Help

Export

Selection :

Permanent link
http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=28775786

Abstraction from demonstration for efficient reinforcement learning in high-dimensional domains

Author
COBO, Luis C1 ; SUBRAMANIAN, Kaushik2 ; ISBELL, Charles L2 ; LANTERMAN, Aaron D1 ; THOMAZ, Andrea L2
[1] School of Electrical and Computer Engineering, Georgia Tech, Atlanta, GA, 30332, United States
[2] College of Computing, Georgia Tech, Atlanta, GA, 30332, United States
Source

Artificial intelligence (General ed.). 2014, Vol 216, pp 103-128, 26 p ; ref : 69 ref

CODEN
AINTBB
ISSN
0004-3702
Scientific domain
Cognition; Computer science
Publisher
Elsevier, Oxford
Publication country
United Kingdom
Document type
Article
Language
English
Author keyword
Dimensionality reduction Function approximation Learning from demonstration Reinforcement learning
Keyword (fr)
Abstraction Algorithme approximation Analyse n dimensionnelle Apprentissage renforcé Décision séquentielle Gestion tâche Imitation Méthode espace état Opérateur humain Prise de décision Réduction dimension Utilité attendue Approximation d'une fonction Jeu ordinateur
Keyword (en)
Abstraction Approximation algorithm Multidimensional analysis Reinforcement learning Sequential decision Task scheduling Imitation State space method Human operator Decision making Dimension reduction Expected utility Function approximation Computer games
Keyword (es)
Abstracción Algoritmo aproximación Análisis n dimensional Aprendizaje reforzado Decisión secuencial Gestión labor Imitación Método espacio estado Operador humano Toma decision Reducción dimensión Utilidad espera Aproximación de funciones Juegos de computadora
Classification
Pascal
001 Exact sciences and technology / 001D Applied sciences / 001D02 Computer science; control theory; systems / 001D02A Theoretical computing / 001D02A05 Algorithmics. Computability. Computer arithmetics

Pascal
001 Exact sciences and technology / 001D Applied sciences / 001D02 Computer science; control theory; systems / 001D02B Software / 001D02B04 Computer systems and distributed systems. User interface

Pascal
001 Exact sciences and technology / 001D Applied sciences / 001D02 Computer science; control theory; systems / 001D02B Software / 001D02B07 Memory organisation. Data processing / 001D02B07B Data processing. List processing. Character string processing

Pascal
001 Exact sciences and technology / 001D Applied sciences / 001D02 Computer science; control theory; systems / 001D02C Artificial intelligence / 001D02C02 Learning and adaptive systems

Discipline
Computer science : theoretical automation and systems
Origin
Inist-CNRS
Database
PASCAL
INIST identifier
28775786

Sauf mention contraire ci-dessus, le contenu de cette notice bibliographique peut être utilisé dans le cadre d’une licence CC BY 4.0 Inist-CNRS / Unless otherwise stated above, the content of this bibliographic record may be used under a CC BY 4.0 licence by Inist-CNRS / A menos que se haya señalado antes, el contenido de este registro bibliográfico puede ser utilizado al amparo de una licencia CC BY 4.0 Inist-CNRS

Access to the document

Searching the Web