FSM-Based Agents for Playing Super Mario Game


Presentation of the work "Evolutionary FSM-Based Agents for Playing Super Mario Game" at LION 2013 (LION 7), by Rosa Hidalgo, María Sandra Rodríguez, Antonio M. Mora, Pablo García, Juan Julián Merelo, and Antonio J. Fernández. Conference: http://www.intelligent-optimization.org/LION7/

Evolutionary FSM-Based
Agents for Playing
Super Mario Game
LION 2013

R.M. Hidalgo-Bermúdez, M.S. Rodríguez-Domingo, A.M. Mora,
P. García-Sánchez, J.J. Merelo, Antonio J. Fernández-Leiva

INDEX
1. INTRODUCTION
2. PRELIMINARY CONCEPTS AND BACKGROUND
3. MARIO AI: COMPETITION AND ENVIRONMENT
4. EVOLUTIONARY FSM-BASED AGENT
5. EXPERIMENTS AND RESULTS
6. CONCLUSIONS AND FUTURE WORK

INTRODUCTION
Why did we choose Mario Bros?

This game is known worldwide and remains very
popular today.
There is an open source clone of the game named
Infinite Mario Bros.

INTRODUCTION

The main goal of the game is to traverse side-scrolling
platform levels, avoiding enemies and obstacles
with the help of useful items.
We have developed an autonomous agent using
several approaches based on Finite State
Machines (FSMs) combined with Genetic
Algorithms (GAs).

PRELIMINARY CONCEPTS AND
BACKGROUND
GAs are based on the Darwinian model of evolution. A
standard GA procedure is:
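
The flowchart for this slide is not preserved in the transcript. As a rough sketch of the usual loop (initialise, evaluate, select, recombine, mutate, replace), the Python below evolves bit strings on a toy problem. The function name, the parameter values, and the choice of binary tournament, one-point crossover and bit-flip mutation are illustrative assumptions, not the authors' configuration.

```python
import random

def run_ga(fitness, genome_len, pop_size=20, generations=50,
           mutation_rate=0.05, seed=0):
    """Generic GA sketch: evaluate, select, recombine, mutate, replace."""
    rng = random.Random(seed)
    # 1. Initialise a random population of bit strings.
    pop = [[rng.randint(0, 1) for _ in range(genome_len)]
           for _ in range(pop_size)]
    best = max(pop, key=fitness)

    def pick(population):
        # 2. Selection: binary tournament between two random individuals.
        a, b = rng.sample(population, 2)
        return a if fitness(a) >= fitness(b) else b

    for _ in range(generations):
        children = []
        while len(children) < pop_size:
            p1, p2 = pick(pop), pick(pop)
            # 3. One-point crossover.
            cut = rng.randrange(1, genome_len)
            child = p1[:cut] + p2[cut:]
            # 4. Bit-flip mutation, applied gene by gene.
            child = [1 - g if rng.random() < mutation_rate else g
                     for g in child]
            children.append(child)
        # 5. Replacement: the children become the next generation.
        pop = children
        # Remember the best individual seen so far.
        best = max(pop + [best], key=fitness)
    return best

# Toy problem (an assumption): maximise the number of 1s in the genome.
best = run_ga(fitness=sum, genome_len=16)
print(sum(best), best)
```

In the work presented here, the fitness call would be replaced by one or more simulated games, which is what makes each evaluation expensive.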

PRELIMINARY CONCEPTS AND
BACKGROUND
A Finite State Machine (FSM) is a computational model that
represents a set of states and the transitions between them.
It can be drawn as a directed graph in which each node
corresponds to a state and each edge to a transition.
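
A minimal sketch of this idea in Python: the directed graph is stored as a dictionary that maps a (state, input) pair to the next state. The state and input names are hypothetical and chosen only for illustration; they are not the states used by the agent.

```python
# Nodes are states; each (state, input) key labels one outgoing edge.
# All names here are hypothetical examples.
TRANSITIONS = {
    ("run_right", "clear"): "run_right",
    ("run_right", "enemy_ahead"): "jump",
    ("jump", "clear"): "run_right",
    ("jump", "enemy_ahead"): "jump",
}

def step(state, observation):
    """Follow the edge labelled by `observation`; stay put if none exists."""
    return TRANSITIONS.get((state, observation), state)

def run(start, observations):
    """Return the sequence of states visited for a list of observations."""
    state = start
    trace = [state]
    for obs in observations:
        state = step(state, obs)
        trace.append(state)
    return trace

print(run("run_right", ["clear", "enemy_ahead", "clear"]))
# ['run_right', 'run_right', 'jump', 'run_right']
```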

MARIO AI: COMPETITION AND
ENVIRONMENT
Infinite Mario Bros is an open source modification of
Super Mario Bros. It includes extensive support for
implementing autonomous agents that control the Mario
character using AI techniques.

The agent follows the rules of the Mario AI
Championship. The tracks are:
• Gameplay
• Learning
• Turing Test
• Level Generation

MARIO AI: COMPETITION AND
ENVIRONMENT
The possible actions of the agent are:
• Go Right
• Go Left
• Jump
• Down
• Shoot/Run

Mario can be in three different modes:
• Fire
• Big
• Small
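
One way to picture the action set is as a group of buttons that can be pressed together, so that an agent's output at each tick is a combination of the five actions. The sketch below uses Python's IntFlag for that. The bit values and the button order are assumptions made for illustration and do not reproduce the simulator's own key layout.

```python
from enum import Enum, IntFlag

class Action(IntFlag):
    """The five actions from the slide; bit values are arbitrary."""
    NONE = 0
    RIGHT = 1
    LEFT = 2
    JUMP = 4
    DOWN = 8
    SHOOT_RUN = 16

class Mode(Enum):
    """The three modes Mario can be in."""
    SMALL = "small"
    BIG = "big"
    FIRE = "fire"

# Hypothetical button order used when converting to a list of booleans.
BUTTON_ORDER = [Action.RIGHT, Action.LEFT, Action.JUMP,
                Action.DOWN, Action.SHOOT_RUN]

def to_buttons(action):
    """Expand a combined action into one pressed/not-pressed flag per button."""
    return [bool(action & button) for button in BUTTON_ORDER]

print(to_buttons(Action.RIGHT | Action.JUMP))
# [True, False, True, False, False]
```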

MARIO AI: COMPETITION AND
ENVIRONMENT
Game data is presented as two 19x19
matrices. Each cell holds a value that
represents a type of obstacle in the first matrix
and a type of enemy in the second.
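
A small sketch of how an agent might query the two grids at each tick. It assumes that Mario sits in the middle cell of each 19x19 matrix and uses made-up cell codes; the real codes are those defined by the simulator, and the helper names are hypothetical.

```python
SIZE = 19
CENTER = SIZE // 2  # assumption: Mario occupies the middle cell

# Hypothetical cell codes, for illustration only.
EMPTY, BRICK, GOOMBA = 0, 1, 2

def empty_grid():
    """Build a SIZE x SIZE grid filled with EMPTY."""
    return [[EMPTY] * SIZE for _ in range(SIZE)]

def danger_ahead(obstacles, enemies, look=2):
    """True if any of the `look` cells to Mario's right is not empty."""
    cols = range(CENTER + 1, CENTER + 1 + look)
    return any(obstacles[CENTER][c] != EMPTY or enemies[CENTER][c] != EMPTY
               for c in cols)

obstacles, enemies = empty_grid(), empty_grid()
enemies[CENTER][CENTER + 2] = GOOMBA  # place an enemy two cells ahead
print(danger_ahead(obstacles, enemies))  # True
```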

MARIO AI: COMPETITION AND
ENVIRONMENT
Obstacles and useful items:

MARIO AI: COMPETITION AND
ENVIRONMENT
Enemies:

EVOLUTIONARY FSM-BASED AGENT

Possible states

EVOLUTIONARY FSM-BASED AGENT
Chromosome (modelling a FSM)
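
The figure showing the chromosome is not preserved in this transcript. Purely as an illustration of how an FSM can be flattened into a chromosome, the sketch below stores the transition table row by row in one list of integers. The sizes and the layout are assumptions and may well differ from the encoding the authors used.

```python
N_STATES = 4   # hypothetical number of states
N_INPUTS = 3   # hypothetical number of distinct inputs

def decode(chromosome):
    """Turn a flat gene list into table[state][input] -> next state."""
    if len(chromosome) != N_STATES * N_INPUTS:
        raise ValueError("chromosome has the wrong length")
    return [chromosome[s * N_INPUTS:(s + 1) * N_INPUTS]
            for s in range(N_STATES)]

def next_state(table, state, inp):
    """Look up the transition for the current state and input."""
    return table[state][inp]

# One gene per (state, input) pair, grouped by state.
chromosome = [0, 1, 2,  1, 1, 3,  2, 0, 2,  3, 3, 0]
table = decode(chromosome)
print(next_state(table, 1, 2))  # 3
```

With a flat layout like this, crossover and mutation can operate on the gene list directly, and every result still decodes to a valid FSM as long as each gene stays below N_STATES.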

EVOLUTIONARY FSM-BASED AGENT
Two approaches:

Mono-seed
All individuals are tested on the same level.
The length of the level and the time limit to complete it grow with
each generation.

Multi-seed
Every individual is tested on 30 random levels.
The fitness is computed from the results of all the plays.
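
The difference between the two schemes can be sketched as two fitness functions. The play function below is a hypothetical stand-in for one simulated game, and taking the mean over the 30 plays is an assumption: the slide only says that the results of all the plays are considered. The level lengths and growth step are likewise illustrative.

```python
import random
from statistics import mean

N_LEVELS = 30  # from the slide: 30 random levels per individual

def play(individual, level_seed, length):
    """Hypothetical stand-in for one game; returns a noisy score."""
    rng = random.Random(level_seed)
    skill = sum(individual) / len(individual)
    return length * skill + rng.uniform(-10, 10)

def mono_seed_fitness(individual, generation, level_seed=0,
                      base_length=100, growth=10):
    """Same level for every individual; its length grows each generation."""
    return play(individual, level_seed, base_length + growth * generation)

def multi_seed_fitness(individual, level_seeds, length=100):
    """Aggregate over all plays (the mean is an assumption)."""
    return mean(play(individual, s, length) for s in level_seeds)

seeds = random.Random(1).sample(range(10_000), N_LEVELS)
agent = [1, 0, 1, 1]
print(mono_seed_fitness(agent, generation=5))
print(multi_seed_fitness(agent, seeds))
```

Averaging over many seeds dampens the per-game noise, at the cost of 30 simulated games per evaluation instead of one.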

EVOLUTIONARY FSM-BASED AGENT
There are two types of fitness:
• Generic fitness
• Hierarchical fitness

The selection mechanism keeps the best individual plus a
percentage of the best ones, selected by tournament according to
their fitness.

Crossover takes the best individual of the
current generation as one parent and one of the
individuals with positive fitness as the other.

In the mutation operator, several genes in each of these
individuals are randomly selected and mutated.
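
A rough sketch of one generation built from these three operators. The fraction kept, the one-point crossover, the number of mutated genes, and the choice to mutate only the offspring are all assumptions; the slide does not give these details, and the function and parameter names are hypothetical.

```python
import random

def next_generation(pop, fitness, rng, keep_frac=0.5,
                    n_mutations=2, n_values=4):
    """Build the next population from `pop` (lists of ints < n_values)."""
    best = max(pop, key=fitness)

    # Selection: keep the best, then fill a fraction by binary tournament.
    n_keep = max(1, int(keep_frac * len(pop)))
    selected = [best]
    while len(selected) < n_keep:
        a, b = rng.sample(pop, 2)
        selected.append(a if fitness(a) >= fitness(b) else b)

    # Crossover partners: individuals with positive fitness.
    partners = [ind for ind in pop if fitness(ind) > 0] or [best]

    children = []
    while len(selected) + len(children) < len(pop):
        mate = rng.choice(partners)
        cut = rng.randrange(1, len(best))
        child = best[:cut] + mate[cut:]  # new list, parents untouched
        # Mutation: overwrite a few randomly chosen genes.
        for i in rng.sample(range(len(child)), n_mutations):
            child[i] = rng.randrange(n_values)
        children.append(child)
    return selected + children

rng = random.Random(0)
pop = [[rng.randrange(4) for _ in range(6)] for _ in range(10)]
score = lambda ind: sum(ind) - 8  # toy fitness; may be negative
new_pop = next_generation(pop, score, rng)
print(len(new_pop), score(new_pop[0]))
```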

EXPERIMENTS AND RESULTS
Mono-seed approach

EXPERIMENTS AND RESULTS
Multi-seed approach

SOME OF OUR APPROACHES
Difficulty level 1 (completed):
http://www.youtube.com/watch?v=6Pj6dZCE070

Difficulty level 2 (completed):
http://www.youtube.com/watch?v=gtfuY-L0WDA

Difficulty level 3 (completed):
http://www.youtube.com/watch?v=qQVQ43sWwYY

Difficulty level 12 (stacked):
http://www.youtube.com/watch?v=zNGfBApX7sk

CONCLUSIONS AND FUTURE WORK
An agent has been implemented using Finite State Machine (FSM)
models, improved by means of Genetic Algorithms (GAs) under
two evaluation schemes: mono-seed and multi-seed.

Mono-seed can yield excellent agents for the level on which they were
‘trained’ (evolved), but they behave quite badly on a different level.
Multi-seed takes much more computational time and has higher
resource requirements, but the agents it yields play very well
on any level of the difficulty considered during evolution.

All these agents play much better than an expert human player and
can complete the levels in times that a human cannot achieve.

Editor's notes

This can be removed if you consider it already known to the audience.
This can be removed if you consider it already known to the audience.
Say that we focus on the Gameplay track. We aim to build an agent that meets that challenge (according to its rules).
The matrices are very important for knowing how to act at each instant (tick).
I suppose you know what all those items are. :D
The possible states are encoded, including state 8 (do nothing) and state 11 (jump and crouch at the same time). The latter is not implemented in the Infinite Mario Bros simulator, but it is in the Mario games.
If this is not clear to you, look it up in the original paper (the 17-page one).
Mono-seed always runs on the same level (with the same difficulty level), and the length of that level is increased over the generations. Multi-seed aims to avoid the ‘noise’ typical of evaluating individuals in games, that is, the same individual being rated as very good in one game and very bad in another, depending on pseudorandom factors (enemy behaviour, events during the game, etc.).
Look up the details of each operator in the paper. ;)
Run parameters of the algorithm in mono-seed, obtained by systematic experimentation.
Note the large population size required to evolve a good agent.
The run time is high, and the memory requirements are a problem that often makes the program (the algorithm) crash.
There is a positive trend (the fitness improves gradually), although it is not as sharp as we would like, and there are oscillations due to the ‘noise’ in evaluating individuals.
Multi-seed parameters. The results are similar, and the problems that occur (memory and time) get worse because many more evaluations and variations are performed per individual.
Better to download these examples to your hard drive in case there is no connection. You can use the program “aTube Catcher”, for example. The level 12 run crashed, as in all cases, but up to that point its behaviour was very good.
The agents obtained could be evolved up to beating difficulty level 4; beyond that, memory errors occurred (more of them as the difficulty level increased). Even so, that level is more than enough for what the Gameplay track requires.
The conclusions after carrying out this work are the following:
