Conference PaperPDF Available

Recognizing Formations in Opponent Teams

Authors:

Abstract and Figures

WWW home page:http://www.virtualwerder.de Abstract. The online coach within the simulation league has become more pow- erful over the last few years. Therefore, new options with regard to the recognition of the opponents strategy are possible. For example, the online coach is the only player who gets the information of all the objects on the field. This leads to the idea determine the opponents play system by the online coach and then choose an effective counter-strategy. This has been done with the help of an artificial neural network and will be discussed in this paper. All soccer-clients are initialized with a specific behavior and can change their behavior to an appropriate mode depend- ing on the coach's commands. The result is a flexible and effective game played by the eleven soccer-clients.
Content may be subject to copyright.
Recognizing Formations in Opponent Teams
Ubbo Visser, Christian Dr¨
ucker, Sebastian H¨
ubner,
Esko Schmidt, Hans-Georg Weland
TZI - Center for Computing Technologies
University of Bremen
D-28334 Bremen, Germany
{visser|druecker|huebner|esko|weland}@tzi.de
WWW home page: http://www.virtualwerder.de
Abstract. The online coach within the simulation league has become more pow-
erful over the last few years. Therefore, new options with regard to the recognition
of the opponents strategy are possible. For example, the online coach is the only
player who gets the information of all the objects on the field. This leads to the
idea determine the opponents play system by the online coach and then choose an
effective counter-strategy. This has been done with the help of an artificial neural
network and will be discussed in this paper. All soccer-clients are initialized with
a specific behavior and can change their behavior to an appropriate mode depend-
ing on the coach’s commands. The result is a flexible and effective game played
by the eleven soccer-clients.
1 Introduction
Our team is based on the sources that were released by the CMUnited99 team [10]. We
decided to do so because it would have been to time consuming to reinvent all basic
skills.1Instead, we focus on research w.r.t. high level functions which will hopefully
lead to new ideas and results for the RoboCup community. Our long run plan is to use
part of the provided functions from the CMU-client to construct a more sophisticated
team with individual players.
Over the last few years several attempts have been made in learning of team be-
haviour. Similar approacheshave been developed from numerous research groups. These
studies have the focus on learning team behavior within the simulation and middle size
league (see [11], [12], [13], [9]). Raines et al. [7], e.g., describe a new approach to
automate assistants to aid humans in understanding team behaviours for the simulation
league. This approaches are designed for the analysis of games, off-line after playing, to
gain new experiences for the next games. Frank et al. ([4]) present a real time approach
which is based on statistical methods. A team will be evaluated statistically but there is
no recognition of team strategies.
While conducting our research for this project we obtained support from real-life
soccer experts. In an interview, Thomas Schaaf, the manager of SV Werder Bremen
pointed out the importance of the strategy recognition of the opponent team. While the
1We would like to give special thanks to the original authors
coach-client has been able to participate through analysis and control in real matches
since 1998 [3], the idea of general strategic planning becomes possible. Like in real
life matches, the coach is able to give strategic commands depending on the opponent’s
system and the current score. We presume that the performance of our team can be
improved by analyzing the opponent’s strategy.
2 Agents
The Virtual Werder team consists of individual players which have different behaviors.
Players: 22 types of players have been developed with a variety of characteristics to
ensure the flexibility and variability of actions and reactions within a game. There are
different types of forwards, defenders, mid-fielders and goal keepers. Therefore, our
clients have the option to change their behavior from one style to another at any stage
of the game. This concept is an important key feature to carry out changes in a forma-
tion while playing against teams that are switching between different play systems. We
plan to use the online-coach, so that our clients have the ability to receive the messages,
parse them, and change their behavior accordingly. On the other hand, the low-level
skills of the agents are based on the sources provided by CMUnited99 [2]. These skills
include functions to locate the ball, the other team members and the opponent players.
In addition, methods to communicate via UDP-sockets, a parser for soccer server mes-
sages and other utilities such as the memory structures, have been used. Our clients are
initialized with a certain behavior and the desired formation when connecting to the
soccer server. Furthermore, they have the ability to choose other behaviors and switch
to them immediately. These high-level functions include methods to carry out different
defense systems such as man-to-man marking and zone defense. We also included new
mid-level functions, e.g. finding a teammate able to catch a pass. Another new skill in-
cludes the player moving to a certain point on the field, while keeping track of the ball.
This is accomplished through the turn neck command. The characteristics of these 22
types of players are described in the Virtual Werder team description.
Coach: The coach observes the game continually, analyzes the formation of the op-
ponent team at given points in time with an artificial neural network (ANN) ([5], [8])
and broadcasts an adequate counter formation to the players during the next interrup-
tion. The current evaluation takes place twice per second. The positions of the players
serve as inputs for an ANN, that is trained with the formations most commonly played
in our test games and the log-files (see section 3). In order to get a reliable impression
only outputs with high ratings are used. Whenever the play mode switches to another
state than PLAY ON, the coach generates a message for his team. It instructs the play-
ers which formation the opponent is currently using and gives information about the
appropriate counter attack.
3 Approach
We observed that teams in the last RoboCup-tournaments typically relied on their strat-
egy and team formation and often didn’t change it within a game. When changes were
Fig.1. Positions of opponent players and bounding box; marked cells define the input
vector for the neural network.
made they depended on the score. A common practice was to switch from a offensive
to a defensive formation if the team lead with more than ngoals. On the other hand
some teams remained on the same system regardless of the opponent’s strategy. This
is the point where the online-coach diagnoses what to do depending on the opponent’s
formation. The coach-client had to be fed with formations and information on how to
analyze them. Part of the integrated knowledge was obtained from an interview with a
real expert, the Werder Bremen’s head coach Thomas Schaaf, other parts from litera-
ture, e. g. [1] and from games played in the last RoboCup tournaments. This knowledge
can be used by the online-coach within the decision process.
The observation and analysis of the opponents team is processed in several steps
which are described below. The CMUnited99 sources provide the communication with
the soccer server and the parsing of messages. Furthermore, it supplies the coach with
collected information in data structures which are easily accessible. This information is
then prepared and will act as an input vector for an ANN. In order to prepare the data
we first have to decide which variables should be used. Our model consists of 64 binary
input variables. For this purpose a bounding box is placed around the positions of the
opponent players. The box is currently divided into a grid of eight by eight cells which
leads to an arrangement of 64 fields (Fig. 1). There must be at least one player inside a
specific field with the value of this field set to 1. Otherwise, the value must be set to 0.
The sum of the fields defines the input vector for the ANN. The network classifies the
vector in one of 16 output classes, each representing a specific play system.
The network is implemented as a C-function2and is called to calculate the ratings
at 16 possible output neurons which represent the opponent’s formations (section 2). If
the output neuron with the highest value exceeds a demanded threshold, this class will
be chosen as the result of this function. This result and other information in addition,
e.g. the size and position of the bounding box or the current score of the game, deter-
2snns2c by Bernward Kett was used to transform the trained network from an internal SNNS
representation to a usable C-function [14].
mines the appropriate counter formation. During the next interruption of the game, this
information is broadcasted to the team.
Training results: Our coach uses an ANN to analyze the formation of the opponent’s
team. The network itself was trained to recognize 16 different formations. The coach-
client uses this knowledge about the classified formation to evaluate a proper counter-
attack. Soccer formations are typically noted as a combination of defense, mid-field
and forward players, e.g. a 5-3-2 represents a team with five defense players (and a
goalie), three players in the middle field and two forward players. However, there are
some special systems that do not fit into this pattern, these are referred to by name, e. g.
the Catenaccio system [1].
We developed a tool called “ExportPlayer” to obtain formation examples for our
network. This program is based on the log-player and takes automatic snapshots of ex-
isting log-files. It extracts the positions of the players (not including the goal keeper) and
normalizes the coordinates to a grid of eight by eight cells. Cells labeled ’0’ do not con-
tain players, cells labeled ’1’ contain at least one player. The ExportPlayer returns a pat-
tern file which contains the values of all input and output neurons to serve the ANN (see
also team description). We used the Stuttgart Neural Network Simulator (SNNSv4.1)
[14] to train the examples and to create the code for a feedforward-network. We used
standard backpropagation as learning method with the learning parameter η= 1.0and
the maximum difference dmax = 0.3between the teaching value and the output.
The choice of the threshold mentioned in section 3is a very important factor for the
efficiency of the online-coach. Table 1shows the correlation between different thresh-
olds which results in the amount of permitted input patterns and their correctness. The
inquiry is based on 680 patterns obtained from log-files and test games. These have
been previously classified by us. On ten separate occasions, these patterns were ran-
domly divided into 612 training and 68 test sets (10%) and were processed by the ne.
The average of these ten different results have been calculated for validity.
Due to the large quantity of test patterns (currently, a snapshot is made twice per
second), the relatively high amount of rejections is not problematic in this environment.
Furthermore, it is similar to a real soccer game where distinct formation occurs infre-
quently.
threshold classified classified correctly
0% 100% 48.37%
80% 49.67% 65.53%
85% 41.34% 67.88%
90% 32.52% 69.30%
95% 20.10% 72.27%
Table 1. Relation between output threshold and correctness
4 Results
Our hypothesis was that we improve the performance of our team by detecting op-
ponents strategies and obtaining the appropriate counter formation. Our test environ-
ment consists of our team and the teams of CMUnited99 and last years’ Mainz Rolling
Brains. We carried out ten games against each team with two different play systems.
We decided that the formations 5-4-1 (defensive) and 3-4-3 (offensive) were the most
promising for our demonstration.
Against CMUnited99 Virtual Werder performs better with a defensive formation.
The average loss against the CMU-team was 0:14 with the 3-4-3 and 0.1:9 with the
defensive strategy (we might add that this is an unacceptable situation in total). Table 2
shows the average results. Against Mainz Rolling Brains on the other hand we can see
that Virtual Werder performs better with the offensive formation. The average score of
3.1:0.7 was better than the 0.5:0.9 score with the defensive formation. We believe that
the understaffed mid-field caused this situation (see table 2).
We come to the conclusion that the online-coach can help to detect the opponent’s
strategy. Once a team knows the play system of the opponent, appropriate counter ac-
tions can be carried out. However, later experiments have shown that we cannot exclude
that the score is caused by other skills such as the individual play style. We think that
further investigations with a ’standard team’ would be helpful to make a clear point
on this issue. In summary, we have seen that the Virtual Werder team performs better
with this new information. The average score depends upon the chosen play system and
whether the team can change their system online.
Mainz RB CMU-99
VW Def. 5-4-1 0.5:0.9 0.1:9
VW Off. 3-4-3 3.1:0.7 0:14
Table 2. Relation between formation and score
The technology of strategy detection could be useful for other application areas.
Firstly, the quality of action predictions of physical agents can be improved which plays
an important role within the control mechanisms of autonomous agents. Secondly, it is
important to improve the robustness and security issues of electronic markets within the
area of electronic commerce.
5 Future Work
Further work can be done in the following areas:
Keeping track of changes: This means that the coach-client consists of internal states.
With internal states a list of ’scenes’ describing the current play system of the oppo-
nent’s can be stored. The next step is to detect changes in the opponent strategy. A low
pass filter can then be used to determine whether the play system changed temporary or
for a longer period.
Evaluation of counter attacks: The evaluation of a fitting counter-attack is another
issue in our research. Therefore, we will focus on new criteria, such as play cycle and
score. The idea is to change formations in situations that do not depend on the oppo-
nent’s play.
Captain: Looking at formations is a first step to a more strategic play. The next step to
improve the team performance will be the transfer of the coach knowledge to a key-
player, which can give commands to the team members during the game, not only
within a break. This ”captain”-concept could also be extended with the concept of a
key defense player, which is responsible for the guidance of the defense.
References
1. Christoph Biermann and Ulrich Fuchs. Der Ball ist rund, damit das Spiel die Richtung
¨
andern kann. Kiepenheuer & Wisch, 1999.
2. CMU. Sources of low-level skills. http://www.cs.cmu.edu/afs/cs/usr/pstone/mosaic/
RoboCup/CMUnited99-sim.html.
3. Emiel Corten, Klaus Dorer, Fredrik Heintz, Kosta Kostiadis, Johan Kummeneje, Helmut
Myritz, Itsuki Noda, Jukka Riekki, Ratrick Riley, Peter Stone, and Tralvey Yeap. Soc-
cerserver manual ver5.1 release. Manual, 1999.
4. Ian Frank, Kumiko Tanaka-Ishi, Katsuto Arai, and Hitoshi Matsubara. The statistics proxy
server. In Tucker Balch, Peter Stone, and Gerhard Kraetschmar, editors, 4th International
Workshop on RoboCup, pages 199–204, Melbourne, Australia, 2000. Carnegie Mellum Uni-
versity Press.
5. J. Hertz, A. Krogh, and R.G. Palmer. Introduction to the theory of neural computation,
volume 1. Addison-Wesley Publishing Company, Redwood City, California, 1991.
6. H. Kitano, Y. Kuniyoshi, I. Noda, M. Asada, H. Matsubara, and E. Osawa. Robocup: A
challenge problem for ai. Artificial Intelligence Magazine, 18(1):73–85, 1997.
7. Taylor Raines, Millind Tambe, and Stacy Marsella. Automated assistants to aid humans in
understanding team behaviors. In Fourth International Conference on Autonomous Agents
(Agents 2000), Barcelona, Spain, 2000.
8. Raul Rojas. Neural Networks - A Systematic Introduction. Springer-Verlag, Berlin, 1996.
9. Peter Stone. Layered Learning in Multiagent Systems: A Winning Approach to Robotic Soc-
cer. Intelligent Robotics and Autonomous Agents. MIT Press, Cambridge, Massachusetts,
2000.
10. Peter Stone, Patrick Riley, and Manuela Veloso. The CMUnited-99 champion simulator
team. In Manuela Veloso, Enrico Pagello, and Hiroaki Kitano, editors, RoboCup-99: Robot
Soccer World Cup III, Berlin, 2000. Springer-Verlag.
11. Peter Stone and Manuela Veloso. A layered approach to learning client behaviors in the
robocup soccer server. Applied Artificial Intelligence, 12(3):165–188, 1998.
12. Peter Stone and Manuela Veloso. Towards collaborative and adversarial learning: A case
study in robotic soccer. International Journal of Human-Computer Studies, 48, 1998.
13. Peter Stone and Manuela Veloso. Team-partitioned, opaque-transition reinforcement learn-
ing. In M. Asada and H. Kitano, editors, Robot Soccer World Cup II, Berlin, 1999. Springer-
Verlag.
14. A. Zell, G. Mamier, M. Vogt, N. Mache, R. Hubner, K.U. Herrmann, T. Soyez, M. Schmalzl,
T. Sommer, A. Hatzigeorgiou, S. D¨
oring, and D. Posselt. Snns: Stuttgart neural network
simulator, user manual version 3.2. Technical Report 3/94, Universit¨
at Stuttgart, 1994.
... Football coaches can take advantage of data from past games to assess the effectiveness of their offensive formations and player packages against the opposition's defensive strategy, leading to significant improvements in team performance. The structured and repetitive gameplay in football makes related work in soccer formation recognition [14,15] less directly relevant to this work, as soccer features a more dynamic style of play. ...
Article
Full-text available
American football is one of the most popular team sports in the United States. There are approximately 16,000 high school and 890 college football teams, and each team plays around 10–14 games per football season. Contrary to most casual fans’ views, American football is more than speed and power, it requires preparation and strategies. Coaches analyze hours of video of their own and opponents’ games to extract important information such as offensive play formations, personnel packages and opposing coaches’ tendency to gain competitive advantages. This time-consuming and slow process called “tagging” takes away the coaches’ time from other duties and limits the players’ time for preparation and training. In this work, we created three datasets for our experiments to demonstrate the importance of player detection accuracy, which is easily affected by camera placement and player occlusion issues. We applied a unique data augmentation technique to generate data for each specific experiment. Our model achieved a remarkable 98.52% accuracy in play type recognition and 92.38% accuracy in player position recognition for the experiment that assumes no missing players or no occlusion problem, which could be achieved by placing the camera high above the football field.
... Їх можна застосувати, викликавши відповідні методи API на сервері. Такі дії включають в себе [4]: ...
Article
У даній статті наведено основні принципи та підходи до командної взаємодії інтелектуальних агентів, що використовуються у всесвітніх змаганнях 2D RoboCup simulation league. Відповідно до підходу – «Drop-in player challenge», розроблено та описано базові моделі поведінки, що були протестовані в середовищі моделювання 2D RoboCup simulator. Стаття стисло описує суть моделей поведінки супроводжуючи ілюстраціями та хід проведення експериментального тестування з результатами у вигляді таблиць. Підходом «Drop-in player challenge» почали займатися відносно недавно. Багато відомих команд, такі як Austrian Kangaroos, B-Human, Cerberus, що змагаються восновних лігах почали брати участь у змаганнях «змішаних» гравців. Це свідчить про потребу саме в такому типі взаємодії. Основна задача роботи – це вдосконалення взаємодії з союзними роботами, що мають інші кодові бази. Такі роботи не мають змоги комунікувати – тільки аналізувати дії союзних гравців і бути корисними для досягнення спільної мети. Також присутній елемент внутрішньо-командної конкуренції, коли задача агенту не тільки максимально допомогти союзним роботам, а і стати найрезультативнішим гравцем в команді. Тема основної роботи охоплює проблематику окремих випадків тактики гри команди, на кшталт, гри з застосуванням тільки нападаючих моделей поведінки, гри з застосуванням тільки оборонних моделей, різних комбінацій моделей поведінки для визначення глобального оптимуму виграшної тактики. За результатами тестування було визначено ефективність розроблених моделей поведінки, найкращі співвідношення застосувань моделей на полі, а також протестовані пограничні випадки застосування моделей, що підтвердило теоретичні передбачення. Бібл. 6, іл. 7, табл. 4.
... Various new methods use principles of (artificial) neural networks (McCulloch and Pitts, 1943) for sports analytics in football. Visser et al. (2001) used artificial neural networks to recognize the team formation of the opponent team by using a set of predefined reference team formations. Expanding on this, Ayanegui-Santiago (2009) suggested the inclusion of multiple relations between players for the recognition of team formations in football. ...
Article
Full-text available
In this paper, we explore the use of the Static Qualitative Trajectory Calculus (QTCS), a qualitative spatiotemporal method based on the QTC, for the analysis of team formations in football. While methods for team formation analysis in sports are predominantly quantitative in nature, QTCS enables the comparison of team formations by describing the relative positions between players in a qualitative manner, which is more related to the way players position themselves on the field. QTCS has the potential to allow to monitor to what extent a football team plays according to a coach’s predetermined formation. When applied to multiple matches of one team, the method can contribute to the definition of the playing style of a team. We present an experiment aimed at identifying the team formation played by Belgian national football team during the 2018 FIFA World Cup held in France.
... Various new methods use principles of (artificial) neural networks (McCulloch and Walter, 1943). Visser et al. (2001) used artificial neural network systems to recognize the team formation of the opponent team. Starting with the positions of the opponent players at a certain timestamp, the neural network tried to classify that moment into a set of predefined team formations (Atmosukarto et al., 2013) later used an analogue method in American football) and proposed the appropriate counter team formation for the own team. ...
Conference Paper
In this paper, we introduce the Static Qualitative Trajectory Calculus (QTCS), a qualitative spatiotemporal method based on the Qualitative Trajectory Calculus (QTC), for team formation analysis in football. While methods for team formation analysis are mostly quantitative, QTCS enables the comparison of team formations by describing the relative positions between players in a qualitative manner, which is much more related to the way players position themselves on the field. To illustrate the method, we present a series of examples based on real football matches of a 2016-2017 European football competition. With QTCS, team formations of both an entire team as well as a smaller group of players can be described. Analysis of these formations can be done for multiple matches, thereby defining the playing style of a team, or at critical moments during a game, such as set pieces.
... The task of recognizing the opponent strategy in order to apply an appropriate counter action has been already adressed in previous researches. For example, the works of Visser et al. [1] and Drücker et al. [2] propose a system for recognizing opponent's formations and then apply a counter formation. This is done by using an artificial neural network that is able to classify data among 16 formation classes and then apply the counter formation especially designed against each class. ...
Article
Full-text available
In the domain of the Soccer simulation 2D league of the RoboCup project, appropriate player positioning against a given opponent team is an important factor of soccer team performance. This work proposes a model which decides the strategy that should be applied regarding a particular opponent team. This task can be realized by applying preliminary a learning phase where the model determines the most effective strategies against clusters of opponent teams. The model determines the best strategies by using sequential Bayes' estimators. As a first trial of the system, the proposed model is used to determine the association of player formations against opponent teams in the particular situation of corner-kick. The implemented model shows satisfying abilities to compare player formations that are similar to each other in terms of performance and determines the right ranking even by running a decent number of simulation games.
Conference Paper
In the domain of RoboCup 2D soccer simulation league, appropriate player positioning against a given opponent team is an important factor of soccer team performance. This work proposes a model which decides the strategy that should be applied regarding a particular opponent team. This task can be realized by applying preliminary a learning phase where the model determines the most effective strategies against clusters of opponent teams. The model determines the best strategies by using sequential Bayes’ estimators. As a first trial of the system, the proposed model is used to determine the association of player formations against opponent teams in the particular situation of corner-kick. The implemented model shows satisfying abilities to compare player formations that are similar to each other in terms of performance and determines the right ranking even by running a decent number of simulation games.
Article
Full-text available
Most existing software packages for sports video analysis require manual annotation of important events in the video. Despite being the most popular sport in the United States, most American football game analysis is still done manually. Line of scrimmage and offensive team formation recognition are two statistics that must be tagged by American Football coaches when watching and evaluating past play video clips, a process which takesmanyman hours per week. These two statistics are the building blocks for more high-level analysis such as play strategy inference and automatic statistic generation. In this chapter, we propose a novel framework where given an American football play clip, we automatically identify the video frame in which the offensive team lines in formation (formation frame), the line of scrimmage for that play, and the type of player formation the offensive team takes on. The proposed framework achieves 95% accuracy in detecting the formation frame, 98% accuracy in detecting the line of scrimmage, and up to 67%accuracy in classifying the offensive team’s formation. To validate our framework, we compiled a large dataset comprising more than 800 play-clips of standard and high definition resolution from real-world football games. This dataset will be made publicly available for future comparison.
Article
This paper shows the results of applying machine learning techniques to the problem of predicting soccer plays in the Small Size League of RoboCup. We have modeled the task as a multi-class classification problem by learning the plays of the STOx's team. For this, we have created a database of observations for this team's plays and obtained key features that describe the game state during a match. We have shown experimentally, that these features allow two learning classifiers to obtain high prediction accuracies and that most miss-classified observations are found early on the plays.
Conference Paper
This paper briefly presents the research performed in the context of FC Portugal project concerning coordination methodologies applied to robotic soccer. FC Portugal's research has been integrated in several teams that have participated with considerable success in distinct RoboCup leagues and competitions. The paper includes a brief description of the main RoboCup competitions in which FC Portugal (and associated teams) has participated with focus in the simulation leagues and related challenges. It also presents a complete state of the art concerning coordination methodologies applied to robotic soccer followed by FC Portugal main contributions on this area. The team contributions include methodologies for strategic reasoning, coaching, strategic positioning, dynamic role exchange and flexible setplay definition and execution. These methodologies compose a complete coordination framework that enable a robotic team to play soccer or execute similar tasks.
Conference Paper
Full-text available
Multi-agent teamwork is critical in a large number of agent applications, including training, education, virtual enterprises and collective robotics. Tools that can help humans analyze, evaluate, and understand team behaviors are becoming increasingly important as well. We have taken a step towards building such a tool by creating an automated analyst agent called ISAAC for post-hoc, off-line agent-team analysis. ISAAC's novelty stems from a key design constraint that arises in team analysis: multiple types of models of team behavior are necessary to analyze different granularities of team events, including agent actions, interactions, and global performance. These heterogeneous team models are automatically acquired via machine learning over teams' external behavior traces, where the specific learning techniques are tailored to the particular model learned. Additionally, ISAAC employs multiple presentation techniques that can aid human understanding of the analyses. This paper presents ISAAC's general conceptual framework, motivating its design, as well as its concrete application in the domain of RoboCup soccer. In the RoboCup domain, ISAAC was used prior to and during the RoboCup'99 tournament, and was awarded the RoboCup scientific challenge award.
Article
Full-text available
The Robot World-Cup Soccer (RoboCup) is an attempt to foster AI and intelligent robotics research by providing a standard problem where a wide range of technologies can be integrated and examined. The first RoboCup competition will be held at the Fifteenth International Joint Conference on Artificial Intelligence in Nagoya, Japan. A robot team must actually perform a soccer game, incorporating various technologies, including design principles of autonomous agents, multiagent collaboration, strategy acquisition, real-time reasoning, robotics, and sensor fusion. RoboCup is a task for a team of multiple fast-moving robots under a dynamic environment. Although RoboCup's final target is a world cup with real robots, RoboCup offers a software platform for research on the software aspects of RoboCup. This article describes technical challenges involved in RoboCup, rules, and the simulation environment.
Book
This book looks at multiagent systems that consist of teams of autonomous agents acting in real-time, noisy, collaborative, and adversarial environments. This book looks at multiagent systems that consist of teams of autonomous agents acting in real-time, noisy, collaborative, and adversarial environments. The book makes four main contributions to the fields of machine learning and multiagent systems. First, it describes an architecture within which a flexible team structure allows member agents to decompose a task into flexible roles and to switch roles while acting. Second, it presents layered learning, a general-purpose machine-learning method for complex domains in which learning a mapping directly from agents' sensors to their actuators is intractable with existing machine-learning methods. Third, the book introduces a new multiagent reinforcement learning algorithm—team-partitioned, opaque-transition reinforcement learning (TPOT-RL)—designed for domains in which agents cannot necessarily observe the state-changes caused by other agents' actions. The final contribution is a fully functioning multiagent system that incorporates learning in a real-time, noisy domain with teammates and adversaries—a computer-simulated robotic soccer team. Peter Stone's work is the basis for the CMUnited Robotic Soccer Team, which has dominated recent RoboCup competitions. RoboCup not only helps roboticists to prove their theories in a realistic situation, but has drawn considerable public and professional attention to the field of intelligent robotics. The CMUnited team won the 1999 Stockholm simulator competition, outscoring its opponents by the rather impressive cumulative score of 110-0. Bradford Books imprint
Conference Paper
The CMUnited-99 simulator team became the 1999 RoboCup simulator league champion by winning all 8 of its games, outscoring opponents by a combined score of 110-0. CMUnited-99 builds upon the successful CMUnited-98 implementation, but also improves upon it in many ways. This paper gives a detailed presentation of CMUnited-99’s improvements over CMUnited-98.
Conference Paper
. We present a novel multi-agent learning paradigm calledteam-partitioned, opaque-transition reinforcement learning (TPOT-RL).TPOT-RL introduces the use of action-dependent features to generalizethe state space. In our work, we use a learned action-dependent featurespace to aid higher-level reinforcement learning. TPOT-RL is an effectivetechnique to allow a team of agents to learn to cooperate towards theachievement of a specific goal. It is an adaptation of traditional RL methodsthat is ...
Conference Paper
We present a real-time statistical analysis tool for soccer. This system is designed to promote the advancement of RoboCup by facilitating fundamental research on issues such as learning and team evaluation and assessment. Analysis of a game is carried out by a central server, to which clients can connect to request data. We describe the operation of the system and give examples of its potential applications.