A customisable 3D platform for agent-based AI research

Last update: Jan 05, 2023

Overview

DeepMind Lab is a 3D learning environment based on id Software's Quake III Arena via ioquake3 and other open source software.

DeepMind Lab - Laser Tag Space Bounce Level (Hard)

DeepMind Lab provides a suite of challenging 3D navigation and puzzle-solving tasks for learning agents. Its primary purpose is to act as a testbed for research in artificial intelligence, especially deep reinforcement learning.

About

Disclaimer: This is not an official Google product.

If you use DeepMind Lab in your research and would like to cite the DeepMind Lab environment, we suggest you cite the DeepMind Lab paper.

You can reach us at [email protected].

Getting started on Linux

Get Bazel from bazel.io.
Clone DeepMind Lab, e.g. by running

$ git clone https://github.com/deepmind/lab
$ cd lab

For a live example of a random agent, run

lab$ bazel run :python_random_agent --define graphics=sdl -- \
               --length=10000 --width=640 --height=480

Here is some more detailed build documentation, including how to install dependencies if you don't have them.

To enable compiler optimizations, pass the flag --compilation_mode=opt, or -c opt for short, to each bazel build, bazel test and bazel run command. The flag is omitted from the examples here for brevity, but it should be used for real training and evaluation where performance matters.

Play as a human

To test the game using human input controls, run

lab$ bazel run :game -- --level_script=tests/empty_room_test --level_setting=logToStdErr=true
# or:
lab$ bazel run :game -- -l tests/empty_room_test -s logToStdErr=true

Leave the logToStdErr setting off to disable most log output.

The values of observations that the environment exposes can be printed at every step by adding a flag --observation OBSERVATION_NAME for each observation of interest.

lab$ bazel run :game -- --level_script=lt_chasm --observation VEL.TRANS --observation VEL.ROT

Train an agent

DeepMind Lab ships with an example random agent in python/random_agent.py which can be used as a starting point for implementing a learning agent. To let this agent interact with DeepMind Lab for training, run

lab$ bazel run :python_random_agent

The Python API is used for agent-environment interactions. We also provide bindings to DeepMind's "dm_env" general API for reinforcement learning, as well as a way to build a self-contained PIP package; see the separate documentation for details.

DeepMind Lab ships with different levels implementing different tasks. These tasks can be configured using Lua scripts, as described in the Lua API.

Upstream sources

DeepMind Lab is built from the ioquake3 game engine, and it uses the tools q3map2 and bspc for map creation. Bug fixes and cleanups that originate with those projects are best fixed upstream and then merged into DeepMind Lab.

bspc is taken from github.com/TTimo/bspc, revision d9a372db3fb6163bc49ead41c76c801a3d14cf80. There are virtually no local modifications, although we integrate this code with the main ioq3 code and do not use their copy in the deps directory. We expect this code to be stable.
q3map2 is taken from github.com/TTimo/GtkRadiant, revision d3d00345c542c8d7cc74e2e8a577bdf76f79c701. A few minor local modifications add synchronization. We also expect this code to be stable.
ioquake3 is taken from github.com/ioquake/ioq3, revision 29db64070aa0bae49953bddbedbed5e317af48ba. The code contains extensive modifications and additions. We aim to merge upstream changes occasionally.

We are very grateful to the maintainers of these repositories for all their hard work on maintaining high-quality code bases.

External dependencies, prerequisites and porting notes

DeepMind Lab currently ships as source code only. It depends on a few external software libraries, which we ship in several different ways:

The zlib, glib, libxml2, jpeg and png libraries are referenced as external Bazel sources, and Bazel BUILD files are provided. The dependent code itself should be fairly portable, but the BUILD rules we ship are specific to Linux on x86. To build on a different platform you will most likely have to edit those BUILD files.
Message digest algorithms are included in this package (in //third_party/md), taken from the reference implementations of their respective RFCs. A "generic reinforcement learning API" is included in //third_party/rl_api, which has also been created by the DeepMind Lab authors. This code is portable.
EGL headers are included in this package (in //third_party/GL/{EGL,KHR}), taken from the Khronos OpenGL/OpenGL ES XML API Registry at www.khronos.org/registry/EGL. The headers have been modified slightly to remove the dependency of EGL on X.
Several additional libraries are required but are not shipped in any form; they must be present on your system:
- SDL 2
- gettext (required by glib)
- OpenGL: A hardware driver and library are needed for hardware-accelerated human play. The headless library that machine learning agents will want to use can use either hardware-accelerated rendering via EGL or GLX or software rendering via OSMesa, depending on the --define headless=... build setting.
- Python 2.7 (other versions might work, too) with NumPy, PIL (a few tests require a NumPy version of at least 1.8), or Python 3 (at least 3.5) with NumPy and Pillow.

The build rules are using a few compiler settings that are specific to GCC. If some flags are not recognized by your compiler (typically those would be specific warning suppressions), you may have to edit those flags. The warnings should be noisy but harmless.

Comments

Unable to build: Python include path missing

I'm on Ubuntu 16.04 and following the guidelines from https://github.com/deepmind/lab/blob/master/docs/build.md . I'm unable to run any of the examples

Here are the major library versions:

Bazel version: 0.4.3
Lua: 5.1.5
Python: 2.7
OpenGL version: 4.5.0
GCC: 5.4.0

Error while trying to run a random agent

lab$ bazel run :game -- --level_script tests/demo_map --verbose_failures
WARNING: Output base '/home/ml/hsatij/.cache/bazel/_bazel_hsatij/2a466b12af1ad7b6dcfd974c8d06585b' is on NFS. This may lead to surprising failures and undetermined behavior.
INFO: Found 1 target...
ERROR: /home/ml/hsatij/.cache/bazel/_bazel_hsatij/2a466b12af1ad7b6dcfd974c8d06585b/external/jpeg_archive/BUILD:74:1: Executing genrule @jpeg_archive//:configure failed: linux-sandbox failed: error executing command /home/ml/hsatij/.cache/bazel/_bazel_hsatij/2a466b12af1ad7b6dcfd974c8d06585b/execroot/lab/_bin/linux-sandbox ... (remaining 5 argument(s) skipped).
src/main/tools/linux-sandbox-pid1.cc:398: "remount(NULL, /home/ml/hsatij/.cache/bazel/_bazel_hsatij/2a466b12af1ad7b6dcfd974c8d06585b/bazel-sandbox/905e0eef-0788-42e6-8852-7b444149d38c-13/tmp/home/ndg/projects, NULL, 2101281, NULL)": No such file or directory
Target //:game failed to build
Use --verbose_failures to see the command lines of failed build steps.
INFO: Elapsed time: 2.569s, Critical Path: 1.20s
ERROR: Build failed. Not running target.

On trying to build the Python interface to DeepMind Lab with OpenGL

lab$ bazel build :deepmind_lab.so --define headless=glx --verbose_failures
WARNING: Output base '/home/ml/hsatij/.cache/bazel/_bazel_hsatij/2a466b12af1ad7b6dcfd974c8d06585b' is on NFS. This may lead to surprising failures and undetermined behavior.
INFO: Found 1 target...
ERROR: /home/ml/hsatij/code/libs/lab/BUILD:972:1: C++ compilation of rule '//:dmlablib' failed: linux-sandbox failed: error executing command 
  (cd /home/ml/hsatij/.cache/bazel/_bazel_hsatij/2a466b12af1ad7b6dcfd974c8d06585b/bazel-sandbox/ff30fcd2-9759-4ca4-8fa3-b82956431988-1/execroot/lab && \
  exec env - \
  /home/ml/hsatij/.cache/bazel/_bazel_hsatij/2a466b12af1ad7b6dcfd974c8d06585b/execroot/lab/_bin/linux-sandbox @/home/ml/hsatij/.cache/bazel/_bazel_hsatij/2a466b12af1ad7b6dcfd974c8d06585b/bazel-sandbox/ff30fcd2-9759-4ca4-8fa3-b82956431988-1/linux-sandbox.params -- /usr/bin/gcc -U_FORTIFY_SOURCE '-D_FORTIFY_SOURCE=1' -fstack-protector -Wall -Wl,-z,-relro,-z,now -B/usr/bin -B/usr/bin -Wunused-but-set-parameter -Wno-free-nonheap-object -fno-canonical-system-headers -fno-omit-frame-pointer '-std=c++0x' -MD -MF bazel-out/local-fastbuild/bin/_objs/dmlablib/public/dmlab_so_loader.pic.d '-frandom-seed=bazel-out/local-fastbuild/bin/_objs/dmlablib/public/dmlab_so_loader.pic.o' -fPIC -iquote . -iquote bazel-out/local-fastbuild/genfiles -iquote external/bazel_tools -iquote bazel-out/local-fastbuild/genfiles/external/bazel_tools -isystem external/bazel_tools/tools/cpp/gcc3 '-DDMLAB_SO_LOCATION="libdmlab.so"' -fno-canonical-system-headers -Wno-builtin-macro-redefined '-D__DATE__="redacted"' '-D__TIMESTAMP__="redacted"' '-D__TIME__="redacted"' -c public/dmlab_so_loader.cc -o bazel-out/local-fastbuild/bin/_objs/dmlablib/public/dmlab_so_loader.pic.o).
src/main/tools/linux-sandbox-pid1.cc:398: "remount(NULL, /home/ml/hsatij/.cache/bazel/_bazel_hsatij/2a466b12af1ad7b6dcfd974c8d06585b/bazel-sandbox/ff30fcd2-9759-4ca4-8fa3-b82956431988-1/tmp/home/ndg/projects, NULL, 2101281, NULL)": No such file or directory
Target //:deepmind_lab.so failed to build
INFO: Elapsed time: 2.294s, Critical Path: 1.11s

I'm new to bazel and unfortunately the error logs are too cryptic for me. Any help to resolve this will be appreciated !

solved

opened by hercky 30

How to use lab as a python module
Hi,

I built lab on Ubuntu 14.04. The python module tests all pass. I'm a little confused about importing deepmind_lab and running experiments with it in python. I apologize if this is due to my lack of general python/bazel knowledge. My understanding is that the process of experimentation with deepmind_lab is:

create a python file experiment.py. import deepmind_lab and use it in the experiment.

add a py_binary entry in the BUILD file for bazel named "experiment"

perform bazel run :experiment

Is this correct? And, is there any way to instead run experiments directly with python experiment.py?
fixed-in-next-verison
opened by johnholl 22
Lab + python multiprocessing
I have implemented A3C with multiprocessing (+ pytorch) as opposed to using threads, however bazel run seems to break silently and clearly without any visible trace. This is what I do:

$ bazel run :struct_runner --define headless=false [...] $ echo $? # this is the error code of the previous process 8

struct_runner.py initialises a lab environment, then creates a bunch of processes in which more envs are created. In particular, the silent crash happens when I create a process p and do p.start() - it also appears to be non-deterministic with respect to the number of processes I manage to spawn before bazel kills them and quits.

I know that @miyosuda has implemented A3C using threads here, however multiprocessing is supported very well by pytorch and it would be a shame to have to deal with threads management.
opened by edran 15
Random agent compiling error
Hey on Ubuntu 18.04, Bazel 0.20.0, Tensorflow 1.2 GPU I am trying to run the random agent and I am getting the following error. Can you please help me on how to solve this error? Thanks

bazel run :python_random_agent --define graphics=sdl -- \

--length=10000 --width=640 --height=480

Starting local Bazel server and connecting to it... INFO: Invocation ID: 69a115a9-45b9-4642-87c8-8fc2170651ac INFO: SHA256 (https://github.com/abseil/abseil-cpp/archive/master.zip) = d3bb4e5578f06ddf3e0e21def6aabf3b4ae81d68b909f06cfcb727734584cab3 INFO: Analysed target //:python_random_agent (53 packages loaded, 3598 targets configured). INFO: Found 1 target... ERROR: /home/neuwong/lab/BUILD:790:1: C++ compilation of rule '//:game_lib_sdl' failed (Exit 1) gcc failed: error executing command /usr/bin/gcc -U_FORTIFY_SOURCE -fstack-protector -Wall -Wunused-but-set-parameter -Wno-free-nonheap-object -fno-omit-frame-pointer -MD -MF bazel-out/k8-fastbuild/bin/_objs/game_lib_sdl/sdl_input.pic.d ... (remaining 103 argument(s) skipped)

Use --sandbox_debug to see verbose messages from the sandbox engine/code/sdl/sdl_input.c:26:11: fatal error: SDL.h: No such file or directory

include <SDL.h>

^~~~~~~

compilation terminated. Target //:python_random_agent failed to build Use --verbose_failures to see the command lines of failed build steps. INFO: Elapsed time: 33.266s, Critical Path: 7.03s INFO: 69 processes: 69 linux-sandbox. FAILED: Build did NOT complete successfully FAILED: Build did NOT complete successfully
opened by behroozmrd47 13

Pip and Python3

Informed by #32, #92 and #52 I tried to configure Bazel on a conda environment, with python 3. bazel build //:deepmind_lab.so appears to run successfully, however bazel run //:python_random_agent is not able to import deepmind_lab

INFO: Running command line: bazel-out/k8-py3-fastbuild/bin/python_random_agent
ImportError: numpy.core.multiarray failed to import
Traceback (most recent call last):
  File "/home/florin/.cache/bazel/_bazel_florin/329274937dbc60c7c6c49b959689f873/execroot/org_deepmind_lab/bazel-out/k8-py3-fastbuild/bin/python_random_agent.runfiles/org_deepmind_lab/python/random_agent.py", line 26, in <module>
    import deepmind_lab
ImportError: numpy.core.multiarray failed to import
ERROR: Non-zero return code '1' from command: Process exited with status 1

You can find the changes bellow. @tkoeppe Does this seem right? Also, are there any log files I can provide you with in order to help with the debugging?

diff --git a/BUILD b/BUILD
index 9e274c5..3972e07 100644
--- a/BUILD
+++ b/BUILD
@@ -984,6 +984,7 @@ py_binary(
     data = [":deepmind_lab.so"],
     main = "python/random_agent.py",
     visibility = ["//python/tests:__subpackages__"],
+    default_python_version = "PY3"
 )
 
 LOAD_TEST_SCRIPTS = [
diff --git a/WORKSPACE b/WORKSPACE
index fa8da47..967c712 100644
--- a/WORKSPACE
+++ b/WORKSPACE
@@ -91,5 +91,5 @@ new_local_repository(
 new_local_repository(
     name = "python_system",
     build_file = "python.BUILD",
-    path = "/usr",
+    path = "/home/florin/Tools/miniconda3/envs/torch30",
 )
diff --git a/python.BUILD b/python.BUILD
index f0b3f9a..b6a6fa8 100644
--- a/python.BUILD
+++ b/python.BUILD
@@ -5,7 +5,11 @@
 
 cc_library(
     name = "python",
-    hdrs = glob(["include/python2.7/*.h"]),
-    includes = ["include/python2.7"],
+    hdrs = glob(["include/python3.6m/*.h",
+                 "lib/python3.6/site-packages/numpy/core/include/**/*.h"
+    ]),
+    includes = ["include/python3.6m",
+                "lib/python3.6/site-packages/numpy/core/include"
+    ],
     visibility = ["//visibility:public"],
 )

opened by floringogianu 13

Pip package not working with python 3
Hi,

I am having some problems when trying to build/install Deepmind Lab as a pip package with Python 3. Currently I am capable of generating the package .whl file (DeepMind_Lab-1.0-py3-none-any.whl) and installing in a conda environment, but I am getting the following error when importing the deepmind_lab module:

deepmind_lab.so: undefined symbol: PyCObject_Type

Any suggestions about how to fix this?

Thanks,
opened by camigord 13
I meet the error when bazel build -c opt //:deepmind_lab.so

bazel build -c opt //:deepmind_lab.so ERROR: /home/ubuntu/zz/projects/lab/WORKSPACE:146:1: //external:tree_archive: no such attribute 'repo_mapping' in 'http_archive' rule ERROR: Skipping '//:deepmind_lab.so': error loading package 'external': Package 'external' contains errors WARNING: Target pattern parsing failed. ERROR: error loading package 'external': Package 'external' contains errors

opened by XqWang3 12
Sorry i don't know what i'm doing wrong

[email protected]:~$ cd lab [email protected]:~/lab$ sudo bazel run :game -- --level_script=tests/empty_room_test --level_setting=logToStdErr=true --sandbox_debug Extracting Bazel installation... Starting local Bazel server and connecting to it... INFO: SHA256 (https://github.com/abseil/abseil-cpp/archive/master.zip) = f63fa171a79bfd38f995b899989e74144255fcea57ad74079792385841db64dd DEBUG: Rule 'com_google_absl' indicated that a canonical reproducible form can be obtained by modifying arguments sha256 = "f63fa171a79bfd38f995b899989e74144255fcea57ad74079792385841db64dd" INFO: Analysed target //:game (51 packages loaded, 3679 targets configured). INFO: Found 1 target... ERROR: /home/user/.cache/bazel/_bazel_root/851aceb1df3dfff730804dc58954a97b/external/glib_archive/BUILD.bazel:1:1: Executing genrule @glib_archive//:gen_configure failed (Exit 1) bash failed: error executing command /bin/bash -c ... (remaining 1 argument(s) skipped)

Use --sandbox_debug to see verbose messages from the sandbox configure: error: *** You must have either have gettext support in your C library, or use the *** GNU gettext library. (http://www.gnu.org/software/gettext/gettext.html)

Target //:game failed to build Use --verbose_failures to see the command lines of failed build steps. INFO: Elapsed time: 254.198s, Critical Path: 46.81s INFO: 154 processes: 154 processwrapper-sandbox. FAILED: Build did NOT complete successfully FAILED: Build did NOT complete successfully

opened by ZombiePm 10
dm-lab in Keras
Heya,

I'm currently re-implementing the vector-based navigation architecture of deep-mind's nature paper for tensorflow 2. However, it seems there is an issue regarding the environment of the python API when used with the Keras model API. In particular, I do something like:

... observations = ['RGB', 'DEBUG.POS.ROT', 'DEBUG.POS.TRANS'] env = deepmind_lab.Lab(level, observations, config=lab_config, renderer='hardware') env.reset() ... def generate_batch(env, batch_size=10): while True: ... obs = env.observations() ... x, y = ... some computation env.reset() ... yield x, y model.fit_generator( generate_batch(env, batch_size=10), steps_per_epoch=1000, verbose=1, use_multiprocessing=False )

depending whether i use env.reset() or not it sometimes drops error (related to: https://github.com/deepmind/lab/issues/134). Even if it does not explicitely complains, it stops always within the first batch. Since Tensorflow 2 seems to convert the generate_batch function into some eager-execution/autograph stuff, I am quite confused whether the env object gets either copied or something else nasty going on under the hood. I hope someone has more clue than I do :)

The agent I am using is performing random actions (so it should not be the same state-sequence all the time, especially because the rnd seed is not statically set)
opened by uahic 9

Failed to find function dmlab_connect in library!

Hi,

After running bazel build -c opt //:deepmind_lab.so, I run bazel run :python_random_agent --define headless=osmesa and get the following output:

Failed to find function dmlab_connect in library!
Traceback (most recent call last):
  File "/network/home/alversaf/.cache/bazel/_bazel_alversaf/c78c672c9c47c1e9c2d9a08f489f56c4/execroot/org_deepmind_lab/bazel-out/k8-fastbuild/bin/python_random_agent.runfiles/org_deepmind_lab/python/random_agent.py", line 206, in <module>
    args.record, args.demo, args.demofiles, args.video)
  File "/network/home/alversaf/.cache/bazel/_bazel_alversaf/c78c672c9c47c1e9c2d9a08f489f56c4/execroot/org_deepmind_lab/bazel-out/k8-fastbuild/bin/python_random_agent.runfiles/org_deepmind_lab/python/random_agent.py", line 155, in run
    env = deepmind_lab.Lab(level, ['RGB_INTERLEAVED'], config=config)
RuntimeError: Failed to connect RL API

What should I do? Thanks! (By the way, I am running bazel through a singularity container.)

solved

opened by alversafa 9

Can't build to simple gym-like interface

Walking through the examples, I haven't understood how can I build the code, and after build have an easy, gym-like environment I could instantiate / reset / step / etc? Everything that I managed was building and starting stuff directly through bazel, similar to python_random_agent, but I don't see how to call step/reset outside of bazel. Am I missing something or there is no description of how to build+run such an environment?

opened by bjg2 9

Complete installation script for Ubuntu 20.04

Not an issue, just thought I'd share a full installation script for the DMLab Python package because it took a while to put all the pieces together and might be helpful for others. Tested on Ubuntu 20.04, might also work on newer versions:

#!/bin/sh
set -eu

# Dependencies
apt-get update && apt-get install -y \
    build-essential curl freeglut3 gettext git libffi-dev libglu1-mesa \
    libglu1-mesa-dev libjpeg-dev liblua5.1-0-dev libosmesa6-dev \
    libsdl2-dev lua5.1 pkg-config python-setuptools python3-dev \
    software-properties-common unzip zip zlib1g-dev g++
pip3 install numpy

# Bazel
apt-get install -y apt-transport-https curl gnupg
curl -fsSL https://bazel.build/bazel-release.pub.gpg | gpg --dearmor > bazel.gpg
mv bazel.gpg /etc/apt/trusted.gpg.d/
echo "deb [arch=amd64] https://storage.googleapis.com/bazel-apt stable jdk1.8" | tee /etc/apt/sources.list.d/bazel.list
apt-get update && apt-get install -y bazel

# Build
git clone https://github.com/deepmind/lab.git
cd lab
echo 'build --cxxopt=-std=c++17' > .bazelrc
bazel build -c opt //python/pip_package:build_pip_package
./bazel-bin/python/pip_package/build_pip_package /tmp/dmlab_pkg
pip3 install --force-reinstall /tmp/dmlab_pkg/deepmind_lab-*.whl
cd ..
rm -rf lab

# Stimuli
mkdir dmlab_data
cd dmlab_data
pip3 install Pillow
curl https://bradylab.ucsd.edu/stimuli/ObjectsAll.zip -o ObjectsAll.zip
unzip ObjectsAll.zip
cd OBJECTSALL
python3 << EOM
import os
from PIL import Image
files = [f for f in os.listdir('.') if f.lower().endswith('jpg')]
for i, file in enumerate(sorted(files)):
  print(file)
  im = Image.open(file)
  im.save('../%04d.png' % (i+1))
EOM
cd ..
rm -rf __MACOSX OBJECTSALL ObjectsAll.zip

apt-get clean

opened by danijar 3

Change the speed of the agent dynamically

Hi, I'm wondering if it is possible to modify the speed in which the agent walks/runs dynamically/programmatically ? For example, speed up/down according to a function while the agent is interacting in the environment.

Thank you very much for your support,

opened by kevinNejad 0
Disable Velocity

Is it possible to disable velocity kinematics/dynamics of the agent? In other words, if a "forward" action is taken, the agent moves forward by a fixed amount and stops immediately with no residual velocity carrying it forward?

opened by Alvinosaur 1
Access camera params (extrinsics, intrinsics, near/far plane) for pointcloud calculation

Hello, Is there a way to get either access to a point-cloud observation or access to the camera params (extrinsics, intrinsics, near/far plane) such that I can calculate the point-cloud from z-buffer observation? Thanks!

opened by stepjam 1

Releases(release-2020-12-07)

release-2020-12-07(Dec 7, 2020)
New Levels:

Psychlab.

contributed/psychlab/memory_suite_01/explore_goal_locations_extrapolate

contributed/psychlab/memory_suite_01/explore_goal_locations_holdout_extrapolate

contributed/psychlab/memory_suite_01/explore_goal_locations_holdout_interpolate

contributed/psychlab/memory_suite_01/explore_goal_locations_holdout_large

contributed/psychlab/memory_suite_01/explore_goal_locations_holdout_small

contributed/psychlab/memory_suite_01/explore_goal_locations_interpolate

contributed/psychlab/memory_suite_01/explore_goal_locations_train_large

contributed/psychlab/memory_suite_01/explore_goal_locations_train_small

Language binding tasks.

contributed/fast_mapping/fast_mapping

contributed/fast_mapping/slow_mapping

New Features:

A property system has been added that allows dynamic querying and modifying of environment state. Level scripts can register and consume custom properties.

A new Python module, dmenv_module, is provided that exposes the DeepMind dm_env API.

Minor Improvements:

Quake console commands can now be issued via a write-only property.

New numeric "accumulate" operations for TensorView and the Lua Tensor types: sum, product, sum-of-squares, and dot product of two tensors.

EnvCApi Changes:

"Properties" have been added to the EnvCApi. Properties may be queried, set, and enumerated.

The new API version is 1.4 (up from 1.3).

The EnvCApi function fps is now deprecated; environments should instead use the new property system to communicate this information.

Bug Fixes:

Fix observation 'VEL.ROT' to allow non-zero values when combined with pixel observations. Previously, the presence of pixel observations caused the angular velocity information to be lost due to a logic error.

Source code(tar.gz)
Source code(zip)
release-2019-10-07(Oct 7, 2019)
New Levels:

Psychlab.

contributed/psychlab/cued_temporal_production

contributed/psychlab/memory_suite_01/arbitrary_visuomotor_mapping_train

contributed/psychlab/memory_suite_01/arbitrary_visuomotor_mapping_holdout_interpolate

contributed/psychlab/memory_suite_01/arbitrary_visuomotor_mapping_holdout_extrapolate

contributed/psychlab/memory_suite_01/change_detection_train

contributed/psychlab/memory_suite_01/change_detection_holdout_interpolate

contributed/psychlab/memory_suite_01/change_detection_holdout_extrapolate

contributed/psychlab/memory_suite_01/continuous_recognition_train

contributed/psychlab/memory_suite_01/continuous_recognition_holdout_interpolate

contributed/psychlab/memory_suite_01/continuous_recognition_holdout_extrapolate

contributed/psychlab/memory_suite_01/what_then_where_train

contributed/psychlab/memory_suite_01/what_then_where_holdout_interpolate

contributed/psychlab/ready_set_go

contributed/psychlab/temporal_bisection

contributed/psychlab/temporal_discrimination

contributed/psychlab/visuospatial_suite/memory_guided_saccade

contributed/psychlab/visuospatial_suite/odd_one_out

contributed/psychlab/visuospatial_suite/pathfinder

contributed/psychlab/visuospatial_suite/pursuit

contributed/psychlab/visuospatial_suite/visual_match

contributed/psychlab/visuospatial_suite/visually_guided_antisaccade

contributed/psychlab/visuospatial_suite/visually_guided_prosaccade

Minor Improvements:

The game demo executable can now print observations at each step.

EnvCApi Changes:

The meaning of major and minor versions and the resulting notions of stability are clarified. The new API version is 1.3 (up from 1.2).

The EnvCApi act function is now deprecated in favour of two finer-grained functions: A call to act should be replaced by a call act_discrete to set discrete actions, followed by an optional call to act_continuous to set continuous actions. (DeepMind Lab does not use continuous actions.)

New support for "text actions", which can be set with the new act_text API function. (DeepMind Lab does not use text actions.)

Bug Fixes:

Observation 'DEBUG.CAMERA_INTERLEAVED.TOP_DOWN' is now correct for levels dmlab30/explore_object_rewards_{few,many}.

An error is now raised if there is not enough space to place every possible room (regardless of whether the random generation actually produces a room of excessive size) and if a non-zero number of rooms was requested.

The affected levels have been updated and will generate layouts similar to before, but the whole maze is offset by 100 units, and object placements will change.

Fix top-down camera for language levels.

Correct typo in bot Leonis, skill level 1, based on OpenArena's bot code gargoyle_c.c.

Tensor scalar operations using arrays now work similar to the way they do with single values.

Source code(tar.gz)
Source code(zip)
release-2019-02-04(Feb 4, 2019)
New Levels:

Psychlab.

contributed/psychlab/harlow

Minor Improvements:

Improve documentation of how to configure non-hermetic dependencies (Lua, Python, NumPy).

Add 'allowHoldOutLevels' setting to allow running of levels that should not be trained on, but held out for evaluation.

Add logging library 'common.log', which provides the ability to control which log messages are emitted via the setting 'logLevel'.

Update the ioq3 upstream code to the latest state.

Lua 5.1 is now downloaded and built from source, and is thus no longer a required local dependency.

A minimal version of the "realpath" utility is now bundled with the code, and thus "realpath" is no longer a required local dependency.

Bug Fixes:

Prevent missing sounds from causing clients to disconnect.

Fix a bug in the call of the theme callback 'placeFloorModels', which had caused an "'index' is missing" error during compilation of text levels with texture sets that use floor models, such as MINESWEEPER, GO, and PACMAN.

Fix bug where levels 'keys_doors_medium', 'keys_doors_random' and 'rooms_keys_doors_puzzle' would not accept the common 'logLevel' setting.

Expose a 'demofiles' command line flag for the Python random agent, without which the agent was not able to record or play back demos.

Fix a memory deallocation order error introduced by an earlier commit.

Source code(tar.gz)
Source code(zip)
release-2018-06-20(Jun 20, 2018)
New Levels:

Psychlab.

contributed/psychlab/glass_pattern_detection

contributed/psychlab/landoltC_identification

contributed/psychlab/motion_discrimination{,_easy}

contributed/psychlab/multiple_object_tracking{,_easy}

contributed/psychlab/odd_one_out

Bug Fixes:

Let Python level cache set to None mean the same as not setting it at all.

Change Python module initialization in Python-3 mode to make PIP packages work in Python 3.

Minor Improvements:

Add support for absl::variant to lua::Push and lua::Read.

The demo :game has a new flag --start_index to start at an episode index other than 0.

Add a console command dm_pickup to pick up an item identified by its id.

More Python demos and tests now work with Python 3.

Add a shader for rendering decals with transparency.

Source code(tar.gz)
Source code(zip)
release-2018-05-15(May 15, 2018)
New Levels:

DMLab-30.

contributed/dmlab30/psychlab_arbitrary_visuomotor_mapping

contributed/dmlab30/psychlab_continuous_recognition

Psychlab.

contributed/psychlab/arbitrary_visuomotor_mapping

contributed/psychlab/continuous_recognition

New Features:

Support for level caching for improved performance in the Python module.

Add the ability to spawn pickups dynamically at arbitrary locations.

Add implementations to read datasets including Cifar10 and Stimuli.

Add the ability to specify custom actions via 'customDiscreteActionSpec' and 'customDiscreteAction' callbacks.

Bug Fixes:

Fix playerId and otherPlayerId out by one errors in 'game_rewards.lua'.

Require playerId passed to game:addScore to be one indexed instead of zero indexed and allow game:addScore to be used without a playerId.

game:renderCustomView now renders the view with top-left as the origin. The previous behaviour can be achieved by calling reverse(1) on the returned tensor.

Fix a bug in image.scale whereby the offset into the data was erroneously ignored.

Fix a typo in a require statement in visual_search_factory.lua.

Fix a few erroneous dependencies on Lua dictionary iteration order.

game:AddScore now works even on the final frame of an episode.

Minor Improvements:

Moved .map files into assets/maps/src and .bsp files into assets/maps/built. Added further pre-built maps, which removes the need for the expensive :map_assets build step.

Allow game to be rendered with top-left as origin instead of bottom-left.

Add 'mixerSeed' setting to change behaviour of all random number generators.

Support for BGR_INTERLEAVED and BGRD_INTERLEAVED observation formats.

Add a Lua API to load PNGs from file contents.

Add 'eyePos' to playerInfo() for a more accurate eye position of player. Used in place of player pos + height.

Add support for absl::string_view to lua::Push and lua::Read.

Allow player model to be overridden via 'playerModel' callback.

Add ability to specify custom actions via 'customDiscreteActionSpec' and 'customDiscreteAction' callbacks.

Add game:console command to issue Quake 3 console commands directly.

Add clamp to tensor operations.

Add new callback api:newClientInfo, allowing each client to intercept when players are loading.

Skymaze level generation is now restricted to produce only 100000 distinct levels. This allows for caching to avoid expensive recompilations.

Add cvars 'cg_drawScriptRectanglesAlways' and 'cg_drawScriptTextAlways' to enable script rendering when reducedUI or minimalUI is enabled.

All pickup types can now choose their movement type separately, and in particular, all pickup types can be made static. Two separate table entries are now specified for an item, 'typeTag' and 'moveType'.

Deprecated Features:

Observation format names RGB_INTERLEAVED and RGBD_INTERLEAVED replace RGB_INTERLACED and RGBD_INTERLACED, respectively. The old format names are deprecated and will be removed in a future release.

The pickup item's tag member is now called moveType. The old name is deprecated and will be removed in a future release.

Source code(tar.gz)
Source code(zip)
release-2018-02-07(Feb 7, 2018)
New Levels:

DMLab-30.

contributed/dmlab30/rooms_collect_good_objects_{test,train}

contributed/dmlab30/rooms_exploit_deferred_effects_{test,train}

contributed/dmlab30/rooms_select_nonmatching_object

contributed/dmlab30/rooms_watermaze

contributed/dmlab30/rooms_keys_doors_puzzle

contributed/dmlab30/language_select_described_object

contributed/dmlab30/language_select_located_object

contributed/dmlab30/language_execute_random_task

contributed/dmlab30/language_answer_quantitative_question

contributed/dmlab30/lasertag_one_opponent_small

contributed/dmlab30/lasertag_three_opponents_small

contributed/dmlab30/lasertag_one_opponent_large

contributed/dmlab30/lasertag_three_opponents_large

contributed/dmlab30/natlab_fixed_large_map

contributed/dmlab30/natlab_varying_map_regrowth

contributed/dmlab30/natlab_varying_map_randomized

contributed/dmlab30/skymaze_irreversible_path_hard

contributed/dmlab30/skymaze_irreversible_path_varied

contributed/dmlab30/psychlab_sequential_comparison

contributed/dmlab30/psychlab_visual_search

contributed/dmlab30/explore_object_locations_small

contributed/dmlab30/explore_object_locations_large

contributed/dmlab30/explore_obstructed_goals_small

contributed/dmlab30/explore_obstructed_goals_large

contributed/dmlab30/explore_goal_locations_small

contributed/dmlab30/explore_goal_locations_large

contributed/dmlab30/explore_object_rewards_few

contributed/dmlab30/explore_object_rewards_many

New Features:

Basic support for demo recording and playback.

Minor Improvements:

Add a mechanism to build DeepMind Lab as a PIP package.

Extend basic testing to all levels under game_scripts/levels.

Add settings minimalUI and reducedUI to avoid rendering parts of the HUD.

Add teleported flag to game:playerInfo() to tell whether a player has teleported that frame.

Add Lua functions countEntities and countVariations to the maze generation API to count the number of occurrences of a specific entity or variation, respectively

Bug Fixes:

Fix out-of-bounds access in Lua 'image' library.

Fix off-by-one error in renderergl1 grid mesh rendering.

Source code(tar.gz)
Source code(zip)
release-2018-01-26(Jan 26, 2018)
New Levels:

Psychlab, a platform for implementing classical experimental paradigms from cognitive psychology.

contributed/psychlab/sequential_comparison

contributed/psychlab/visual_search

New Features:

Extend functionality of the built-in tensor Lua library.

Add built-in image Lua library for loading and scaling PNGs.

Add error handling to the env_c_api (version 1.1).

Add ability to create events from Lua scripts.

Add ability to retrieve game entity from Lua scripts.

Add ability create pickup models during level load.

Add ability to update textures from script after the level has loaded.

Add Lua customisable themes. Note: This change renames helpers in maze_generation to be in lowerCamelCase (e.g. MazeGeneration -> mazeGeneration).

The directory game_scripts has moved out of the assets directory, and level scripts now live separately from the library code in the levels subdirectory.

Minor Improvements:

Remove unnecessary dependency of map assets on Lua scripts, preventing time-consuming rebuilding of maps when scripts are modified.

Add ability to disable bobbing of reward and goal pickups.

The setting controls (with values internal, external) has been renamed to nativeApp (with values true, false, respectively). When set to true, programs linked against game_lib_sdl will use the native SDL input devices.

Change LuaSnippetEmitter methods to use table call conventions.

Add config variable for monochromatic lightmaps ('r_monolightmaps'). Enabled by default.

Add config variable to limit texture size ('r_textureMaxSize').

api:modifyTexture must now return whether the texture was modified.

Add ability to adjust rewards.

Add ability to raycast between different points on the map.

Add ability to test whether a view vector is within an angle range within a oriented view frame.

Bug Fixes:

Increase current score storage from short to long.

Fix ramp jump velocity in level lt_space_bounce_hard.

Fix Lua function 'addScore' from module 'dmlab.system.game' to allow negative scores added to a player.

Remove some undefined behaviour in the engine.

Reduce inaccuracies related to angle conversion and normalization.

Behavior of team spawn points now matches that of player spawn points. 'randomAngleRange' spawnVar must be set to 0 to match previous behavior.

Source code(tar.gz)
Source code(zip)
release-2016-12-06(Dec 8, 2016)

This is the initial public release of DeepMind Lab.
Source code(tar.gz)
Source code(zip)

Owner

DeepMind

GitHub Repository

Open world survival environment for reinforcement learning

Crafter Open world survival environment for reinforcement learning. Highlights Crafter is a procedurally generated 2D world, where the agent finds foo

213 Jan 05, 2023

Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

MARL Tricks Our codes for RIIT: Rethinking the Importance of Implementation Tricks in Multi-AgentReinforcement Learning. We implemented and standardiz

404 Dec 25, 2022

A customisable 3D platform for agent-based AI research

DeepMind Lab is a 3D learning environment based on id Software's Quake III Arena via ioquake3 and other open source software. DeepMind Lab provides a

6.8k Jan 05, 2023

A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

Applied Reinforcement Learning @ Facebook Overview ReAgent is an open source end-to-end platform for applied reinforcement learning (RL) developed and

3.3k Jan 05, 2023

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

SLM Lab Modular Deep Reinforcement Learning framework in PyTorch. Documentation: https://slm-lab.gitbook.io/slm-lab/ BeamRider Breakout KungFuMaster M

1.1k Dec 24, 2022

A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.

Reinforcement Learning (PyTorch) 🤖 + 🍰 = ❤️ This repo will contain PyTorch implementation of various fundamental RL algorithms. It's aimed at making

123 Dec 23, 2022

A customisable 3D platform for agent-based AI research

Related tags

Overview

About

Getting started on Linux

Play as a human

Train an agent

Upstream sources

External dependencies, prerequisites and porting notes

Comments

include <SDL.h>

Releases(release-2020-12-07)

release-2020-12-07(Dec 7, 2020)

New Levels:

New Features:

Minor Improvements:

EnvCApi Changes:

Bug Fixes:

release-2019-10-07(Oct 7, 2019)

New Levels:

Minor Improvements:

EnvCApi Changes:

Bug Fixes:

release-2019-02-04(Feb 4, 2019)

New Levels:

Minor Improvements:

Bug Fixes:

release-2018-06-20(Jun 20, 2018)

New Levels:

Bug Fixes:

Minor Improvements:

release-2018-05-15(May 15, 2018)

New Levels:

New Features:

Bug Fixes:

Minor Improvements:

Deprecated Features:

release-2018-02-07(Feb 7, 2018)

New Levels:

New Features:

Minor Improvements:

Bug Fixes:

release-2018-01-26(Jan 26, 2018)

New Levels:

New Features:

Minor Improvements:

Bug Fixes:

release-2016-12-06(Dec 8, 2016)

Owner

DeepMind

Open world survival environment for reinforcement learning

Rethinking the Importance of Implementation Tricks in Multi-Agent Reinforcement Learning

A customisable 3D platform for agent-based AI research

A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)

Modular Deep Reinforcement Learning framework in PyTorch. Companion library of the book "Foundations of Deep Reinforcement Learning".

A collection of various RL algorithms like policy gradients, DQN and PPO. The goal of this repo will be to make it a go-to resource for learning about RL. How to visualize, debug and solve RL problems. I've additionally included playground.py for learning more about OpenAI gym, etc.

A toolkit for reproducible reinforcement learning research.

A toolkit for developing and comparing reinforcement learning algorithms.

Deep Reinforcement Learning for Keras.

TensorFlow Reinforcement Learning

Doom-based AI Research Platform for Reinforcement Learning from Raw Visual Information. :godmode:

A general-purpose multi-agent training framework.

Retro Games in Gym

Paddle-RLBooks is a reinforcement learning code study guide based on pure PaddlePaddle.

This is the official implementation of Multi-Agent PPO.

Monitor your el-cheapo UPS via SNMP

Tensorforce: a TensorFlow library for applied reinforcement learning

ChainerRL is a deep reinforcement learning library built on top of Chainer.

TF-Agents: A reliable, scalable and easy to use TensorFlow library for Contextual Bandits and Reinforcement Learning.

An open source robotics benchmark for meta- and multi-task reinforcement learning