Toggle light / dark theme

By Planning in the Latent Space of a Learned World Model. The world model Director builds from pixels allows effective planning in a latent space. To anticipate future model states given future actions, the world model first maps pictures to model states. Director optimizes two policies based on the model states’ anticipated trajectories: Every predetermined number of steps, the management selects a new objective, and the employee learns to accomplish the goals using simple activities. The direction would have a difficult control challenge if they had to choose plans directly in the high-dimensional continuous representation space of the world model. To reduce the size of the discrete codes created by the model states, they instead learn a goal autoencoder. The goal autoencoder then transforms the discrete codes into model states and passes them as goals to the worker after the manager has chosen them.

Deep reinforcement learning advancements have accelerated the study of decision-making in artificial agents. Artificial agents may actively affect their environment by moving a robot arm based on camera inputs or clicking a button in a web browser, in contrast to generative ML models like GPT-3 and Imagen. Although artificial intelligence has the potential to aid humans more and more, existing approaches are limited by the necessity for precise feedback in the form of often given rewards to acquire effective techniques. For instance, even robust computers like AlphaGo are restricted to a certain number of moves before earning their next reward while having access to massive computing resources.

Contrarily, complex activities like preparing a meal necessitate decision-making at all levels, from menu planning to following directions to the shop to buy supplies to properly executing the fine motor skills required at each stage along the way based on high-dimensional sensory inputs. Artificial agents can complete tasks more independently with scarce incentives thanks to hierarchical reinforcement learning (HRL), which automatically breaks down complicated tasks into achievable subgoals. Research on HRL has, however, been difficult because there is no universal answer, and existing approaches rely on manually defined target spaces or subtasks.

The bottomless bucket is Karl Marx’s utopian creed: “From each according to his ability, to each according to his needs.” In this idyllic world, everyone works for the good of society, with the fruits of their labor distributed freely — everyone taking what they need, and only what they need. We know how that worked out. When rewards are unrelated to effort, being a slacker is more appealing than being a worker. With more slackers than workers, not nearly enough is produced to satisfy everyone’s needs. A common joke in the Soviet Union was, “They pretend to pay us, and we pretend to work.”

In addition to helping those who in the great lottery of life have drawn blanks, governments should adopt myriad policies that expand the economic pie, including education, infrastructure, and the enforcement of laws and contracts. Public safety, national defense, dealing with externalities are also important. There are many legitimate government activities and there are inevitably tradeoffs. Governing a country is completely different from playing a simple, rigged distribution game.

I love computers. I use them every day — not just for word processing but for mathematical calculations, statistical analyses, and Monte Carlo simulations that would literally take me several lifetimes to do by hand. Computers have benefited and entertained all of us. However, AI is nowhere near ready to rule the world because computer algorithms do not have the intelligence, wisdom, or commonsense required to make rational decisions.

An innovative new collaboration between EPFL’s HexHive Laboratory and Oracle has developed automated, far-reaching technology in the ongoing battle between IT security managers and attackers, hoping to find bugs before the hackers do.

On the 9th of December 2021 the world of IT went into a state of shock. Before its developers even knew it, the log4j application—part of the Apache suite used on most web servers—was being exploited by hackers, allowing them to take control of servers and all over the world.

The Wall Street Journal reported news that nobody wanted to hear: “U.S. officials say hundreds of millions of devices are at risk. Hackers could use the bug to steal data, install malware or take control.”

It’s not that the particular labelers didn’t do a good job, it’s that they were given an impossible task.

There are no shortcuts to gleaning insight into human communications. We’re not stupid like machines are. We can incorporate our entire environment and lived history into the context of our communications and, through the tamest expression of our masterful grasp on semantic manipulation, turn nonsense into philosophy (shit happens) or turn a truly mundane statement into the punchline of an ageless joke (to get to the other side).

What these Google researchers have done is spent who knows how much time and money developing a crappy digital version of a Magic 8-Ball. Sometimes it’s right, sometimes it’s wrong, and there’s no way to be sure one way or another.

Judges must now consult the AI on every case by law, Beijing’s Supreme Court said in an update on the system published this week, and if they go against its recommendation they must submit a written explanation for why.

The AI has also been connected to police databases and China’s Orwellian social credit system, handing it the power to punish people — for example by automatically putting a thief’s property up for sale online.

Beijing has hailed the new technology for making ‘a significant contribution to the judicial advancement of human civilisation’ — while critics say it risks creating a world in which man is ruled by machine.

A DeepMind research group conducted a comprehensive generalization study on neural network architectures in the paper ‘Neural Networks and the Chomsky Hierarchy’, which investigates whether insights from the theory of computation and the Chomsky hierarchy can predict the actual limitations of neural network generalization.

While we understand that developing powerful machine learning models requires an accurate generalization to out-of-distribution inputs. However, how and why neural networks can generalize on algorithmic sequence prediction tasks is unclear.

The research group performed a thorough generalization study on more than 2000 individual models spread across 16 tasks of cutting-edge neural network architectures and memory-augmented neural networks on a battery of sequence-prediction tasks encompassing all tiers of the Chomsky hierarchy that can be evaluated practically with finite-time computation.

We’re proud to be a platinum sponsor of ICML, the annual conference on machine learning. Learn about Amazon’s presence at the conference, accepted publications,… See more.


The International Conference on Machine Learning (ICML) is the premier gathering of professionals dedicated to the advancement of the branch of artificial intelligence known as machine learning. The conference is globally renowned for presenting and publishing cutting-edge research on all aspects of machine learning used in closely related areas like artificial intelligence, statistics and data science, as well as important application areas such as machine vision, computational biology, speech recognition, and robotics.

Early detection and identification of pathogenic bacteria in food and water samples are essential to public health. Bacterial infections cause millions of deaths worldwide and bring a heavy economic burden, costing more than 4 billion dollars annually in the United States alone. Among pathogenic bacteria, Escherichia coli (E. coli) and other coliform bacteria are among the most common ones, and they indicate fecal contamination in food and water samples. The most conventional and frequently used method for detecting these bacteria involves culturing of the samples, which usually takes 24 hours for the final read-out and needs expert visual examination. Although some methods based on, for example, the amplification of nucleic acids, can reduce the detection time to a few hours, they cannot differentiate live and dead bacteria and present low sensitivity at low concentrations of bacteria. That is why the U.S. Environmental Protection Agency (EPA) approves no nucleic acid-based bacteria sensing method for screening water samples.

In an article recently published in ACS Photonics, a journal of the American Chemical Society (ACS), a team of scientists, led by Professor Aydogan Ozcan from the Electrical and Computer Engineering Department at the University of California, Los Angeles (UCLA), and co-workers have developed an AI-powered smart bacterial colony detection system using a thin-film transistor (TFT) array, which is a widely used technology in mobile phones and other displays.

The ultra-large imaging area of the TFT array (27 mm × 26 mm) manufactured by researchers at Japan Display Inc. enabled the system to rapidly capture the growth patterns of bacterial colonies without the need for scanning, which significantly simplified both the hardware and software design. This system achieved ~12-hour time savings compared to gold-standard culture-based methods approved by EPA. By analyzing the microscopic images captured by the TFT array as a function of time, the AI-based system could rapidly and automatically detect colony growth with a deep neural network. Following the detection of each colony, a second neural network is used to classify the species.

Using reinforcement learning (RL) to train robots directly in real-world environments has been considered impractical due to the huge amount of trial and error operations typically required before the agent finally gets it right. The use of deep RL in simulated environments has thus become the go-to alternative, but this approach is far from ideal, as it requires designing simulated tasks and collecting expert demonstrations. Moreover, simulations can fail to capture the complexities of real-world environments, are prone to inaccuracies, and the resulting robot behaviours will not adapt to real-world environmental changes.

The Dreamer algorithm proposed by Hafner et al. at ICLR 2020 introduced an RL agent capable of solving long-horizon tasks purely via latent imagination. Although Dreamer has demonstrated its potential for learning from small amounts of interaction in the compact state space of a learned world model, learning accurate real-world models remains challenging, and it was unknown whether Dreamer could enable faster learning on physical robots.

In the new paper DayDreamer: World Models for Physical Robot Learning, Hafner and a research team from the University of California, Berkeley leverage recent advances in the Dreamer world model to enable online RL for robot training without simulators or demonstrations. The novel approach achieves promising results and establishes a strong baseline for efficient real-world robot training.

Research in the field of machine learning and AI, now a key technology in practically every industry and company, is far too voluminous for anyone to read it all. This column, Perceptron, aims to collect some of the most relevant recent discoveries and papers — particularly in, but not limited to, artificial intelligence — and explain why they matter.

In this batch of recent research, Meta open-sourced a language system that it claims is the first capable of translating 200 different languages with “state-of-the-art” results. Not to be outdone, Google detailed a machine learning model, Minerva, that can solve quantitative reasoning problems including mathematical and scientific questions. And Microsoft released a language model, Godel, for generating “realistic” conversations that’s along the lines of Google’s widely publicized Lamda. And then we have some new text-to-image generators with a twist.

Meta’s new model, NLLB-200, is a part of the company’s No Language Left Behind initiative to develop machine-powered translation capabilities for most of the world’s languages. Trained to understand languages such as Kamba (spoken by the Bantu ethnic group) and Lao (the official language of Laos), as well as over 540 African languages not supported well or at all by previous translation systems, NLLB-200 will be used to translate languages on the Facebook News Feed and Instagram in addition to the Wikimedia Foundation’s Content Translation Tool, Meta recently announced.