Applying machine learning to the exploration of physicochemical systems
Last update: January 2019
How can AI help make discoveries on the bench in a chemistry lab? How can chemical systems help devise better AI algorithms? I explored those questions with the team I lead in the Cronin Group. The group's cutting-edge knowledge, combined with my experience from the developmental sciences, allowed us to bring innovative ideas to the field, including the use of active learning, curiosity-driven exploration, and human comparative studies.
Compilation of droplet behaviors – one of the systems we studied.
Compilation of robotic platforms designed by the team.
Approach
I grew frustrated that most AI research was done in simulated environments. I wanted to see this technology used in the real world, on systems we neither design nor understand, which rules out even robots.
The physical sciences seem fitting. In physics, chemistry, or biology, an important part of everyday research is to understand as-yet-unknown systems. Scientists devise experiments and build instruments to query these systems with the aim of modeling, and later predicting, their evolution.
But because such systems are not yet fully understood, or are massively parallel, they cannot be simulated. There is no cheating with the physical sciences: you cannot speed up experiments, you cannot be 100% sure that you are controlling the right experimental variables, and you do not know the right answer. That is all the fun of it.
This is in sharp contrast with the simulation, big data, and controlled robotic problems AI algorithms are designed on and for. [1] If we want to bring AI advances to experimental research in the labs, we need to drop the habit of thinking that experiments are free, unlimited, and well defined. That habit comes from the extraordinary power of computers and can sometimes turn AI research into a parameter tuning and stacking game.
Interestingly, a subfield of robotics called developmental robotics imposes similar constraints on itself. Its aim is to build a robot that can learn like a child. A child faces real-world constraints: he has limited time to interact with the world and no complete built-in knowledge of it. From his own body to modern objects like smartphones and bikes, a child has to learn it all in a few years by interacting with the world. To do so, a child constantly devises his own experiments to test the world around him, much like a scientist in the lab.
As it turns out, I pursued my PhD under the supervision of Pierre-Yves Oudeyer, one of the pioneers of this field. Once I connected the dots, an ocean of research directions opened up to bring principles from developmental robotics into the physical sciences. How can a laboratory robot be curious and devise its own questions about a new system? How could it learn a hierarchy of skills to produce increasingly complex molecules autonomously? How could such machines learn from and collaborate with scientists? Could maturational constraints and the concept of embodiment help leverage the exploration power of algorithms on natural systems?
Hence came the decision to join a chemistry lab for a postdoc.
Team
Lee Cronin, a pioneer in the digitization of chemistry, welcomed me into his lab and gave me the responsibility of an internal team to explore these questions. The group had already published work showcasing the robotically assisted optimization of oil-in-water droplets, a suitable system for our experiments.
Projects
Curiosity driven exploration of oil-in-water droplets
We developed a chemical robotic discovery assistant equipped with a curiosity algorithm (CA) that was set to explore the states a complex chemical system can exhibit, searching for novelty. This differs from an optimization scenario, where a global target is defined beforehand.
The Dropfactory robot is able to perform, record, and clean 8 droplet experiments in parallel.
By applying the CA-robot to the study of self-propelling multicomponent oil-in-water droplets, we were able to observe an order of magnitude more variety in droplet behaviours than is possible with a random parameter search given the same budget. We demonstrated that the CA-robot enabled the discovery of a sudden and highly specific response of droplets to slight temperature changes.
Once the temperature dependence was discovered, six modes of self-propelled droplet motion were identified and classified using a time-temperature phase diagram and probed using a variety of techniques including NMR, which allowed the design of a payload release system triggered by temperature. This work illustrates how target-free search can significantly increase the rate of unpredictable observations leading to new discoveries, with potential applications in formulation chemistry.
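To make the contrast between target-free exploration and random parameter search more concrete, here is a minimal sketch in Python. It is not the algorithm run on the robot. It assumes a simple random goal exploration strategy, and `toy_experiment`, the two-parameter recipe, and the grid-based `coverage` measure are all hypothetical stand-ins for real droplet experiments and their analysis.

```python
import math
import random


def toy_experiment(params):
    """Hypothetical stand-in for a droplet experiment.

    Maps a 2-parameter recipe in [0, 1]^2 to two observed features in (0, 1).
    Most recipes give near-identical observations; only narrow regions of the
    parameter space produce different behaviours.
    """
    x, y = params
    speed = 1.0 / (1.0 + math.exp(-20.0 * (x - 0.8)))
    division = 1.0 / (1.0 + math.exp(-20.0 * (y - 0.8)))
    return (speed, division)


def clip01(value):
    """Keep a parameter inside the allowed [0, 1] range."""
    return min(1.0, max(0.0, value))


def random_search(budget, rng):
    """Baseline: draw every recipe uniformly at random."""
    history = []
    for _ in range(budget):
        params = (rng.random(), rng.random())
        history.append((params, toy_experiment(params)))
    return history


def goal_exploration(budget, rng, n_bootstrap=10, sigma=0.05):
    """Assumed curiosity strategy: aim at random goals in observation space.

    After a few random bootstrap experiments, each step picks a random goal
    observation, finds the past experiment whose observation was closest to
    that goal, and tries a small perturbation of its recipe.
    """
    history = []
    for i in range(budget):
        if i < n_bootstrap:
            params = (rng.random(), rng.random())
        else:
            goal = (rng.random(), rng.random())
            nearest_params, _ = min(history, key=lambda h: math.dist(h[1], goal))
            params = tuple(clip01(p + rng.gauss(0.0, sigma)) for p in nearest_params)
        history.append((params, toy_experiment(params)))
    return history


def coverage(history, bins=10):
    """Count how many cells of a bins x bins grid the observations fall into."""
    cells = {
        (min(int(obs[0] * bins), bins - 1), min(int(obs[1] * bins), bins - 1))
        for _, obs in history
    }
    return len(cells)


if __name__ == "__main__":
    budget = 100
    print("random search coverage:", coverage(random_search(budget, random.Random(0))))
    print("goal exploration coverage:", coverage(goal_exploration(budget, random.Random(0))))
```

The point of the design is that goals are sampled in the space of observations rather than in the space of recipes, so the search is drawn towards behaviours it has rarely seen instead of towards a predefined target.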
Effect of temperature on droplet motion.
Resources
- A curious formulation robot enables the discovery of a novel protocell behavior. Grizou, J., Points, L. J., Sharma, A. & Cronin, L. (2020). Science Advances. [pdf] [doi] [robot design] [experiment code] [analysis code] [project]
Active selection of crystallization experiments and comparison with human methodology
The discovery of new gigantic molecules formed by self-assembly and crystal growth is challenging as it combines two contingent events: first the formation of a new molecule, and second its crystallization. We constructed an automated workflow that can probe both events and employed it for a new polyoxometalate cluster which has a trigonal-ring type architecture.
The synthesis and crystallization were probed using an active machine-learning algorithm to explore the crystallization space. The algorithm's results were compared with those obtained by human experimenters and by a random sampling approach.
A critical constraint was that we could only perform 10 experiments per day. We ran the active learning in batches of 10 experiments and performed 10 iterations, for a total of 100 experiments for each method and each repeat. This means that 10 days and a significant amount of reagent were needed just to validate the approach the first time. Our results showed that the active learning algorithmic search increased the crystallization prediction accuracy to 82.4 ± 0.7%, compared with 77.1 ± 0.9% for human experimenters given the same experimental budget.
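The batch structure described above (10 iterations of 10 experiments, 100 experiments in total) can be sketched as follows. This is only an illustration under stated assumptions: it uses uncertainty sampling with a small k-nearest-neighbour estimate, which may differ from the method used in the papers, and `toy_crystallizes`, the two-dimensional condition space, and the candidate pool are hypothetical.

```python
import math
import random


def toy_crystallizes(condition):
    """Hypothetical oracle standing in for a real crystallization experiment.

    Crystals form only inside a disc of the 2-D condition space.
    """
    return math.dist(condition, (0.6, 0.4)) < 0.3


def knn_probability(condition, labelled, k=5):
    """Estimate the probability of crystallization from the k nearest results."""
    if not labelled:
        return 0.5  # no data yet: maximally uncertain everywhere
    nearest = sorted(labelled, key=lambda item: math.dist(item[0], condition))[:k]
    return sum(1 for _, outcome in nearest if outcome) / len(nearest)


def active_learning(pool, batch_size=10, n_iterations=10, seed=0):
    """Run batched uncertainty sampling over a pool of candidate conditions.

    Each iteration (one day of lab work in the text above) selects the
    batch_size candidates whose predicted probability is closest to 0.5,
    runs them, and adds the outcomes to the labelled set.
    """
    rng = random.Random(seed)
    pool = list(pool)
    rng.shuffle(pool)  # ties are broken at random, so the first batch is random
    labelled = []
    for _ in range(n_iterations):
        pool.sort(key=lambda c: abs(knn_probability(c, labelled) - 0.5))
        batch, pool = pool[:batch_size], pool[batch_size:]
        labelled.extend((c, toy_crystallizes(c)) for c in batch)
    return labelled


if __name__ == "__main__":
    candidates = [(i / 19, j / 19) for i in range(20) for j in range(20)]
    results = active_learning(candidates)
    print(len(results), "experiments,", sum(y for _, y in results), "crystals")
```

With a budget this small, every batch matters, which is why the selection step concentrates experiments near the predicted boundary between crystallizing and non-crystallizing conditions.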
Resources
- Intuition-enabled Machine Learning beats the Competition when Joint Human-Robot Teams perform Inorganic Chemical Experiments. Duros, V., Grizou, J., Sharma, A., Mehr, S.H.M., Bubliauskas, A., Frei, P., Miras, H.N. and Cronin, L. (2019). Journal of Chemical Information and Modeling. [pdf] [journal] [project]
- Human vs Robots in the Discovery and Crystallization of Gigantic Polyoxometalates. Grizou, J., Duros, V., Xuan, W., Hosni, Z., Long, D.-L., Miras H., Cronin L. (2017). Angewandte Chemie 129.36: 10955-10960. [pdf] [SI] [code] [journal]
Hierarchical exploration on gold nanoparticles
We developed a liquid-handling robot built with the aim of evolving the size and shape of gold nanoparticles (AuNPs) as a function of composition via an automated evolutionary process. The robot synthesises nanoparticles by mixing reagents in different ratios, placing them in vials, and extracting them for in-line UV-Vis analysis, which is used to compute a fitness value.
The project aimed at exploring how we could reuse the results of genetic algorithm experiments (e.g. the evolution of nano-spheres) to evolve more complex nanoparticles such as nano-rods. The diagram below illustrates this process.
Over successive cycles of the genetic algorithm (GA), the fitness at each stage increased with every generation, leading towards the desired outcome. Our system could recursively synthesise gold nanospheres, gold nanorods (AuNRs), and arrow-headed gold nanorods with high reproducibility.
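As an illustration of the evolutionary loop, here is a minimal genetic algorithm sketch over reagent ratios. The fitness used on the robot was computed from UV-Vis spectra; the `toy_fitness` function, the `TARGET` recipe, and all parameter values below are hypothetical and only serve to make the loop runnable.

```python
import random

# Hypothetical "ideal" reagent ratios; on the robot, fitness comes from UV-Vis spectra.
TARGET = (0.2, 0.5, 0.3)


def toy_fitness(recipe):
    """Hypothetical fitness: higher when the recipe is closer to TARGET."""
    return -sum((r - t) ** 2 for r, t in zip(recipe, TARGET))


def normalise(values):
    """Rescale positive values so that the reagent ratios sum to 1."""
    total = sum(values)
    return tuple(v / total for v in values)


def mutate(recipe, rng, sigma=0.05):
    """Add Gaussian noise to each ratio, keeping every ratio positive."""
    return normalise([max(1e-6, r + rng.gauss(0.0, sigma)) for r in recipe])


def crossover(parent_a, parent_b, rng):
    """Build a child by picking each ratio from one of the two parents."""
    return normalise([rng.choice(pair) for pair in zip(parent_a, parent_b)])


def evolve(n_generations=10, population_size=12, n_elite=3, seed=0):
    """Run a small elitist GA and return the final population and best fitness history."""
    rng = random.Random(seed)
    population = [
        normalise([rng.random() + 1e-6 for _ in TARGET]) for _ in range(population_size)
    ]
    best_per_generation = []
    for _ in range(n_generations):
        ranked = sorted(population, key=toy_fitness, reverse=True)
        best_per_generation.append(toy_fitness(ranked[0]))
        elite = ranked[:n_elite]  # the best recipes survive unchanged
        children = [
            mutate(crossover(rng.choice(elite), rng.choice(elite), rng), rng)
            for _ in range(population_size - n_elite)
        ]
        population = elite + children
    return population, best_per_generation


if __name__ == "__main__":
    final_population, history = evolve()
    print("best fitness per generation:", [round(f, 4) for f in history])
```

Because the elite recipes are carried over unchanged and this toy fitness is noise-free, the best fitness can never decrease here. On a physical platform each recipe would have to be re-measured, so measurement noise can make the observed fitness fluctuate between generations.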
Resources
- A Nanomaterials Discovery Robot for the Darwinian Evolution of Shape Programmable Gold Nanoparticles. Keenan, G., Salley, D., Martín, S., Grizou, J., Sharma, A. and Cronin, L. (2020). Nature Communications. [pdf] [doi] [project]
Physical environment as an experiment variable
Evolution via natural selection is governed by the persistence and propagation of living things in an environment. The environment is important since it contributes to shaping evolution. Although evolution has been widely studied in a variety of fields from biology to computer science, still little is known about the impact of environmental changes on an artificial chemical evolving system outside of computer simulations. We developed a fully automated 3D-printed chemorobotic fluidic system that is able to generate and select droplet protocells in real time while changing the surroundings where they undergo artificial evolution.
Compared to our previous work, the use of a 3D-printed device enables us to change the physical environment in which the droplet population evolves. The 3D-printed platform also innovates in that no moving parts are required to initiate, run, and clean droplet experiments.
Trailer of the platform in action.
Using such 3D-printed devices, we were able to change the physical environment in which the droplet population evolves. We thus explicitly studied how the genotype is modulated through a programmable environment to express its phenotype, in contrast to the more studied genotype to phenotype direct approach.
Droplet recipes evolving through a genetic algorithm in different arenas.
By successively evolving the droplets from one environment to another, we were able to observe disruption in their evolutionary trajectories, as well as adaptation of their genome.
Resources
- Adaptive artificial evolution of droplet protocells in a 3D-printed fluidic chemorobotic platform with configurable environments. Parrilla-Gutierrez, J. M., Tsuda, S., Grizou, J., Taylor, J., Henson, A., & Cronin, L. (2017). Nature communications, 8(1), 1144. [pdf] [journal]
AI as a tool to understand complex systems
Exploring and understanding the emergence of complex behaviors is difficult even in "simple" chemical systems since the dynamics can rest on a knife edge between stability and instability. We studied the dynamics of oil droplets in an aqueous environment using an automated platform equipped with artificial intelligence. Compared to previous work, we increased the number of experimental parameters and allowed the robot to choose both the concentration of oils in the droplets and the concentration of surfactants in the aqueous phase. Our aim was to identify more droplet behaviors and better understand the mechanisms behind them.
Operation of the platform.
To test the platform, we started by running evolutionary experiments and comparing them to our previous work.
Comparison of speed achieved with the oil-only, aqueous-only, and aqueous-oil optimizations.
By opening the parameter space to the chemical environment of the droplets, we were able to generate faster-moving droplets. The recipes and data generated from this process were then used for physicochemical analysis, where traditional chemical analysis, machine learning, and archetypal droplet experiments were used to study the behavioral mechanisms and to predict droplet behaviors.
The data acquired were used to build predictive models of the system, for which no physical model, or only inaccurate ones, was available.
Physical properties such as viscosity, surface tension, and density are shown to be related to behaviors, as well as to droplet behavioral niches, such as collective swarming. By dyeing droplets with phenolphthalein, we could observe the internal dynamics within the droplets, which depend on their composition. For example, pentanol goes very pink, has rapid flows, and dissolves, while DEP only goes pink at the interface.
Dye experiments showing the complex flow of chemicals inside the droplets.
Resources
- Artificial intelligence exploration of unstable protocells leads to predictable properties and discovery of collective behavior. Points, L. J., Taylor, J. W., Grizou, J., Donkers, K., & Cronin, L. (2018). PNAS, 115 (5) 885-890. [pdf] [journal]
Black box optimization of oil-in-water droplets
This work focuses on the optimization of dynamic properties of droplets on a robotic platform. It was completed before I joined the Cronin group, and made me realize that it had become possible to apply a range of new exploration methods, freshly developed in computer science research groups, directly to physical systems in the lab.
The authors presented a liquid-handling robot built with the aim of investigating the properties of oil droplets as a function of composition via an automated evolutionary process. The robot makes the droplets by mixing four different compounds in different ratios and placing them in a Petri dish. The droplets are then recorded using a camera, and their behaviour is analysed using image recognition software to give a fitness value.
The first droplet robot made in the Cronin group.
In separate experiments, the fitness function discriminates based on movement, division and vibration over 21 cycles, giving successive fitness increases. Analysis and theoretical modelling of the data yields fitness landscapes analogous to the genotype–phenotype correlations found in biological evolution.
Resources
- Evolution of oil droplets in a chemorobotic platform. Gutierrez, J. M. P., Hinkley, T., Taylor, J. W., Yanev, K., & Cronin, L. (2014). Nature communications, 5, 5571. [pdf] [journal]
Resources
Workshop
In 2017, we organized a workshop providing an extensive overview of this research. [pdf] [slides] [website]
Code and tools
We have developed a collection of software and hardware tools and made them open source to help further research in this area. These tools are described on the open robotics project page and are available on the Cronin group GitHub account.
Publications
- A curious formulation robot enables the discovery of a novel protocell behavior. Grizou, J., Points, L. J., Sharma, A. & Cronin, L. (2020). Science Advances. [pdf] [doi] [robot design] [experiment code] [analysis code] [project]
- A Nanomaterials Discovery Robot for the Darwinian Evolution of Shape Programmable Gold Nanoparticles. Keenan, G., Salley, D., Martín, S., Grizou, J., Sharma, A. and Cronin, L. (2020). Nature Communications. [pdf] [doi] [project]
- Intuition-enabled Machine Learning beats the Competition when Joint Human-Robot Teams perform Inorganic Chemical Experiments. Duros, V., Grizou, J., Sharma, A., Mehr, S.H.M., Bubliauskas, A., Frei, P., Miras, H.N. and Cronin, L. (2019). Journal of Chemical Information and Modeling. [pdf] [journal] [project]
- Artificial intelligence exploration of unstable protocells leads to predictable properties and discovery of collective behavior. Points, L. J., Taylor, J. W., Grizou, J., Donkers, K., & Cronin, L. (2018). PNAS – Proceedings of the National Academy of Sciences, 201711089. [pdf] [journal]
- Adaptive artificial evolution of droplet protocells in a 3D-printed fluidic chemorobotic platform with configurable environments. Parrilla-Gutierrez, J. M., Tsuda, S., Grizou, J., Taylor, J., Henson, A., & Cronin, L. (2017). Nature communications, 8(1), 1144. [pdf] [journal]
- The evolution of active droplets in chemorobotic platforms. Points, L. J., Grizou, J., Gutierrez, J. M. P., Taylor, J. W., & Cronin, L. (2017). Artificial Life Conference Proceedings 14. [pdf]
- Human vs Robots in the Discovery and Crystallization of Gigantic Polyoxometalates. Grizou, J., Duros, V., Xuan, W., Hosni, Z., Long, D.-L., Miras H., Cronin L. (2017). Angewandte Chemie 129.36: 10955-10960. [pdf] [SI] [code] [journal]
- Developmental robotics in a chemistry lab. Grizou, J., Points L.J., Cronin, L. (2017). Development and Learning and Epigenetic Robotics (ICDL-Epirob), 2017 Joint IEEE International Conferences on. [pdf] [slides] [website]
- Evolution of oil droplets in a chemorobotic platform. Gutierrez, J. M. P., Hinkley, T., Taylor, J. W., Yanev, K., & Cronin, L. (2014). Nature communications, 5, 5571. [pdf] [journal]
Personal Notes
The above work – done between 2015 and 2018 – is the application of ideas from a small subfield of machine learning and robotics, called developmental robotics (see my PhD lab), to an emerging field at the intersection of robotics, AI, and chemistry.
I believe the projects we showcased provide a good window into future applications of AI and robotics in the formulation sciences, more specifically in R&D departments, where they can help find new formulations for consumer goods and medical treatments under ever-evolving constraints from regulation (forbidden compounds, environmental preservation), resource exhaustion, and budget optimization. The research presented above can significantly reduce the time and resources needed for the development of new products. Especially useful will be machines that can explore rather than optimize, and collaborate with humans rather than replace them.
I also believe that constraints from working on physical systems in the labs can be beneficial to the field of AI. When the only way to acquire data is to perform time-consuming and costly experiments, AI practitioners are forced to redefine their working methodology, which might change the focus of what matters when designing new algorithms.
I am eager to see this research applied outside the lab. Do not hesitate to get in touch to share ideas or discuss potential projects.
Footnotes
[1] There are good reasons for that: convenience and speed for fundamental research is one, and another is that a lot of problems worth solving can be captured adequately by data and simulation. My point focuses on hands-on experiments in laboratories.