Needle-in-a-Haystack problems exist across a wide range of applications
including rare disease prediction, ecological resource management, fraud
detection, and material property optimization. A Needle-in-a-Haystack problem
arises when there is an extreme imbalance of optimum conditions relative to the
size of the dataset. For example, only 0.82% of the 146k materials in the open-access Materials Project database have a negative Poisson's ratio.
However, current state-of-the-art optimization algorithms are not designed to find solutions to these challenging multidimensional Needle-in-a-Haystack problems, resulting in slow convergence to the global optimum or pigeonholing into a local minimum. In this paper, we present a
Zooming Memory-Based Initialization algorithm, entitled ZoMBI. ZoMBI actively extracts knowledge from the best-performing previously evaluated experiments to iteratively zoom in the sampling search bounds towards the global optimum "needle" and then prunes the memory of low-performing historical experiments to
accelerate compute times by reducing the algorithm's time complexity from O(n³) to O(ϕ³) for ϕ forward experiments per activation, which trends toward a constant O(1) over several activations. Additionally, ZoMBI
implements two custom adaptive acquisition functions to further guide the
sampling of new experiments toward the global optimum. We validate the
algorithm's optimization performance on three real-world datasets exhibiting
Needle-in-a-Haystack properties and further stress-test its performance on an
additional 174 analytical datasets. The ZoMBI algorithm demonstrates compute
time speed-ups of 400x compared to traditional Bayesian optimization and efficiently discovers optima in under 100 experiments that are up to 3x more highly optimized than those discovered by the similar methods MiP-EGO, TuRBO, and HEBO.
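
The zooming and memory-pruning mechanism summarized above can be sketched in a few lines of Python. The snippet below is an illustrative sketch only, not the reference ZoMBI implementation: it assumes a scikit-learn Gaussian-process surrogate, a toy 2-D "needle" objective, a simple lower-confidence-bound acquisition optimized over random candidates, and made-up values for the number of activations, the ϕ forward experiments per activation, and the number m of retained memory points.

```python
import numpy as np
from sklearn.gaussian_process import GaussianProcessRegressor
from sklearn.gaussian_process.kernels import Matern

rng = np.random.default_rng(0)

def needle(x):
    """Toy 2-D objective: a sharp minimum ('needle') at (0.7, 0.3) on a nearly flat landscape."""
    return -np.exp(-np.sum((x - np.array([0.7, 0.3])) ** 2) / 1e-3)

def zooming_memory_minimize(f, dims=2, activations=4, phi=20, m=5):
    """Minimal zooming memory-based loop (illustrative only, not the reference ZoMBI code).

    Each activation runs `phi` forward experiments with a GP surrogate, then the
    search bounds are zoomed to the bounding box of the `m` best points seen so
    far and the rest of the memory is pruned, so the next GP fits O(phi) points
    instead of all n evaluations collected so far.
    """
    lower, upper = np.zeros(dims), np.ones(dims)
    X = rng.uniform(lower, upper, size=(m, dims))          # initial experiments
    y = np.array([f(x) for x in X])

    for _ in range(activations):
        for _ in range(phi):
            gp = GaussianProcessRegressor(kernel=Matern(nu=2.5),
                                          alpha=1e-6, normalize_y=True)
            gp.fit(X, y)                                    # memory-pruned training set
            # Score random candidates inside the current (zoomed) bounds with a
            # simple lower-confidence-bound acquisition and pick the best one.
            cand = rng.uniform(lower, upper, size=(512, dims))
            mu, sd = gp.predict(cand, return_std=True)
            x_next = cand[np.argmin(mu - 2.0 * sd)]
            X = np.vstack([X, x_next])
            y = np.append(y, f(x_next))
        # Zoom: shrink the bounds to the box spanned by the m best experiments.
        idx = np.argsort(y)[:m]
        lower, upper = X[idx].min(axis=0), X[idx].max(axis=0)
        # Prune memory: keep only those m best points for the next activation.
        X, y = X[idx], y[idx]

    return X[np.argmin(y)], y.min()

x_best, y_best = zooming_memory_minimize(needle)
print("best point:", x_best, "best value:", y_best)
```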
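
The two custom adaptive acquisition functions are not defined in this abstract; their exact forms are given in the paper. As a generic illustration only, the sketch below uses hypothetical names (lcb_adaptive, lcb_abrupt) and made-up parameters (beta0, decay, tol) to show two common ways an acquisition function can adapt during sampling: a lower confidence bound whose exploration weight decays with the iteration count, and a variant that switches abruptly to pure exploitation once recent best values plateau. These are assumptions for illustration, not the paper's acquisition functions.

```python
import numpy as np

def lcb_adaptive(mu, sigma, iteration, beta0=3.0, decay=0.1):
    """Lower confidence bound whose exploration weight shrinks as more
    experiments are collected (minimize the returned score)."""
    beta = beta0 * np.exp(-decay * iteration)
    return mu - beta * sigma

def lcb_abrupt(mu, sigma, iteration, recent_best, tol=1e-3, **kwargs):
    """Variant that switches abruptly to pure exploitation (score = posterior mean)
    once the best observed value has stopped improving by more than `tol`."""
    if len(recent_best) >= 3 and np.ptp(recent_best[-3:]) < tol:
        return mu
    return lcb_adaptive(mu, sigma, iteration, **kwargs)

# Example: score four candidates from a surrogate's posterior mean/std.
mu = np.array([0.20, -0.10, 0.05, -0.30])
sigma = np.array([0.50, 0.05, 0.40, 0.01])
print(lcb_adaptive(mu, sigma, iteration=0))                                     # exploration-heavy
print(lcb_abrupt(mu, sigma, iteration=40, recent_best=[-0.29, -0.30, -0.30]))   # pure exploitation
```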