Opt¶
In this chapter, we will learn about structural optimization (also called structural relaxation, structure optimization or structure relaxation), which calculates the local stability coordinates of a system and its energy.
There are two types of optimization algorithms: global optimization and local optimization. We focus our attention on local optimization methods for structural optimization in this chapter.
The coordinates of a system consisting of N atoms have \(3*N\) dimensions of freedom, for example, a system consisting of 100 atoms is an optimization problem in a high-dimensional space of 300 dimensions. Finding a globally optimal solution is generally a difficult problem.
Structural optimization finds the system’s stable structure by determining the system’s initial coordinates and finding local stable points from there. Even when the composition of a substance (combination of elements and their proportions, e.g., NaCl) is known, it is not apparent what its structure is. Determining the stable structure is essential because many properties of a substance are determined from its state near its stable structure.
The following analytical and measuring instruments may be used to determine the structure experimentally.
However, even in that case
Only partial information on the material can be obtained from the spectral information, and information on the 3D structure cannot be directly observed (1~6)
It is necessary to prepare structures such as single crystals or thin films, and it is sometimes difficult to prepare such samples in reality (3, 8)
Lack of resolution for atomic scale observation (7)
Although the shape of the surface can be observed, it is not possible to distinguish the type of atoms and molecules (9)
It is not easy to determine how each atom is arranged in the material we are dealing with and to identify the three-dimensional structure of the material.
Examples of analytical instruments:
Name |
Description |
---|---|
|
Obtain information on molecular structure by measuring infrared absorption due to molecular vibrations. |
|
Raman scattering light is used to evaluate materials. |
|
Identification and quantification of substances by analyzing diffraction obtained by a beam of incident X-rays to crystal structures. |
|
X-rays are irradiated to a material, and the photoelectrons e- emitted as a result of ionization of the material are measured for energy analysis. |
|
Accurately measures the mass of individual compounds that make up a substance. |
|
Analyze organic compounds using the phenomenon of nuclear resonance. |
|
Observation of surfaces with a resolution of 0.4~4nm using electron beams with short wavelengths. |
|
Observation of microscopic substances by the intensity of the electron beam transmitted through the object. |
|
Microscope to detect the atomic forces acting between the tip and the sample. |
Therefore, it is vital to estimate which structure is likely to be in the simulation and verify it in the experiment (or vice versa).
Opt molecules¶
First, let us try to optimize the structure of a single organic molecule without periodic boundary conditions as a simple example.
1. Hydrogen molecule¶
First, we will use structural optimization to find the bond length for the most stable structure of the hydrogen molecule, which we did in the previous chapter.
The flow will be as follows
Step 1: Prepare a calculator to calculate potential energy.
Step 2: Prepare molecules for structural optimization.
Step 3: Select a method for the structural optimization algorithm and run it.
[1]:
import pfp_api_client
from pfp_api_client.pfp.calculators.ase_calculator import ASECalculator
from pfp_api_client.pfp.estimator import Estimator, EstimatorCalcMode
print(f"pfp_api_client: {pfp_api_client.__version__}")
estimator = Estimator(calc_mode=EstimatorCalcMode.MOLECULE, model_version="v3.0.0")
calculator = ASECalculator(estimator)
pfp_api_client: 1.5.0
The calc_mode
of PFP Estimator
is used as follows.
General system:
EstimatorCalcMode.CRYSTAL_U0
, mode without U parameter.Oxides, etc.:
EstimatorCalcMode.CRYSTAL
, mode with U parameter.Single organic molecules:
EstimatorCalcMode.MOLECULE
, molecular mode.
Step 2: Prepare molecules for structural optimization
Here we create a hydrogen molecule. We will manually set the initial coordinates at a distance of 1.0A.
[2]:
from ase import Atoms
atoms = Atoms("H2", [[0, 0, 0], [0, 0, 1.0]])
atoms.calc = calculator
Step 3: Select and run the structural optimization algorithm method
As local optimization algorithms, ASE offers the FIRE
and MDMin
methods which perform optimization with gradient descent-like behavior, and the BFGS
, LBFGS
, BFGSLineSearch
=QuasiNewton
, LBFGSLineSearch
methods which perform optimization by considering second derivertive are provided. The differences and properties of these algorithms are explained in the later section.
This time, we will use LBFGS
to perform structural optimization. We will set atoms
to the local optimization algorithm and call the run
method to perform structural optimization. If you specify the trajectory
argument, you can save the structural optimization process.
[3]:
import os
from ase.optimize import LBFGS, BFGS, FIRE
os.makedirs("output", exist_ok=True)
opt = LBFGS(atoms, trajectory="output/bfgs_h2.traj")
opt.run(fmax=0.01)
Step Time Energy fmax
*Force-consistent energies used in optimization.
LBFGS: 0 02:19:43 -3.890512* 4.4993
LBFGS: 1 02:19:43 -4.402221* 3.2330
LBFGS: 2 02:19:43 -3.401438* 16.5629
LBFGS: 3 02:19:43 -4.551079* 2.2355
LBFGS: 4 02:19:43 -4.610468* 1.3879
LBFGS: 5 02:19:43 -4.641043* 0.3552
LBFGS: 6 02:19:43 -4.642630* 0.0576
LBFGS: 7 02:19:43 -4.642674* 0.0023
[3]:
True
In this simple example, the structural optimization was completed in 7 steps, and it took about 2 seconds in Matlantis.
Let’s visualize the process of structural optimization.
[4]:
from ase.visualize import view
from ase.io import Trajectory
from pfcc_extras.visualize.view import view_ngl
traj = Trajectory("output/bfgs_h2.traj")
view_ngl(traj, representations=["ball+stick"], w=400, h=300)
Plot how the atomic distances have changed during structural optimization.
[5]:
import numpy as np
# np.linalg.norm?
distance_list = []
E_pot_list = []
for atoms in traj:
p1, p2 = atoms.get_positions()
distance = np.linalg.norm(p1 - p2)
distance_list.append(distance)
E_pot_list.append(atoms.get_potential_energy())
print("Distance trajectory", distance_list)
Distance trajectory [1.0, 0.8714471953255789, 0.5432643813289825, 0.8178499966054394, 0.7851965174194478, 0.7317306649684048, 0.7426249533569255, 0.7411042704555475]
[6]:
import matplotlib.pyplot as plt
plt.plot(distance_list)
plt.xlabel("step")
plt.ylabel("Distance [A]")
plt.title("H2 opt trajectory")
plt.show()
Finally, it settled around 0.74
A.
In the previous chapter, we calculated the energies for all bond distances and found the lowest energy value, but we can efficiently find a stable structure by using structural optimization.
2. Cyclohexane¶
As a next example, consider the stable structure of cyclohexane.
Cyclohexane has the structure C6H12, but is known to take several conformations. Typical conformational isomers include chair (A), twist-boat (B), boat (C), and half-chair (D), as shown in the figure below.
The chair conformation is the most stable (lowest energy); at 25°C, 99.99% of all molecules in cyclohexane solution are in this conformation. In this section, we will prepare the initial structures of the chair shape (A) and the boat shape (C), perform structural optimization, and compare their shapes and potential energies.
Cite from cyclohexane conformation
Prepare initial coordinate structure
The chair shape is prepared from the structure published in PubChem.
3D structure image of CID 8078 Cyclohexane (Compound)
PubChem Identifier: CID 8078 URL: https://pubchem.ncbi.nlm.nih.gov/compound/8078#section=3D-Conformer
[7]:
from ase.io import read
chair_atoms = read("../input/Conformer3D_CID_8078.sdf")
view_ngl(chair_atoms, representations=["ball+stick"], w=400, h=300)
In this tutorial, let’s generate the boat shape by manually changing the coordinate values from the chair shape.
You can skip understanding the code even if you don’t know what it does. (We create atoms rotated 180 degrees, and only atoms 2, 10, and 11 are overwritten with the rotated coordinates.)
[8]:
boat_atoms = chair_atoms.copy()
axis = boat_atoms.positions[[1, 5]].mean(axis=0) - boat_atoms.positions[[0, 4]].mean(axis=0)
boat_atoms.rotate(180, axis)
# inds = [1, 3, 5, 8, 9, 12, 13, 16, 17]
inds = [2, 10, 11]
boat_atoms.positions[inds] = chair_atoms.positions[inds]
[9]:
view_ngl(boat_atoms, representations=["ball+stick"], w=400, h=300)
[10]:
chair_atoms.calc = calculator
opt = LBFGS(chair_atoms, trajectory="output/lbfgs_cyclohexane_chair.traj")
opt.run()
Step Time Energy fmax
*Force-consistent energies used in optimization.
LBFGS: 0 02:20:03 -76.855956* 0.1778
LBFGS: 1 02:20:03 -76.858225* 0.1151
LBFGS: 2 02:20:03 -76.860049* 0.0998
LBFGS: 3 02:20:03 -76.862485* 0.0499
[10]:
True
[11]:
boat_atoms.calc = calculator
opt = LBFGS(boat_atoms, trajectory="output/lbfgs_cyclohexane_boat.traj")
opt.run()
Step Time Energy fmax
*Force-consistent energies used in optimization.
LBFGS: 0 02:20:14 -71.025783* 5.3644
LBFGS: 1 02:20:14 -72.330446* 4.4228
LBFGS: 2 02:20:14 -74.196692* 3.2822
LBFGS: 3 02:20:14 -75.112910* 4.2950
LBFGS: 4 02:20:14 -75.580790* 2.7280
LBFGS: 5 02:20:14 -75.743022* 2.4649
LBFGS: 6 02:20:15 -75.854447* 1.8966
LBFGS: 7 02:20:15 -76.089088* 0.7839
LBFGS: 8 02:20:15 -76.161671* 0.6727
LBFGS: 9 02:20:15 -76.249390* 0.8271
LBFGS: 10 02:20:15 -76.300558* 0.7451
LBFGS: 11 02:20:15 -76.366199* 0.6365
LBFGS: 12 02:20:15 -76.410962* 0.5463
LBFGS: 13 02:20:15 -76.432972* 0.2407
LBFGS: 14 02:20:15 -76.442167* 0.2395
LBFGS: 15 02:20:15 -76.450881* 0.2134
LBFGS: 16 02:20:15 -76.458265* 0.1773
LBFGS: 17 02:20:15 -76.462880* 0.1930
LBFGS: 18 02:20:15 -76.466240* 0.1989
LBFGS: 19 02:20:15 -76.470400* 0.2080
LBFGS: 20 02:20:15 -76.476647* 0.2489
LBFGS: 21 02:20:15 -76.483845* 0.2404
LBFGS: 22 02:20:15 -76.489993* 0.1807
LBFGS: 23 02:20:15 -76.494768* 0.2158
LBFGS: 24 02:20:15 -76.499497* 0.2090
LBFGS: 25 02:20:16 -76.504614* 0.1982
LBFGS: 26 02:20:16 -76.509015* 0.2078
LBFGS: 27 02:20:16 -76.512713* 0.2074
LBFGS: 28 02:20:16 -76.517285* 0.1909
LBFGS: 29 02:20:16 -76.523674* 0.2553
LBFGS: 30 02:20:16 -76.530986* 0.2316
LBFGS: 31 02:20:16 -76.537306* 0.2286
LBFGS: 32 02:20:16 -76.542625* 0.2381
LBFGS: 33 02:20:16 -76.548161* 0.2532
LBFGS: 34 02:20:16 -76.554129* 0.2480
LBFGS: 35 02:20:16 -76.558687* 0.1736
LBFGS: 36 02:20:16 -76.560842* 0.0990
LBFGS: 37 02:20:16 -76.561720* 0.0792
LBFGS: 38 02:20:16 -76.562696* 0.0877
LBFGS: 39 02:20:16 -76.563681* 0.0767
LBFGS: 40 02:20:16 -76.564258* 0.0412
[11]:
True
Visualize and check the structure after structural optimization.
[12]:
view_ngl([chair_atoms, boat_atoms], representations=["ball+stick"], w=400, h=300, replace_structure=True)
Also, compare the energy difference between them.
The unit of energy is eV, but you can use the units
module to convert units.
[13]:
from ase.units import Bohr,Rydberg,kJ,kB,fs,Hartree,mol,kcal
E_chair = chair_atoms.get_potential_energy()
E_boat = boat_atoms.get_potential_energy()
E_diff = E_boat - E_chair
print(f"E_chair {E_chair:.2f} eV")
print(f"E_boat {E_boat:.2f} eV")
print(f"E_diff {E_diff:.2f} eV = {E_diff / (kJ/mol):.2f} kJ/mol")
E_chair -76.86 eV
E_boat -76.56 eV
E_diff 0.30 eV = 28.77 kJ/mol
The chair shape was found to be lower energy and more stable than the boat shape. The energy difference was also close to the 25 kJ/mol value shown in the figure above.
Thus, in structural optimization, the initial structure is important to find the local stability point. Even for the same cyclohexane, the results after structural optimization will differ depending on how the initial structure is prepared.
Opt Crystals¶
Next, let us try to perform structural optimization for systems with periodic structures. In the above example of an organic molecule without periodic structure, only the coordinates of each atom were subject to be optimized, but in a periodic system, the cell size is also subject to be optimized.
In addition, for a crystal system with periodic structure (bulk), the energy depends on the crystal structure (e.g., BCC, FCC).
1. Si crystal¶
Let us investigate what kind of structure is most stable as the crystal structure of Si. As candidates for crystal structures, we try the following structures.
sc: Simple Cubic
bcc: Body Centered Cubic
fcc: Face Centered Cubic
hcp: Hexagonal Close Packed
diamond: Diamond structure
We calculate the energy per atom.
First, the following code calculates the potential energy and density for various values of the lattice constant a
for the Si BCC structure.
Here, the np.linspace
method is used to generate five lattice constants a
between 2.5A and 7.0A, and the energy at each lattice constant is calculated.
[14]:
estimator = Estimator(calc_mode=EstimatorCalcMode.CRYSTAL_U0, model_version="v3.0.0")
calculator = ASECalculator(estimator)
[15]:
from ase.build import bulk
a_array = np.linspace(2.5, 7.0, 5)
bcc_energy_list = []
bcc_density_list = []
for a in a_array:
atoms = bulk("Si", "bcc", a=a)
atoms.calc = calculator
# calculate potential energy for 1 atom (eV/atom)
E_pot = atoms.get_potential_energy() / len(atoms)
density = len(atoms) / atoms.get_volume()
bcc_energy_list.append(E_pot)
bcc_density_list.append(atoms.get_volume())
print("lattice a: ", list(a_array))
print("Density : ", bcc_density_list)
print("Energy : ", bcc_energy_list)
lattice a: [2.5, 3.625, 4.75, 5.875, 7.0]
Density : [7.8125, 23.817382812499996, 53.58593750000001, 101.38964843749996, 171.50000000000006]
Energy : [-1.1319000978713731, -3.3385418260066886, -1.325669089403413, -0.1946264644375943, -0.007487893090558714]
We make a method to wrap above procedure so that we can perform this calculations on a variety of structures.
[16]:
from ase.build import bulk
def calc_density_energy(calculator, symbol: str = "Fe", crystalstructure: str = "bcc"):
a_array = np.linspace(2.0, 8.0, 30)
energy_list = []
density_list = []
for a in a_array:
atoms = bulk(symbol, crystalstructure, a=a)
atoms.calc = calculator
E_pot = atoms.get_potential_energy() / len(atoms)
density = len(atoms) / atoms.get_volume()
energy_list.append(E_pot)
density_list.append(density)
return np.array(energy_list), np.array(density_list)
This calc_density_energy
method can be used to obtain a list of energies and densities for various structures.
[17]:
bcc_energy, bcc_density = calc_density_energy(calculator, "Si", "bcc")
fcc_energy, fcc_density = calc_density_energy(calculator, "Si", "fcc")
sc_energy, sc_density = calc_density_energy(calculator, "Si", "sc")
hcp_energy, hcp_density = calc_density_energy(calculator, "Si", "hcp")
diamond_energy, diamond_density = calc_density_energy(calculator, "Si", "diamond")
Looking at the calculated energy values, we can see that the energy values are unstable when the value of a
is too small (when the density is made unrealistically large).
[18]:
bcc_energy
[18]:
array([-1.51431809e+03, 1.90312736e+02, 1.00883824e+00, -2.69269530e+00,
-3.69642360e+00, -4.01363424e+00, -3.93359487e+00, -3.67273086e+00,
-3.28083585e+00, -2.88532212e+00, -2.49506160e+00, -2.09706308e+00,
-1.72758801e+00, -1.41201213e+00, -1.11850814e+00, -8.32015884e-01,
-5.81543805e-01, -3.93798904e-01, -2.63483682e-01, -1.65962615e-01,
-9.01755903e-02, -5.25514094e-02, -4.20469155e-02, -3.06421093e-02,
-7.48789309e-03, -7.48789309e-03, -7.48789309e-03, -7.48789309e-03,
-7.48789309e-03, -7.48789309e-03])
The following plots avoid such values.
Using the np.where
method, indices with absolute values of 5 or less are taken as inds
, and plotted by a_array[inds]
excluding the outlier values.
[19]:
a_array = np.linspace(2.0, 8.0, 30)
inds = np.where(np.abs(sc_energy) < 5.0)[0]
plt.plot(a_array[inds], sc_energy[inds], label="sc")
inds = np.where(np.abs(bcc_energy) < 5.0)[0]
plt.plot(a_array[inds], bcc_energy[inds], label="bcc")
inds = np.where(np.abs(fcc_energy) < 5.0)[0]
plt.plot(a_array[inds], fcc_energy[inds], label="fcc")
inds = np.where(np.abs(hcp_energy) < 5.0)[0]
plt.plot(a_array[inds], hcp_energy[inds], label="hcp")
inds = np.where(np.abs(diamond_energy) < 5.0)[0]
plt.plot(a_array[inds], diamond_energy[inds], label="diamond")
plt.xlabel("a [A]")
plt.ylabel("Energy per atom [eV]")
plt.legend()
plt.title("Si bulk energy for various structure")
plt.ylim(-6.0, 0)
#plt.yscale("log")
plt.show()
The above figure shows the lattice constant a
as the horizonal axis. But it is difficult to compare because the position at which the energy becomes small differs from structure to structure.
In the figure below, we plot the energy with density on the horizontal axis.
[20]:
inds = np.where(np.abs(sc_energy) < 5.0)[0]
plt.plot(sc_density[inds], sc_energy[inds], label="sc")
inds = np.where(np.abs(bcc_energy) < 5.0)[0]
plt.plot(bcc_density[inds], bcc_energy[inds], label="bcc")
inds = np.where(np.abs(fcc_energy) < 5.0)[0]
plt.plot(fcc_density[inds], fcc_energy[inds], label="fcc")
inds = np.where(np.abs(hcp_energy) < 5.0)[0]
plt.plot(hcp_density[inds], hcp_energy[inds], label="hcp")
inds = np.where(np.abs(diamond_energy) < 5.0)[0]
plt.plot(diamond_density[inds], diamond_energy[inds], label="diamond")
plt.xlabel("Density [1/A^3]")
plt.ylabel("Energy per atom [eV]")
plt.legend()
plt.title("Si bulk energy for various structure")
plt.ylim(-6.0, 0)
#plt.yscale("log")
plt.show()
Now the figure is easier to read by taking density on the horizontal axis.
Thus, we can obtain the lattice constants likely to minimize the energy for each of the SC, BCC, FCC, HCP, and Diamond structures by performing calculations with different lattice constants. By comparing the minimum energies for each structure in this calculation, we can analyze that the diamond structure is stable for Si.
2. Lattice constant optimization of Fe crystal¶
Next, let’s find this lattice constant efficiently by using a structural optimization algorithm. When optimizing a bulk with a periodic structure, the method differs depending on whether we are optimizing only the atomic coordinates or including the cell size. If you want to optimize the cell as well, you need to apply the following Filter class to the ASE atoms.
Table: Optimization target and Filter to be used
Position |
Cell |
Filter |
---|---|---|
✓ |
No filter (Default behavior) |
|
✓ |
|
|
✓ |
✓ |
|
Fix atomic coordinates, opt only cell –>
StrainFilter
Fix cell, opt only atomic coordinates –> No Filter
Opt both atomic coordinates and cell –>
ExpCellFilter
Here is an example of using StrainFilter
.
We will optimize the lattice constants of the BCC and FCC structures of Fe atoms. Since the BCC and FCC structures contain only one atom in the primitive cell, optimizing the atomic coordinates is not necessary, and it is sufficient to optimize only the lattice constants. So StrainFilter
is appropriate.
[21]:
from ase.constraints import ExpCellFilter, StrainFilter
fe_bcc_atoms = bulk("Fe", "bcc", a=3.0)
print("Before opt:", fe_bcc_atoms.cell)
fe_bcc_atoms.calc = calculator
fe_bcc_atoms = StrainFilter(fe_bcc_atoms)
opt = LBFGS(fe_bcc_atoms, trajectory="output/lbfgs_fe_bcc.traj")
opt.run()
Before opt: Cell([[-1.5, 1.5, 1.5], [1.5, -1.5, 1.5], [1.5, 1.5, -1.5]])
Step Time Energy fmax
*Force-consistent energies used in optimization.
LBFGS: 0 02:20:35 -4.865733* 2.2558
LBFGS: 1 02:20:35 -4.933622* 2.0702
LBFGS: 2 02:20:35 -4.434309* 8.0541
LBFGS: 3 02:20:35 -4.992679* 0.5748
LBFGS: 4 02:20:35 -4.996655* 0.1459
LBFGS: 5 02:20:35 -4.996941* 0.0051
[21]:
True
[22]:
from ase.constraints import ExpCellFilter, StrainFilter
fe_fcc_atoms = bulk("Fe", "fcc", a=3.0)
print("Before opt:", fe_fcc_atoms.cell)
fe_fcc_atoms.calc = calculator
fe_fcc_atoms = StrainFilter(fe_fcc_atoms)
opt = LBFGS(fe_fcc_atoms, trajectory="output/lbfgs_si_fcc.traj")
opt.run()
Before opt: Cell([[0.0, 1.5, 1.5], [1.5, 0.0, 1.5], [1.5, 1.5, 0.0]])
Step Time Energy fmax
*Force-consistent energies used in optimization.
LBFGS: 0 02:20:36 -2.789322* 16.4962
LBFGS: 1 02:20:36 -4.806755* 3.0130
LBFGS: 2 02:20:36 -4.878337* 0.6926
LBFGS: 3 02:20:36 -4.882889* 0.1071
LBFGS: 4 02:20:36 -4.883017* 0.0137
[22]:
True
[23]:
fe_bcc_atoms.atoms.cell
[23]:
Cell([[-1.4243732594602483, 1.4243732644003524, 1.4243732505018425], [1.4243732675507748, -1.4243732687974981, 1.4243732629895147], [1.42437325401479, 1.4243732633520398, -1.4243732548989883]])
[24]:
fe_fcc_atoms.atoms.cell
[24]:
Cell([[-2.272649880573051e-09, 1.7255768493652164, 1.7255767924285237], [1.7255767836800167, 1.862583482829644e-09, 1.7255767882932904], [1.7255767714053183, 1.7255768329552847, 1.4137281709826914e-08]])
When optimizing the lattice constants using StrainFilter
, the values of the a, b, and c axes are slightly different because of the lack of consideration for the symmetry of the crystal. (In fact, it is possible to optimize while preserving the symmetry by using ExpCellFilter
or FixSymmetry
as described below.)
Here we will take the average value to calculate the final lattice constant. For the argument a
to the bulk
method of ASE, the cells of the BCC structure are
[[-a/2, a/2, a/2],
[a/2, -a/2, a/2],
[a/2, a/2, -a/2]]
The cells of the FCC structure are
[[0, a/2, a/2],
[a/2, 0, a/2],
[a/2, a/2, 0]]
The following calculation is performed using this fact.
[25]:
a = np.abs(fe_bcc_atoms.atoms.cell).mean() * 2
E_pot = fe_bcc_atoms.get_potential_energy()
print(f"BCC opt: a = {a:.2f} A, E = {E_pot:.2f} eV")
a = fe_fcc_atoms.atoms.cell.sum() / 3
E_pot = fe_fcc_atoms.get_potential_energy()
print(f"FCC opt: a = {a:.2f} A, E = {E_pot:.2f} eV")
BCC opt: a = 2.85 A, E = -5.00 eV
FCC opt: a = 3.45 A, E = -4.88 eV
As a result of structural optimization, we found that the BCC structure has the minimum energy at a=2.85
A.
We also found that Fe is more stable in the BCC structure than in the FCC structure, which is consistent with experimental results.
Visualization of the progress of structural optimization shows that only the lattice constants are optimized, while the atomic coordinates remain the same.
[26]:
view_ngl(Trajectory("output/lbfgs_fe_bcc.traj"), w=400, h=300, replace_structure=True)
One caveat to using StrainFilter
is that the a, b, and c axes of a cell are optimized independently, so even if you want to keep that ratio, it may not be preserved.
3. Lattice constant optimization of Si crystal diamond structure¶
Next, let us perform optimization for the Si crystal diamond structure. Here, we use ExpCellFilter
to optimize both atomic coordinates and cell size.
In ExpCellFilter
, you can perform optimization while preserving the ratio of a, b, and c axis lengths of cells by setting hydrostatic_strain=True
.
[27]:
from ase.constraints import ExpCellFilter, StrainFilter
si_diamond_atoms = bulk("Si", "diamond")
si_diamond_atoms.calc = calculator
print("cell size before: ", si_diamond_atoms.cell)
si_diamond_atoms = ExpCellFilter(si_diamond_atoms, hydrostatic_strain=True)
opt = LBFGS(si_diamond_atoms, trajectory="output/lbfgs_si_diamond.traj")
opt.run()
print("cell size after : ", si_diamond_atoms.atoms.cell)
cell size before: Cell([[0.0, 2.715, 2.715], [2.715, 0.0, 2.715], [2.715, 2.715, 0.0]])
Step Time Energy fmax
*Force-consistent energies used in optimization.
LBFGS: 0 02:20:49 -9.108032* 0.4967
LBFGS: 1 02:20:49 -9.113262* 0.0002
cell size after : Cell([[0.0, 2.734332539572064, 2.734332539572064], [2.734332539572064, 0.0, 2.734332539572064], [2.734332539572064, 2.734332539572064, 0.0]])
Comparing the before and after of cell size, we can see that the cell size ratio was maintained during the structural relaxation.
[28]:
view_ngl(Trajectory("output/lbfgs_si_diamond.traj"), w=400, h=300, replace_structure=True)
4. SiO\(_2\) crystal¶
So far, we have dealt with crystals containing only one element. Finally, we will optimize for SiO\(_2\) crystals as a more practical example.
SiO2 is also known to take various structures such as Quartz, Cristobalite, and Tridymite in reality. The existence of many crystal systems with the same composition is called polymorphism.
In this case, the Quartz structure is prepared from the Materials project, and structural optimization is performed. An ExpCellFilter
is used to optimize both coordinates and cell size at the same time.
[29]:
atoms = read("../input/SiO2_mp-6930_conventional_standard.cif")
atoms.calc = calculator
view_ngl(atoms, representations=["ball+stick"])
[30]:
opt = LBFGS(ExpCellFilter(atoms), trajectory="output/lbfgs_sio2.traj")
opt.run()
Step Time Energy fmax
*Force-consistent energies used in optimization.
LBFGS: 0 02:20:57 -57.136945* 0.0379
[30]:
True
[31]:
view_ngl(Trajectory("output/lbfgs_sio2.traj"), representations=["ball+stick"])
[Note] Structural optimization is required every time Calculator (potential energy) changes
You might think that the structure of the Materials project is already a perfectly stable structure, since it is also a structure that has undergone structural optimization. However, the exact local point (the structure that takes the lowest energy) is different when the potential energy changes.
[Column] Structural optimization and the real world¶
The structural optimization calculations presented in this chapter are the foundation for almost all the atomistic simulations introduced later. However, the computational methods used inside the structural optimization are somewhat technical, and those new to atomistic simulations through this tutorial may not intuitively understand the necessity of such calculations. Therefore, in this column, we will review structural optimization calculations from the viewpoint of their correspondence with the real world.
Need for structural optimization¶
Considering how the atomic arrangement of material in the real world is determined, each atom is free to move around without any particular constraints, so excess energy is dissipated and transformed into a stable structure with low energy. Once we consider the case where the temperature is sufficiently low, ignoring the effect of temperature, in that case, the kinetic energy of the atoms will be zero, and the potential energy will be minimal. In this sense, structural optimization calculations correspond to absolute zero temperature conditions in the real world.
What about the case of finite temperature? From the theory of statistical mechanics, it is known that temperature and kinetic energy have a constant relationship in a system in equilibrium. A rough calculation shows that a 1 K rise corresponds roughly to a 1/10000 eV/atom increase in kinetic energy. Conversely, an increase in energy of 1 eV/atom corresponds to a structure of about 10000 K in terms of temperature. Thus, even at finite temperatures, an energy-optimized structure is often a good enough approximation, so it is common practice to start the simulation with a structural optimization calculation (and then increase the temperature for finite temperatures).
Simulations without structural optimization may lead to unrealistic results due to differences in assumptions. For example, if the cell size of the iron crystal structure is reduced by 1%, you may not notice any particular change if you only visualize the atomic structure. However, as can be seen by considering the operation of shrinking iron by 1% in the real world, this corresponds to iron under high compressive force (specifically, pressure in excess of 1 GPa), which can change the results of the simulation.
Structural optimization methods and real world structural relaxation¶
Let us consider why there are several structural optimization methods by their correspondence to the real world. What happens when the material at an appropriately high temperature, like molten iron, is cooled slightly by removing heat? The iron atoms constantly vibrate due to the heat, taking on various atomic configurations and gradually transitioning to a structure with lower potential energy. Eventually, they transition to a crystal structure with low energy, a well-ordered atomic arrangement. In this sense, structural optimization in the real world is considered simulated annealing). (Historically, this corresponds to the original idea of annealing.)
In simulated annealing, atoms are in motion for a long time at an elevated temperature. It means that, in principle, an atomistic simulation can yield the same stable structure as in the real world if a sufficiently long molecular dynamics calculation can be performed.
The problem here, however, is the gap between the time scale of the MD simulation and reality. While atomic motion is typically a femtosecond (\(10^{-15}\) s) order phenomenon, much longer time scales typically come into play when considering real-world materials. Therefore, real-world structures can be viewed as the result of extremely long-time structural optimization calculations on the scale of \(10^{15}\) or even longer atomic scale simulations. It is also known that the annealing method is a highly robust optimization technique and can arrive at a more globally optimal solution by taking a sufficiently long time. Therefore, it is possible to consider that a material that exists, in reality, has obtained an atomic structure for which the global optimization problem has been virtually solved.
Thus, methods that allow atomistic simulations to find more globally optimal solutions in a shorter time have been developed. This is generally a tradeoff against computational time and is used in different cases. Suppose the structure of interest is already known, such as a crystalline structure, and a sufficiently close structure can be taken as the initial structure. In that case, the BFGS-like method shown in the example above is the preferred method for finding the local optimum. Conversely, when the target structure is unknown, such as an amorphous structure, more attention is paid to finding a more globally optimal solution. (For more realistic results, melt-quench methods are used, for example, in which the temperature is slowly lowered.) This is why libraries of atomic simulations, including ASE, include a variety of optimal solution methods. As a side note, there are cases in the real world where the solution is not globally optimal. For example, diamond, a high-pressure phase of carbon, is not stable on earth, and graphite is the most stable allotrope. Because of the high energy required for structural change, the rate of structural change is extremely slow and is treated as a practically stable structure (metastable state) on the human time scale.
A famous example of active use in manufacturing is the quenching of iron. The technique of hardening heated iron by immersing it in water is well known in the forging of Japanese swords. The time required for structural relaxation is limited by rapidly quenching the temperature, and the transition to a stable structure is prevented. This treatment yields a metastable state with a more desirable strength.
Thus, phenomena involving phase transformations and chemical changes often occur slowly, even on an everyday scale. This is true for atomistic simulations as well, and if one wants to look at phase transformations or chemical reactions, a more efficient computational approach may be used than simply performing long kinetic calculations. One specific example is the NEB method described in chapter 5.