New Tools for Detecting MultirefPredict Character in Transition Metal Chemical Space

Fang Liu1, Chenru Duan12, Aditya Nandy12, Heather Kulik1
1 Department of Chemical Engineering, MIT, Cambridge, MA
2 Department of Chemistry, MIT, Cambridge, MA
MolSSI Mentor(s): Daniel Smith

Introduction

Open-shell transition metal complexes is essential for materials design and catalysis, but notoriously difficult to study accurately with electronic structure (ES) methods.

TMC

Computationally predicted molecular properties can be strongly affected the choice of ES methods. Density functional theory (DFT) results can vary significantly with the choice of functionals and parameters.

Impacts of method selection

For example, the spin-state energetic is defined as

\[\Delta E_{\textrm{H-L}}= E_{\textrm{HS}} - E_{\textrm{LS}}\]

spinsplit

First principle calculation

The DFT predicted spin-splitting energy shifts significantly with DFT parameters.

drawing Figure 1: DFT results shift with parameters. F Liu, et al. Front. Chem. (2019))

Machine learning

Performance of machine learning models trained on DFT calculated datasets also shift with parameters.

drawing Figure 2: Machine learning predicted spin-crossover compound leads also shift with DFT parameters. J. P. Janet, F Liu et al., Inorg. Chem. (2019)

Objectives

A crucial step to obtain accurate results is to choose between DFT and correlated wavefunction based based methods.

Multireference character

Molecules with strong static correlation, i.e., multireference (MR) character, need to be calculated with correlated methods.

drawing

Decision Engineering

For high throughput computational design and discovery, an automated decision engine for method selection is needed. drawing

Methods

MR diagnostics: measurement of MR characters, essential for method selection

Energy based diagnostics

BE
bonding energy.
TAE
total atomization energy
  1. \(B_1\) diagnostics

    \[B_{1} = \frac{\text{BE}_{\text{BLYP}}-\text{BE}_{\text{B1LYP/BLYP}}}{n_{\text{bonds}}}\]
  2. \({A_{25\% }}[\rm{PBE}]\) diagnostics

    \[{A_{25\% }}[\rm{PBE}] = 4\times \frac{\rm{1} - \rm{TAE[PBE0]}}{\rm{TAE[PBE]}}\]
  3. \(\% \rm{TAE}\) diagnostics

    \[\% \rm{TAE} = 100 \times \frac{\rm{TAE}_{\rm{e}}\rm{[CCSD(T)]} - \rm{TAE}_{\rm{e}} \rm{[CCSD]}}{\rm{TAE}_{\rm{e}} \rm{[CCSD(T)}}\]

CASSCF based diagnostics

drawing

\[\Psi ^{\text{CAS}}=\sum\limits_{I}^{\text{CI}}{C_I}\Phi_I\]

\(C_I\): CI coefficients

\(C_0^2\) diagnostics: square of the CI coefficient of the leading determinant

Coupled-cluster (CC) based diagnostics

\(\Psi _{\rm{CC}} \rangle = {e^{\hat T}}\left| {\Phi _I} \right\rangle ,\;{\kern 1pt} \hat T = {\hat T_1} + {\hat T_2} + {\hat T_3} + ...\)

  1. \(T_1\) diagnostics

    \[T_1 = \frac{\sqrt {\overrightarrow {t_1} \cdot \overrightarrow {t_1} } }{\sqrt {N_{\rm{corr}}} }\]
  2. \(D_1\) diagnostics

    \[D_1\rm{(CCSD) } = \left\| \bf{T} \right\|_{2} , {\bf{T}}_{ia} = t_i^a\]
  3. \(D_2\) diagnostics

    \[D_2 \rm{(CCSD) } = \rm{max} \{ D_2^o,D_2^v\} , \bf{T}_{jab,i}^o = \bf{T}_{ijb,a}^v = t_{ij}^{ab}\] \[D_2^o = \rm{max} \left\| {\bf{T}}^o{\bf{u}} \right\|_2, D_2^v = \rm{max} \left\| {\bf{T}}^v{\bf{v}} \right\|_2\]

Fractional occupation number (FON) based diagnostics

  1. \(I_{\rm{ND}}\) diagnostics

    \[I_{\rm{ND}} = \frac{1}{2}\sum\limits_{\sigma ,i} {n_i^\sigma } (1 - n_i^\sigma )\] \[I_{\rm{D}} = \frac{1}{4}\sum\limits_{\sigma ,i} {[n_i^\sigma (1 - n_i^\sigma )]^{1/2}} - \frac{1}{2}\sum\limits_{\sigma ,i} {n_\sigma ^i} (1 - n_\sigma ^i)\]
  2. \(r_{\rm{ND}}\) diagnostics

    \[r_{\rm{ND}} = \frac{I_{\rm{ND}}}{I_{\rm{D}} + I_{\rm{ND}}}\]
  3. FOD (fractional occupation density) diagnostics

    \[\rho _{\rm{FOD}} = \sum\limits_{\sigma ,i} {(\delta _1 - \delta _2 n_i^\sigma )|\phi _i^\sigma ({\bf{r}})|^2}\] \[N_{\rm{FOD}} = \int {\rho _{\rm{FOD}}({\bf{r}}){\rm{d}}{\bf{r}}}\]

QCEngine Interface for TeraChem

To utilize the advantages of different quantum chemistry (QC) packages, MultirefPredict conducts QC calculations through QCEngine, an open-source executor and IO standardizer. We implemented the interface for the GPU based QC package, TeraChem, in QCEngine to exploit the massive parallel power of GPUs. MRengine

MultirefPredict

MultirefPredict is an open source python module we developed to automate the generation of commonly used multi-reference diagnostics.

  • Source code available on Github
  • Online documentation available
  • MolSSI best practices for programming are enforced in the development: version control, testing and code coverage, continuous integration, code style, documentation (Github badges shown below). multiref1
  • User-friendly interface. Automatically handling I/O for QC packages.
  • Backend QC packages (TeraChem, Psi4, etc.) can be chosen by default based on availability of the software/hardware and the type of required calculation, or from user’s input. multiref2

Example

Example python script to calculate B1 diagnostics through MultirefPredict:

1
2
3
4
5
6
7
8
9
10
import MultirefPredict
import qcelemental

mol = qcelemental.models.Molecule.from_data("""
O                 0.000000000000     0.000000000000    -0.068516245955
H                 0.000000000000    -0.790689888800     0.543701278274
H                 0.000000000000     0.790689888800     0.543701278274
""")

b1 = MultirefPredict.diagnostic_factory("B1",molecule=mol, molname="water", record=False).computeDiagnostic()

It takes only one command to define the molecule to calculate, and another line to tell MultirefPredict what diagnostics to calculate. No need to know the quantum chemistry packages to use, or the format of their input/output files.

Conclusions

  • Implemented QCEngine interface for the GPU based quantum chemistry package TeraChem to enable easy control of TeraChem through python
  • Developed automated module, MultirefPredict, to calculate MR diagnostics.
  • Enabled automated workflow to generate transition metal complexes structure and estimate their multireference character.

References

  1. F. Liu, et al. DOI: 10.3389/fchem.2019.00219
  2. J. P. Janet, F. Liu, et al. DOI: 10.1021/acs.inorgchem.9b00109
  3. molSimplify: https://molSimplify.mit.edu
  4. C. Duan, F. Liu, et al. DOI: 10.1021/acs.jctc.0c00358
  5. E. Ramos-Cordoba, E. Matito, DOI: 10.1021/acs.jctc.7b00293

Acknowledgements

Software Fellow Fang Liu was supported by a fellowship from The Molecular Sciences Software Institute under NSF grant OAC-1547580”

Updated: