Type a search term to find related articles by LIMS subject matter experts gathered from the most trusted and dynamic collaboration tools in the laboratory informatics industry.
Chemical informatics (more commonly known as chemoinformatics and cheminformatics) is the use of computer and informational techniques applied to a range of problems in the field of chemistry. While the field has roughly been around around since the 1990s, the rise in high-throughput screening (a scientific experimentation method primarily used in drug discovery) and combinatorial chemistry (a method of synthesizing a large number of compounds in a single process), as well as increases in computing power and data storage sizes, have increased interest in the field in the twenty-first century.[1]
Outside of pharmaceutical research, other applications of chemical informatics include the area of topology, chemical graph theory, and mining the chemical space. It can also be applied to data analysis for the paper, pulp, and dye industries.[2][1]
The 1960s saw the introduction of databases for the storage and retrieval of chemical structures, as well as three-dimensional molecular modeling methods, laying the groundwork for future generations to improve computational methods of chemical and molecular analysis.[2]
The term "chemoinformatics" was defined by F.K. Brown[3][4] in 1998 as such:
Chemoinformatics is the mixing of those information resources to transform data into information and information into knowledge for the intended purpose of making better decisions faster in the area of drug lead identification and optimization.
Since then, both the "chem" and "chemo" spellings have been used. European academia has seemingly settled on the term "chemoinformatics" for many of its research and teaching workshops.[5] Other entities like the Journal of Cheminformatics and Slovak company Molinspiration have trended towards "cheminformatics."[6][7]
The primary application of chemical informatics is in the storage and retrieval of both structured and unstructured information relating to chemical structures, molecular models and other chemical data. Efficiently querying and retrieving that stored information extends into other realms of computer science like data mining and machine learning. Other forms of data querying include graph, molecule, sequence, and tree mining.[8]
The in silico representation of chemical structures uses specialized formats such as the XML-based Chemical Markup Language or Simplified Molecular-Input Line-Entry System (SMILES) specifications. These representations are often used for storage in large chemical databases. While some formats are suited for visual representations in two or three dimensions, others are more suited for studying physical interactions, modeling, and docking studies.[8]
Stored chemical data can pertain to both real and virtual molecules. Virtual libraries of such molecules and compounds may be generated in various ways to explore chemical space and hypothesize novel compounds with desired properties. The Fragment Optimized Growth (FOG) algorithm, for example, was developed to "grow" novel classes of compounds like drugs, natural products, and diversity-oriented synthetic products from a training database of existing compounds.[9][10]
In contrast to high-throughput screening, virtual screening involves computationally screening in silico libraries of compounds, by means of various methods such as docking, to identify members likely to possess desired properties such as biological activity against a given target. In some cases, combinatorial chemistry is used in the development of the library to increase the efficiency in mining the chemical space. More commonly, a diverse library of small molecules or natural products is screened.[1]
This is the calculation of quantitative structure-activity relationship and quantitative structure property relationship values, used to predict the activity of compounds from their structures. In this context there is also a strong relationship to chemometrics, the science of extracting information from chemical systems by data-driven means. Chemical expert systems are also relevant since they represent parts of chemical knowledge as an in silico representation.[1]
This article reuses portions of content from the Wikipedia article.