ContentsBackgroundThere are a number of references out there on basic biology. I like the freshman textbook Life: The Science of Biology by Purves et al1. What I will attempt to do here is give a quick overview of some of the basics of biological chemistry with hands on work with relevant software and real data. Also, I want to make the barriers of entry to using the software and data a minimum both from a computing skills perspective and a biology background perspective. |
|
With this approach readers with a stronger computing background
than natural sciences background will not have to spend a great deal of
time learning biology just to get to some programming
applications. Also people with a biology, chemistry, or medicine
background will only need basic knowledge of computing.
I will be using Chemical Markup Language (CML) to illustrate basic biological chemistry. To get started, I will introduce CML. For a review of XML as it relates to chemistry and biosciences see the article Introduction to XML for Chemistry and Biosciences and for a review of tools for use with CML see the article Tools for Working with Chemical Markup Language.
Another source for chemical structures is http://www.chmoogle.com/index.htm. There are also some non CML based visualization tools at the web site for the Theoretical and Computational Biophysics Group at the University of Illinois at Urbana Champaign. Page on Visual Molecular Dynamics15.
Let's have a look at some CML now. Here is part of a CML
document for
arginine, an amino acid discussed below. The full file is arginine2.cml.
This exampe focusses on geometry. The molecule element contains an atomArray, which enumerates the atoms, and a bond array, which specifies how to connect the atoms. This position of each atom in a cartesian plane is specified by the floating point numbers in the atom attribute elementType. Within the CML schema there is also the ability to represent other chemical properties, including charge, isotope, shell occupancy, and so on.
The current version of CML is CML2. I provide the data files in CML2
for all the molecules described, with a couple of exceptions, in his
article. To view them in two dimensions follow these steps

In JChemPaint try changing the view options to switch on and off
display of carbon and hydrogen atoms using the View | Rendering Options
menu. You can also explore editing the molecules or creating new
ones and saving them to CML files.
To view the molecules in three-dimensions follow these steps
A screenshot of Jmol with a cholesterol molecule loaded is shown
below.

Screenshot of Jmol with a Cholesterol
Molecule Loaded

| Functional
Group |
Class |
Formula |
CML |
| Hydroxyl (OH) |
Alchohol |
|
hydroxyl.cml |
| Aldehyde (CHO) |
Aldehyde |
![]() |
aldehyde.cml |
| Keto (CO) |
Ketones |
![]() |
keto.cml |
| Carboxyl (COOH) |
Carboxylic acids |
|
carboxyl.cml |
| Amino (NH2) |
Amines |
|
amino.cml |
| Phosphate (OPO32-) |
Organic phosphates |
|
phosphate.cml |
| Sulfhydryl (SH) |
Thiols |
|
sulfhydryl.cml |