The recipe for much of modelling in systems and network biology is comparatively easy. First one establishes the topology or ‘structure’ of the network (the curly arrow version seen in wallcharts – such as those for metabolism, now available electronically – of ‘who talks to whom?’). Then one finds out the equations – such as that of Henri, Michaelis and Menten – describing the effect of the concentrations of the various interacting partners on the rate of the reaction step involved. Finally one determines the parameters of those equations (such as the rate constants, measured using the methods of molecular enzymology, and ‘fixed concentrations of substrates and other effectors). Armed with the model, it is then straightforward to run the model using appropriate software (e.g. the COPASI software found on the web) either using ordinary differential equations or when necessary stochastic methods. These give the time evolution of the system variables – typically the concentrations and fluxes of molecules – and one may also determine the local or global sensitivities and summation laws of every variable to every parameter. Even for large models, modern algorithms (including those for solving so-called ‘stiff’ ordinary differential equations) require comparatively little computer power.
The first problem is that we do not necessarily know any of the above (neither the topology, nor the equations nor the parameters), and all we have – as nowadays produced by ‘omics’ methods – is a set of measurements of the time series of variables. The other problem is that the parameters determine the variables and not the other way round. How then do we infer the network – the systems biology model – that is best capable of fitting experimental measurements of variables (or even the desired behaviour of variables, as may be of importance in synthetic biology or metabolic engineering)? This is an ‘inverse’ or ‘system identification’ problem, that often requires considerable computer power and is rather hard as these problems are usually highly underdetermined – many sets of parameters can fit the same data. To avoid (re)inventing the wheel we would like to know which methods work best. (We have recently published a new method of our own, and a separate one on the essentially similar problem of detecting the mode or site of action of something that effectively changes a parameter.)
In these circumstances, a bake-off of competing methods allows a comparison of their strengths and weaknesses – much as the CASP competitions for protein structure prediction – and this is what has been done by the DREAM team (where the acronym stands for Dialogue on Reverse Engineering Assessment and Methods). The results from two published competitions are too detailed to give here, but a number of lessons have already emerged. First, the best algorithm is not the same for all problems (as one may expect from the No Free Lunch theorem, shown and popularised more than ten years ago). Secondly, some algorithms do do remarkably well, and this is very encouraging. One might perhaps summarise the DREAM competition by saying Network Inferencing Gets Heavy Testing Mechanisms, And Results Excel Sometimes – or NIGHTMARES for short…
Last week’s travelogue included the launch of the new Blueprint (PDF) by Lord Drayson and others from the Office for Life Sciences, and a visit to the European Bioinformatics Institute to discuss infrastructure issues.
Finally, I blogged before about how even scientists have been known to copy stories or references without checking them independently. There is a hilarious story from the Times on the speculative semiotics of furniture names, based on the perceived relationships between Swedes and Danes. However, according to the Language Log, this turns out to have been a spoof. Another urban myth exploded.
- Herrgård, M., Swainston, N., Dobson, P., Dunn, W., Arga, K., Arvas, M., Büthgen, N., Borger, S., Costenoble, R., Heinemann, M., Hucka, M., Le Novère, N., Li, P., Liebermeister, W., Mo, M., Oliveira, A., Petranovic, D., Pettifer, S., Simeonidis, E., Smallbone, K., Spasié, I., Weichart, D., Brent, R., Broomhead, D., Westerhoff, H., Kürdar, B., Penttilä, M., Klipp, E., Palsson, B., Sauer, U., Oliver, S., Mendes, P., Nielsen, J., & Kell, D. (2008). A consensus yeast metabolic network reconstruction obtained from a community approach to systems biology Nature Biotechnology, 26 (10), 1155-1160 DOI: 10.1038/nbt1492
- Hoops, S., Sahle, S., Gauges, R., Lee, C., Pahle, J., Simus, N., Singhal, M., Xu, L., Mendes, P. & Kummer, U. (2006). COPASI: a COmplex PAthway SImulator. Bioinformatics 22, 3067-74
- Jayawardhana, B., Kell, D. B. & Rattray, M. (2008). Bayesian inference of the sites of perturbations in metabolic pathways via Markov Chain Monte Carlo. Bioinformatics 24, 1191-1197
- Ljung, L. (1987). System identification: theory for the user. Prentice Hall, Englewood Cliffs, NJ
- Mendes, P. & Kell, D. B. (1998). Non-linear optimization of biochemical pathways: applications to metabolic engineering and parameter estimation. Bioinformatics 14, 869-883
- Michal, G. (1999). Biochemical pathways: an atlas of biochemistry and molecular biology. Wiley, Heidelberg
- Radcliffe, N. J. & Surry, P. D. (1995). Fundamental limitations on search algorithms: evolutionary computing in perspective. Computer Science Today 1995, 275-291. Text via Citeseer
- Rand, D. A. (2008). Mapping global sensitivity of cellular network dynamics: sensitivity heat maps and a global summation law. J R Soc Interface 5 Suppl 1, S59-69
- Saltelli, A., Ratto, M., Andres, T., Campolongo, F., Cariboni, J., Gatelli, D., Saisana, M. & Tarantola, S. (2008). Global sensitivity analysis: the primer. WileyBlackwell, New York
- Stolovitzky, G. & Califano, A. (eds) (2007) Reverse engineering biological networks: opportunities and challenges in computational methods for pathway inference. Ann NY Acad Sci Vol 1115. New York Academy of Sciences, New York
- Stolovitzky, G., Kahlem, P. & Califano, A. (2009). The challenges of systems biology: community efforts to harness biological complexity. Ann NY Acad Sci Vol 1158. New York Academy of Sciences, New York
- Wilkinson, S. J., Benson, N. & Kell, D. B. (2008). Proximate parameter tuning for biochemical networks with uncertain kinetic parameters. Mol Biosyst 4, 74-97
- Wolpert, D. H. & Macready, W. G. (1997). No Free Lunch theorems for optimization. IEEE Trans. Evol. Comput. 1, 67-82
Related posts (based on tags and chronology):
- Data visualisation, and the next generation of bioscientists [22 November 2010]
- Bioengineering and systems biology [11 January 2010]
- In touch with the Dutch [19 October 2009]
- The economy is the network; non-equilibrium systems in the real world [27 April 2009]
- Beyond the magic bullet – network pharmacology meets systems biology [19 January 2009]