Skip to main content
v2026.1714 entries · CC-BY 4.0
CASRAI

Life sciences · Reference

What is bioinformatics?

Bioinformatics is the field that develops and applies computational methods to analyse biological data — especially DNA, RNA, and protein sequences — turning vast datasets into biological understanding.

Computing meets biology

Bioinformatics arose because modern biology generates data on a scale that cannot be analysed by hand. Sequencing a single genome produces billions of data points, and comparing many genomes multiplies that further. Bioinformatics provides the computational methods — algorithms, software, and statistics — to store, search, and make sense of this data, bridging the gap between raw biological measurements and biological insight.

Sequence alignment and analysis

A central task in bioinformatics is sequence alignment: arranging DNA, RNA, or protein sequences to identify regions of similarity that may reflect shared function or evolutionary relationship. Tools such as BLAST allow researchers to search databases for sequences resembling one of interest.

Other core activities include assembling genomes from sequencing reads, identifying genes, predicting protein structure, and analysing patterns of gene expression — all of which depend on well-designed algorithms and reliable reference data.

Databases and resources

Bioinformatics relies on large public databases that curate biological data under shared standards. GenBank and the European Nucleotide Archive hold nucleotide sequences; UniProt curates protein sequences and functional information; and many specialised resources serve particular data types. These databases let researchers worldwide deposit, retrieve, and reuse data, making them a backbone of reproducible biology.

Bioinformatics and FAIR data

Because bioinformatics is fundamentally about managing and reusing biological data, it is closely aligned with the FAIR principles — making data Findable, Accessible, Interoperable, and Reusable. Consistent identifiers, standard file formats, rich metadata, and deposition in trustworthy data repositories are central to good bioinformatics practice and to ensuring that analyses can be reproduced and built upon.

Key facts

At a glance

  • Definition: computational analysis of biological data
  • Disciplines: biology, computer science, statistics
  • Core task: sequence alignment (e.g. BLAST)
  • Key databases: GenBank, ENA, UniProt
  • Underpins: genomics and related "omics" fields
  • Aligned with: the FAIR data principles

Common questions

FAQ

What is bioinformatics?+

Bioinformatics is the use of computational methods to analyse biological data, especially DNA, RNA, and protein sequences. It combines biology, computer science, and statistics to manage and interpret datasets too large to handle by hand.

What is bioinformatics used for?+

It is used to align and compare sequences, assemble and annotate genomes, predict protein structures, analyse gene expression, and curate biological databases — making it essential to modern genomics.

The step most authors miss

Doing CRediT right? Don’t stop at the statement.

A CRediT statement credits you inside one paper. The recognition CRediT was built for happens when those roles are tied to you, persistently. Sign in with your ORCID — free — and claim your CRediT contributions on casrai.org, the home of the standard. They become a verified, portable part of your identity, not a line that disappears into one PDF.

Free: claim your contributions, then export a journal-ready CRediT statement, schema.org structured data, JATS XML, CSV or BibTeX — and preview your public profile. A membership publishes that profile publicly and verifies the journals you serve.

Referenced across the research world

University of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoORCID logoCrossref logoUniversity of Cambridge logoColumbia University logoUniversity of Edinburgh logoHarvard University logoUniversity of Oxford logoPrinceton University logoStanford School of Medicine logoUniversity College London logoORCID logoCrossref logo
  • University of Cambridge logo
  • Columbia University logo
  • University of Edinburgh logo
  • Harvard University logo
  • University of Oxford logo
  • Princeton University logo
  • Stanford School of Medicine logo
  • University College London logo
  • ORCID logo
  • Crossref logo

View CASRAI adoption →