SOSUI is a free online tool that predicts a part of the secondary structure of proteins from a given amino acid sequence (AAS). The main objective is to determine whether the protein in question is a soluble or a transmembrane protein.


SOSUI's algorithm was developed in 1996 at Tokyo University of Agriculture and Technology. The name roughly means "hydrophobic", an allusion to its molecular "clients".

How SOSUI works

First, SOSUI looks for α helices, which are relatively easy to predict, taking into account the known helical potentials of the given AAS. The much more difficult task is to distinguish the α helices of soluble proteins from those of transmembrane proteins, since the α helix is a very common secondary structure pattern in proteins.

SOSUI uses four characteristics of the AAS in its prediction:
# "hydropathy index" (Kyte and Doolittle 1982)
# weighted presence of amphiphilic amino acids (AA) and their localization: "amphiphilicity index"
# the charges of the AAs
# the length of the AAS
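
The first characteristic in the list can be illustrated with a short Python sketch that averages Kyte and Doolittle hydropathy values over a sliding window. This is not SOSUI's implementation: the function names and the default window of 19 residues are assumptions chosen for illustration. Only the hydropathy values themselves are taken from the published Kyte and Doolittle scale.

```python
# Kyte and Doolittle (1982) hydropathy values, one-letter amino acid codes.
KD_HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def mean_hydropathy(seq):
    """Average hydropathy over the whole sequence."""
    seq = seq.upper()
    return sum(KD_HYDROPATHY[aa] for aa in seq) / len(seq)


def hydropathy_profile(seq, window=19):
    """Sliding-window mean hydropathy (hypothetical helper).

    Returns one value per window position, or an empty list if the
    sequence is shorter than the window. The window size is an
    assumption of this sketch, not a documented SOSUI parameter.
    """
    seq = seq.upper()
    if window < 1 or window > len(seq):
        return []
    scores = [KD_HYDROPATHY[aa] for aa in seq]
    return [
        sum(scores[i:i + window]) / window
        for i in range(len(seq) - window + 1)
    ]
```

Stretches where the profile stays clearly positive are hydrophobic and therefore candidates for membrane-spanning helices. As the list above indicates, hydropathy is only one of the four characteristics SOSUI combines.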

An important improvement over Kyte and Doolittle's "hydropathy index", which relies entirely on one characteristic, is the introduction of the so-called "amphiphilicity index". It is calculated by assigning every AA with an amphiphilic side chain a value derived from the AA's molecular structure. To meet SOSUI's criteria for amphiphilicity, the polar, hydrophilic group must not be attached directly to the beta-carbon; at least one apolar carbon must lie in between. Only lysine, arginine, histidine, glutamic acid, glutamine, tryptophan and tyrosine are therefore relevant.

SOSUI then looks for accumulations of amphiphilic AAs at the ends of α helices, which appear to be typical of transmembrane α helices. Placing amphiphilic AAs at the lipid-water boundary makes the transmembrane position the energetically most favorable one for these α helices and thus contributes to the protein's correct localization. The AAs' charges are also taken into consideration. The length matters because biological lipid membranes have a certain thickness, which determines the length of membrane-spanning helices.

According to a study published by SOSUI's developers, the tool correctly classified 99% of a selected group of proteins with known structure [1]. However, another study, which ran several prediction tools on the AASs of 122 known proteins, reported that SOSUI predicted the correct number of α helices in only about 60% of the cases [2]. Even if the number of transmembrane domains is not always exact, the distinction between soluble and transmembrane proteins often works, because it only requires finding out whether a protein has such a domain at all. Membrane proteins that have no transmembrane α helices (e.g. porins) or that are attached to the membrane by a covalent bond cannot be found by SOSUI.
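
The search for amphiphilic AAs at the ends of a helix can be sketched in Python as follows. The set of seven amphiphilic AAs is the one named in the text. Everything else is an assumption made for illustration: the function name, the flank size of five residues, and the use of plain counts instead of SOSUI's structure-derived weights, which are not given here.

```python
# The seven amino acids the text lists as amphiphilic (one-letter codes):
# lysine, arginine, histidine, glutamic acid, glutamine, tryptophan, tyrosine.
AMPHIPHILIC = frozenset("KRHEQWY")


def end_amphiphilic_counts(segment, flank=5):
    """Count amphiphilic residues in both ends and the core of a segment.

    Hypothetical helper: `flank` (residues per end) is an assumed
    parameter, and residues are counted without any weighting.
    Returns a tuple (n_terminal_end, core, c_terminal_end).
    """
    seg = segment.upper()
    # Never let the two flanks overlap on short segments.
    flank = max(0, min(flank, len(seg) // 2))
    n_end = seg[:flank]
    core = seg[flank:len(seg) - flank]
    c_end = seg[len(seg) - flank:] if flank else ""

    def count(part):
        return sum(1 for aa in part if aa in AMPHIPHILIC)

    return count(n_end), count(core), count(c_end)
```

A candidate helix with several amphiphilic residues in both ends and few in its core matches the pattern described above for transmembrane α helices. A real classifier would additionally weigh the residues, their charges and the segment length.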


The result page first shows general information (length, average hydrophobicity). If the protein in question is a transmembrane protein, the number of transmembrane domains and their positions are listed. A "hydropathy profile" with the hydrophobic parts highlighted in color and helical wheel diagrams of the potential transmembrane domains are shown as well. The last image gives a schematic overview of the transmembrane protein's location.


# Hirokawa, Boon-Chieng, Mitaku, "SOSUI: Classification and secondary structure prediction for membrane proteins", Bioinformatics 14: 378–379 (1998)
# Masami Ikeda, Masafumi Arai, Toshio Shimizu, "Evaluation of transmembrane topology prediction methods by using an experimentally characterized topology dataset", Genome Informatics 11: 426–427 (2000)

External links

* SOSUI homepage

Wikimedia Foundation. 2010.
