Cvitanich et al. (2000): Nodulin genes are specifically expressed in the nitrogen-fixing root nodules. We have identified a novel type of DNA-binding protein (CPP1) interacting with the promoter of the soybean leghemoglobin gene Gmlbc3. The DNA-binding domain of CPP1 contains two similar Cys-rich domains with 9 and 10 Cys, respectively. Genes encoding similar domains have been identified in Arabidopsis thaliana, Caenorhabditis elegans, the mouse, and human. The domains also have some homology to a Cys-rich region present in some polycomb proteins. The cpp1 gene is induced late in nodule development and the expression is confined to the distal part of the central infected tissue of the nodule. A constitutively expressed cpp1 gene reduces the expression of a Gmlbc3 promoter-gusA reporter construct in Vicia hirsuta roots. These data therefore suggest that CPP1 might be involved in the regulation of the leghemoglobin genes in the symbiotic root nodule.


Cvitanich, C; Pallisgaard, N; Nielsen, KA; Hansen, AC; Larsen, K; Pihakaski-Maunsbach, K; Marcker, KA; Jensen, EO. 2000. CPP1, a DNA-binding protein involved in the expression of a soybean leghemoglobin c3 gene. Proc. Natl. Acad. Sci. U.S.A. 97(14):8163-8.


Name: CPP
Class: TF
Number of species containing the TAP: 114
Number of available proteins: 748


Phylogenetic tree for Archaeplastida:


TAP distribution:

The following table shows the distribution of CPP over all species included in TAPscan. The values for e.g. a specific kingdom are shown in the tree below if you expand the tree for that kingdom.

Minimum Maximum Average Median Standard deviation

List of species containing CPP sorted by kingdom, clade, supergroup, order, family:


Kingdom Archaeplastida (758 proteins in 114 species)
