Molecular modelling: a tool for understanding the cytoplasm–nucleus–nucleolus shuttling of the ceramide kinase-like protein
L Ayadi, I benrebeh, H Ayadi, S Masmoudi
Citation
L Ayadi, I benrebeh, H Ayadi, S Masmoudi. Molecular modelling: a tool for understanding the cytoplasm–nucleus–nucleolus shuttling of the ceramide kinase-like protein. The Internet Journal of Bioengineering. 2008 Volume 4 Number 2.
Abstract
Ceramide kinase like (CERKL) is a protein implicated in the pathology of retinitis pigmentosa. In this study, the 3D predicted model of CERKL was determined by molecular modelling techniques. Our results predicted the presence in the CERKL protein of an N-terminal Pleckstrin Homology (PH) domain. Presence of a nuclear localization signal within the PH domain may suggest the role played by this domain in the CERKL cytoplasm–nucleus shuttling. We also predicted a putative nucleolar localization signal (NoLS) sequence within the Diacylglycerol Kinase (DAGK) C-terminal domain. We suggest that this NoLS sequence located in an exposed helix retains the CERKL protein in the nucleolus. Absence of the predicted NoLS sequence in the R257X and R379X variants may explain their location in the nucleus and their absence from nucleoli.
Introduction
Ceramide kinase like (CERKL) is a protein implicated in a form of autosomal recessive retinitis pigmentosa (RP), RP26 [1]. CERKL is a homolog of Ceramide kinase (CERK) sharing 50% of similarity. CERK is an enzyme that phosphorylates ceramide to form ceramide 1-phosphate (C1P) [2]. CERK and C1P have been implicated in the regulation of apoptosis [3], neurotransmitter secretion [4], and inflammation [5]. Surprisingly, CERKL did not phosphorylate ceramide and no target substrate is known for this enzyme [2]. Furthermore, wild type CERKL has been localized in different cell compartments [6]. The variant of CERKL missing 205 a.a. at the N-terminus (1–205) was excluded from the nucleus and the nucleolus and was localized in the cytoplasm. Inversely, the variant of CERKL missing 154 a.a. at the C-terminus (R379X) was found in the nucleus but did not localize to nucleolus nor did a CERKL variant with a point mutation in the putative ATP binding site. The naturally occurring CERKL mutant (R257X), linked to the pathology of RP is also accumulated in the nucleus but was not associated with nucleoli.
In this study, we have derived a putative 3D model for the N-terminal and C-terminal of the CERKL protein based on molecular modelling techniques. Pleckstrin Homology (PH) domain and a putative nucleolar localization signal (NoLS) were predicted. Our results gave a rational explanation for the different puzzling localizations of CERKL in the cytoplasm-nucleus-nucleolus.
Material and methods
The wild type of the CERKL protein was obtained from GenBank at the National Center for Biotechnology Information (accession number AJ640141). When the entire CERKL protein sequence was analyzed for comparison, no significantly matching sequence/structure was found, therefore the N-terminal and C-terminal domains of CERKL protein were modelled separately. In an attempt to predict the 3D model of the two domains, we used secondary prediction and fold recognition methods (PSIPRED [7], MGENTHREADER [8], 3D-PSSM [9], FUGUE [10] and 123D+[11]). Fold recognition method results were retained if E-value < 0.001 for the MGENTHREADER, E-value below 0.1 for 3D-PSSM (80% certainty), Z-score >= 6.0 for the FUGUE (99% confidence), and Z-score > 5 for 123D+ (high significant).
The predicted model of the N-terminal CERKL (amino acids 1 to 164) was built using the program MODELLER [12] based both on a template and sequence structure alignment obtained from 3D-PSSM. Due to the lack of homology the 43 first amino acids were not considered.
The predicted model of the C-terminal CERKL (amino acids from 165 to 532) of the wild type was generated using the FUGUE alignment. A cycle of side chain fixing /energy minimization was repeated until no abnormality was detected in the models. The quality of the predicted models was examined using the PROCHECK program [13]. Structural analysis, visualization and figures generation was done using Swiss-PDB Viewer [14].
Results
N-terminal CERKL domain
Fold recognition methods were used in an attempt to identify structural homologs to N-terminal CERKL domain. Twenty hits were generated by 3D-PSSM, seven
The predicted model was built using the crystal structure of the PH domain from human dynamin (PDB entry: 1dyn). Analysis of the Ramachandran plot of the N-terminal PH domain showed that 79.8% (85.6 % for the 1dyn) of the residues lie in the most favorable regions, 11.9% in the additional allowed regions, 6.4% generously allowed regions and 1.8% in the disallowed region. Visualization and analysis of the model using Swiss-PDB Viewer showed that the predicted model correlated with the structural features of PH domains super fold: a seven stranded β-sheet with a flanking C-terminal α-helix (Figure 1B).
C-terminal CERKL domain
Fold-recognition methods MGENTHREADER and FUGUE identified the YegS (PDB entry 2bon [17]) as the best template for the sequence of the C-terminal domain with an E value of 7e-05 (MGENTHREADER) and Z score of 27.8 (FUGUE) respectively. YegS is a cytosolic protein in
Structural analysis of the predicted model showed that the C-terminal domain is further divided into two lobes (two sub domains) Figure 2B. The N-terminal sub domain is formed by residues (Arg165 to Val304) and (Leu518 to Lys532) and comprises five helices (2-6) and six strands (8-12 and 25). This sub domain resembles the Rossman fold nucleotide binding domain found in YegS, in NAD kinase [19] and PFK family [20] members of the lipid kinase family. Two insertions (Leu193-Leu196) and (Ala277-Ala290) were found within the helix 2 and in the loop between 4 and 5 respectively. These solvent exposed insertions were not modelled since their absence from the predicted model did not compromise the integrity of the Rossman fold.
The beta-sandwich fold of C–terminal sub domain (Asp331-Arg543) is formed by twelve beta strands (13-24) and two helices (7 and 8) Figure 2B. This sub domain was found in the YegS protein and in NAD kinases [19]. The beta-sandwich sub domain comprises six insertions: (Lys314-Leu316), (Ala332-Met340), (Phe369-Trp392), (Cys409-Cys411), (Tyr449-Lys453), and (Pro482-V499). All these insertions are located at the surface of the protein which did not compromise the folding of the sub domain.
The conserved GGDG motif present in the YegS as well as in NAD kinase and sphingosine kinase is also present in the CERKL protein (231-GGDGS-235) and superimposes well in the structure.
We identified the KHLK motif (residues from 444 to 447) in the wild type CERKL protein. Such motif, R/K(R/K)X(R/K) (where the R or K are occasionally replaced by H ) was identified as the NoLS in a number of nucleolar proteins [21], [22], [23], [24], and [25]. Furthermore in the CERKL protein, this motif is located in a solvent exposed helix (8) Figure 2B.
Figure 1
Figure 2
Discussion
CERKL gene encodes a ceramide kinase that is assumed to be involved in sphingolipid-mediated apoptosis in the retina. In absence of structure solved by x-ray crystallography, our results predicted that the N-terminal CERKL protein is a Pleckstrin Homology (PH) domain. Although the invariant tryptophan PH domain signature residue was identified (Trp151) by alignment with the N-terminal PH domain of the CERK protein [6], others groups failed to detect the N-terminal PH domain of CERKL protein [1] and [6]. PH domains share only limited sequence similarity making them relatively difficult to detect by the conventional methods.
Two putative nuclear localization signal sequences termed NLS1 (1-MPWRRRRNRVSA-12) and NLS2 (100-SVKLKRRCSVKQ-111) were identified in the N-terminal domain of CERKL protein by Inagaki group [2]. They also showed that a mutation in NLS1 did to not alter the subcellular localization of CERKL, while a mutation in NLS2 disrupted its nuclear import. NLS1 did not appear in our predicted model since the N-terminal 43 residues were not modeled due to the lack of homology. However, NLS2 is located within the PH domain (in the strand 5 and in the loop downstream this latter) Figure 1B.
Presence of a nuclear localization within the PH domain was recently reported in the PIKE PH domain [26]. The cytoplasmic–nuclear shuttling of PIKE was shown to be regulated by the balancing actions of the lipid-binding property of both the split PH domain and the nuclear targeting function of its nuclear localization sequence. We suggest that both NLS2 and PH domain in the CERKL protein are implicated in the regulation of the travelling trough nuclear membrane since they are present in the Wild type, in the C-terminal splice variant R379X and in the naturally occurring mutant R257X. Our suggestions were rationalised by the fact that the N-terminal splice variant (1-205) which lacks the regulatory PH domain (NLS2) was found in the cytoplasm excluded from the nucleus.
Structural analysis of the predicted model showed that the C-terminal domain is further divided into two sub domains: An N-terminal sub domain which resembles the Rossman fold nucleotide binding domain and a twelve stranded beta-sandwich fold which was found in the YegS protein and in NAD kinases. The conserved GGDG motif which has been implicated in binding to phosphate of ATP in 6-phosphofructokinase [27] and involved in nucleotide-binding in sphingosine kinase [28] is present in the CERKL protein. Mutation of the third glycine (G234D) in this motif was found to prevent the localisation of the mutant to nuclei in CERKL protein [6]. A functional ATP-binding site may be a requirement for nuclear import.
The R/K(R/K)X(R/K) motif identified as the NoLS in a number of nucleolar proteins is present in the wild type CERKL protein. This motif is located in a solvent exposed helix (8) allowing a direct interaction with other molecules which may direct or retain the CERKL protein in the nucleolus. This motif is absent from the R379X variant because the 154 missing amino acid residues are located in 17, 18, 19, 20, 21, 22, 23, 25 and more importantly in helix 8 containing the NoLS. Absence of nucleolar localization from the R379X or R257X CERKL mutants may explain their exclusion from the nucleolus and their accumulation in the nucleus.
In conclusion, molecular modelling was used to predict nuclear/nucleolar localizations in the CERKL protein implicated in the regulation of its subcellular localisation. Further studies should be carried out to substantiate our predictions.