Schallus T et al. (2008), Malectin: a novel carbohydrate-binding protein ...

XB-ART-38618

Mol Biol Cell 2008 Aug 01;198:3404-14. doi: 10.1091/mbc.e08-04-0354.

Show Gene links Show Anatomy links

Malectin: a novel carbohydrate-binding protein of the endoplasmic reticulum and a candidate player in the early steps of protein N-glycosylation.

Schallus T , Jaeckh C , Fehér K , Palma AS , Liu Y , Simpson JC , Mackeen M , Stier G , Gibson TJ , Feizi T , Pieler T , Muhle-Goll C .

???displayArticle.abstract???
N-Glycosylation starts in the endoplasmic reticulum (ER) where a 14-sugar glycan composed of three glucoses, nine mannoses, and two N-acetylglucosamines (Glc(3)Man(9)GlcNAc(2)) is transferred to nascent proteins. The glucoses are sequentially trimmed by ER-resident glucosidases. The Glc(3)Man(9)GlcNAc(2) moiety is the substrate for oligosaccharyltransferase; the Glc(1)Man(9)GlcNAc(2) and Man(9)GlcNAc(2) intermediates are signals for glycoprotein folding and quality control in the calnexin/calreticulin cycle. Here, we report a novel membrane-anchored ER protein that is highly conserved in animals and that recognizes the Glc(2)-N-glycan. Structure determination by nuclear magnetic resonance showed that its luminal part is a carbohydrate binding domain that recognizes glucose oligomers. Carbohydrate microarray analyses revealed a uniquely selective binding to a Glc(2)-N-glycan probe. The localization, structure, and binding specificity of this protein, which we have named malectin, open the way to studies of its role in the genesis, processing and secretion of N-glycosylated proteins.

???displayArticle.pubmedLink??? 18524852
???displayArticle.pmcLink??? PMC2488313
???displayArticle.link??? Mol Biol Cell
???displayArticle.grants??? [+]

Species referenced: Xenopus laevis
Genes referenced: acta4 canx man1a1 mlec pdia2 sult2a1 txk

???attribute.lit??? ???displayArticles.show???

	Figure 1. Broad expression of malectin in embryonic and adult X. laevis. (A) Analysis of embryos by whole mount in situ hybridization showing malectin expression in the anterior neuroectoderm (ne) and neural crest (nc) at stages 18 and 20 (A1, A2), and at later stages, e.g., 32, in the hatching gland (hg), retina (re), otic vesicle (ot), epibranchial placodes (eb), pronephros (pn), and the tail tip (tp); anterior (a); posterior (p). At stage 41 (A4), transcripts are detected in the liver (li), dorsal and ventral pancreas (dp and vp), branchial arches (ba), and the proctodeum (pd). (B) Analysis of embryonic expression of malectin by RT-PCR showing expression in the oocyte (stage VI) and continued expression throughout development, and the contrasting expression of protein disulfide isomerase, xPDIp, only from late tadpole stage 39 onward. Abbreviations: stage VI oocyte (VI); unfertilized egg (0) and fertilized egg (1). (C) The expression analysis in adult tissue by semiquantitative RT-PCR showing a broad distribution of malectin in comparison with xPDIp, which is detected in pancreas and stomach only. Additional abbreviations not defined in A: gall bladder (gb); heart (he); intestine (in); kidney (ki); lung (lu); muscle (mu); pancreas (pa); stomach (st).
	Figure 2. Sequence alignment of malectin proteins in animals. Malectin proteins are composed of an N-terminal signal peptide (SP, AA 1-26), a C-terminal transmembrane helix (TM; AA 255-274) and a highly conserved central part of 190 residues followed by an acidic, glutamate-rich region. The secondary structure elements derived from the experimental structure (see Figure 4A) are shown on top of the amino acid sequence; and the four aromatic residues (Y67, Y89, Y116, and F117) and D186 mediating the carbohydrate interaction are marked by red crosses (Xen, Xenopus laevis 100/100; Hum, Homo sapiens 89/95; Mou, Mus musculus 86/94; Hen, Gallus gallus 84/96, Fly, Drosophila melanogaster 41/58; Aed, Aedes aegyptii 44/62; Cae, Caenorhabditis elegans 36/58; Sch, Schistosoma japonicum 42/59; Nem, Nematostella vectensis 51/69). The bracketed numbers represent the percentage amino acid conservation in comparison with the X. laevis malectin protein (identities/similarities).
	Figure 3. ER localization of malectin. (A) Schematic drawing of the FLAG-tagged malectin constructs used for transient transfection experiments. N-FLAG-malectin represents the full-length protein including a FLAG-tag (F) between the predicted N-terminal signal peptide (red, SP; AA 1-26) and the conserved lectin-like domain (blue, LLD; AA 27-213) that is followed by a hydrophobic C-terminal domain (yellow, HD; AA 255-274). The deletion construct NFLAG- malectin lacks the N-terminal SP. (B) Immunofluorescence analysis of U-2 OS cells for FLAG-tagged malectin (green) and the ER marker calnexin (red) showing that malectin and calnexin colocalize in ER structures (arrows indicate presence of both markers in the nuclear envelope). (C) Cytoplasmic localization of N-FLAGmalectin after deletion of the predicted N-terminal signal peptide. Note that N-FLAG-malectin also diffuses to the nucleus (asterisks) and can be found in protein aggregates (arrow heads). Bar, 10 m. Malectin Binds Glc2-N-glycan Vol. 19,
	Figure 4. Structure of the main domain of malectin and of the malectin–nigerose complex. (A) Ensemble of the 10 lowest energy structures (out of 100 calculated) after water refinement. The four loops (L1–L4), which could only be assigned in the presence of a carbohydrate ligand, are highlighted in green: L1, G62-G68; L2, T86-N90; L3, E114-A118; and L4, Y185-N187. (B) Ribbon representation of malectin. Secondary structure elements are colored in red and blue for -helices (1–3) and -strands (1–12), respectively. (C) Ensemble of the 10 lowest energy structures of the malectin–nigerose complex. Only the ligand-binding pocket is shown. The four aromatic residues and D186 mediating the interaction are relatively well defined. Yellow, Y89; cyan, Y67; magenta, Y116; blue, F117; and brown, D186. Nigerose is presented in green (D). Detailed view of the malectin–nigerose interaction. Nigerose is sandwiched by Y67, Y89, Y116, and F117. The nonreducing and reducing residues of nigerose are labeled Glc-A and Glc-B, respectively. Oxygen atoms of the carbohydrate are highlighted as red spheres. The orange arrow points to the oxygen atom of the C-2 hydroxyl group of Glc-A, where the outermost glucose residue of Glc3-N-glycan would be attached. The magenta arrow highlights the oxygen atom of the C-1 hydroxyl group of Glc-B (-form), where the polymannose part of the Glc2-N-glycan would be continued. The oxygen atom of the equatorial C-2 hydroxyl group of Glc-B is marked by the yellow arrow. If at the Glc-B position there was a mannose residue, as in Glc1-N-glycan, the stacking interaction would be hindered as there would be an axial hydroxyl group pointing toward Y116 and F117. (E) Schematic drawing of the Glc3-Man9-N-glycan. Glucose is depicted as green circles, mannose as yellow squares, and GlcNAc as blue circles.
	Figure 5. One dimensional STD spectra for glucose and glucose disaccharides: glucose (A), cellobiose (B), maltose (C) nigerose (D), kojibiose (E), and isomaltose (F). Structures of the tested carbohydrates are depicted on the left of the corresponding spectra. and denote the stereoisomers of the anomeric center of the reducing-end glucose ring. The ring protons of the nonreducing residues are labeled 1â€“6, and those of the reducing residues 1 â€“6 . Assignment tables of the carbohydrate ligands are in Supplemental Table S4.
	Figure S1: Sequence alignment of animal malectin proteins vs. Arabidopsis thaliana and Oryza sativa RLK proteins. (A) Alignment of representative arabidopsis and rice RLK-malectin-like domains with the malectin core domain of X. laevis, humans, rat and mouse. Sequences are labelled with UniProtKB/TrEMBL accession numbers. (XENLA = Xenopus laevis, DROME = Drosophila melanogaster, SCHJA = Schistosoma japonicum, CAEEL = Caenorhabditis elegans, CRYPV = Cryptospridium parvum, ARATH = Arabidopsis thaliana, ORYSA = Oryza sativa). The aromatic residues and the aspartate that in X. laevis mediate interactions with the glucose residues (Fig.4D) are marked by red crosses, and are not conserved in plants. (B) Domain topologies of plant and animal proteins that contain the malectin core domain. The two topologies among plant RLKs are shown. Labels: SP - signal peptide; TM- transmembrane helix; LRR - leucine rich repeat: STKinase – serine/threonine receptor-like kinase.
	Figure S2: Malectin binding to glucose disaccharides studied by isothermal titration calorimetry. Kojibiose (A), nigerose (B) and maltose (C). The raw data are shown in the upper panel, and the integrated heat data, corrected for dilution, are shown in the lower panel. ITC measurements were carried out using a VP-ITC Mircocal clorimeter (Mircocal, Northhampton, MA, USA) in 20mM phosphate buffer (pH 6.8), 150mM KCl and 1mM TCEP. A typical titration consisted of injecting 10μl of the sugar into the malectin sample, at time intervals of 5min, to ensure that the titration peak returned to the baseline.
	Figure S3: NOEs between malectin and nigerose. (A) Part of a 13C-edited half-filtered-NOESY experiment (mixing time 150 ms) showing intermolecular NOEs between malectin and nigerose. (B) Structure of α-nigerose.
	Figure S4: Microarray analyses of the interactions of malectin with Glc1-, Glc2- and Glc3-high mannose N-glycans and gluco-oligosaccharide probes. The oligosaccharide probes were printed as duplicate spots and binding was assayed with malectin at 20, 5, 1 and 0.5 Î¼g/ml (panels A to D, respectively). Numerical scores are shown for the binding signals [means of duplicate values at 2 and 7 fmol/spot, (blue and red bars, respectively) with error bars]. At a malectin concentration of 20 Î¼g/ml, the binding signals for the Glc2-high mannose N-glycan probe, both at 2 and 7 fmol, were too high to be accurately quantified (asterisk in A) and were annotated as >> 50000 in Table 1. Other oligosaccharide probes tested included the glucose disaccharides kojibiose (GlcÎ±1-2Glc) nigerose (GlcÎ±1-3Glc), maltodextrins (GlcÎ±1-4Glc, dp 2-7); and oligosaccharides from dextran (isomalto) (GlcÎ±1-6Glc, dp 2-7); laminarin (GlcÎ²1-3Glc, dp 2-7); cellulose (GlcÎ²1-4Glc, dp 2-6); and pustulan (GlcÎ²1-6Glc, dp 2-7). Abbreviations G3N, G2N and G1N designate Glc3Man7(D1)GlcNAc, Glc2Man7(D1)GlcNAc and Glc1Man9GlcNAc2 N-glycan probes, respectively; dp, degree of polymerization of the gluco-oligomers.
	mlec (malectin) gene expression in Xenopus laevis embryo, assayed via in situ hybridization, NF stage 20, dorsal view, anterior left, dorsal up.
	mlec (malectin) gene expression in Xenopus laevis embryo, assayed via in situ hybridization, NF stage 32, lateral view, anterior left, dorsal up.
	mlec (malectin) gene expression in Xenopus laevis embryo, assayed via in situ hybridization, NF stage 41, lateral view, anterior left, dorsal up.

References [+] :

Afelik, Pancreatic protein disulfide isomerase (XPDIp) is an early marker for the exocrine lineage of the developing pancreas in Xenopus laevis embryos. 2004, Pubmed, Xenbase

Afelik, Pancreatic protein disulfide isomerase (XPDIp) is an early marker for the exocrine lineage of the developing pancreas in Xenopus laevis embryos. 2004, Pubmed , Xenbase
Alonso, Effect of bromoconduritol on glucosidase II from rat liver. A new kinetic model for the binding and hydrolysis of the substrate. 1993, Pubmed
Altschul, Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. 1997, Pubmed
Apweiler, On the frequency of protein glycosylation, as deduced from analysis of the SWISS-PROT database. 1999, Pubmed
Banerjee, The evolution of N-glycan-dependent endoplasmic reticulum quality control factors for glycoprotein folding and degradation. 2007, Pubmed
Barile, Large scale protein identification in intracellular aquaporin-2 vesicles from renal inner medullary collecting duct. 2005, Pubmed
Boraston, Carbohydrate-binding modules: fine-tuning polysaccharide recognition. 2004, Pubmed
Brada, Isolation of a homogeneous glucosidase II from pig kidney microsomes. 1984, Pubmed
Burda, The ALG10 locus of Saccharomyces cerevisiae encodes the alpha-1,2 glucosyltransferase of the endoplasmic reticulum: the terminal glucose of the lipid-linked oligosaccharide is required for efficient N-linked glycosylation. 1998, Pubmed
Cornilescu, Protein backbone angle restraints from searching a database for chemical shift and sequence homology. 1999, Pubmed
Cosson, Coatomer interaction with di-lysine endoplasmic reticulum retention motifs. 1994, Pubmed
Dejgaard, The ER glycoprotein quality control system. 2004, Pubmed
Delaglio, NMRPipe: a multidimensional spectral processing system based on UNIX pipes. 1995, Pubmed
Deprez, More than one glycan is needed for ER glucosidase II to allow entry of glycoproteins into the calnexin/calreticulin cycle. 2005, Pubmed
Dosztányi, IUPred: web server for the prediction of intrinsically unstructured regions of proteins based on estimated energy content. 2005, Pubmed
Finn, Pfam: clans, web tools and services. 2006, Pubmed
Grinna, Substrate specificities of rat liver microsomal glucosidases which process glycoproteins. 1980, Pubmed
Hebert, An MBoC favorite: Malectin: a novel carbohydrate-binding protein of the endoplasmic reticulum and a candidate player in the early steps of protein N-glycosylation. 2012, Pubmed
Helenius, Roles of N-linked glycans in the endoplasmic reticulum. 2004, Pubmed
Johnson, NMR View: A computer program for the visualization and analysis of NMR data. 1994, Pubmed
Kaushal, Purification to homogeneity and properties of glucosidase II from mung bean seedlings and suspension-cultured soybean cells. 1990, Pubmed
Letunic, SMART 5: domains in the context of genomes and networks. 2006, Pubmed
Linge, Automated assignment of ambiguous nuclear overhauser effects with ARIA. 2001, Pubmed
Mayer, Characterization of Ligand Binding by Saturation Transfer Difference NMR Spectroscopy. 1999, Pubmed
Meyer, NMR spectroscopy techniques for screening and identifying ligand binding to protein receptors. 2003, Pubmed
Mizushima, Structural basis of sugar-recognizing ubiquitin ligase. 2004, Pubmed
Niehrs, Mesodermal patterning by a gradient of the vertebrate homeobox gene goosecoid. 1994, Pubmed , Xenbase
Palma, Ligands for the beta-glucan receptor, Dectin-1, assigned using "designer" microarrays of oligosaccharide probes (neoglycolipids) generated from glucan polysaccharides. 2006, Pubmed
Petrescu, The solution NMR structure of glucosylated N-glycans involved in the early stages of glycoprotein biosynthesis and folding. 1997, Pubmed
Schrag, The Structure of calnexin, an ER chaperone involved in quality control of protein folding. 2001, Pubmed
Shiu, Expansion of the receptor-like kinase/Pelle gene family and receptor-like proteins in Arabidopsis. 2003, Pubmed
Sousa, Recognition of the oligosaccharide and protein moieties of glycoproteins by the UDP-Glc:glycoprotein glucosyltransferase. 1992, Pubmed
Totani, Substrate specificity analysis of endoplasmic reticulum glucosidase II using synthetic high mannose-type glycans. 2006, Pubmed
Wishart, 1H, 13C and 15N chemical shift referencing in biomolecular NMR. 1995, Pubmed
Woods, The high degree of internal flexibility observed for an oligomannose oligosaccharide does not alter the overall topology of the molecule. 1998, Pubmed