the community site for and by developmental biologists

The long road to understanding homeobox genes in the nervous system

Posted by , on 1 October 2020

Following the initial discovery of the homeobox in the 1980s, first in invertebrates and then in vertebrates, it quickly became clear that homeobox genes come in two flavors – the Antennapedia-like HOX cluster genes and the many more non-clustered genes with diverse sequence and expression features (Gehring, 1998). One theme that became evident through expression and mutant analysis in a variety of organisms was the selective expression and function of homeobox genes within the nervous system (Gehring, 1998).

When I started to look for postdoctoral positions in the early 1990s, I was particularly intrigued by mutant phenotypes of several fly and worm homeobox genes (Blochlinger et al., 1988; Doe et al., 1988; Finney and Ruvkun, 1990; Way and Chalfie, 1988), but also by the work of the late Tom Jessell, who proposed a LIM homeobox code in the vertebrate spinal cord (Tsuchida et al., 1994). The simplicity and well-characterized nature of the C. elegans nervous system, as well as its genetic amenability, were very appealing to me and, in 1996, I decided to join Gary Ruvkun’s lab. Gary’s lab had not only characterized one of the first C. elegans homeobox genes, unc-86 (Finney and Ruvkun, 1990; Finney et al., 1988); Thomas Bürglin in Gary’s lab had also used library screening with degenerate probes, a method that now, in the post-genome era, seems quite archaic, to discover the abundance of homeobox genes in this simple organism (Burglin et al., 1989).

In Gary’s lab, I set out to study the expression and function of the LIM homeobox subfamily, whose members were initially discovered by Marty Chalfie (Way and Chalfie, 1988) and implicated further in neuronal identity specification by Tom Jessell’s lab (Tsuchida et al., 1994). Using emerging GFP reporter technology (Chalfie et al., 1994) and mutant analysis, I determined what turned out to be mostly incomplete expression patterns (owing to the shortcomings of “classic” reporter genes, which often contained just fractions of their surrounding gene regulatory regions) and mutant phenotypes that could only be analyzed very superficially (owing to a shortage of markers that would have allowed a more in-depth analysis) (Hobert et al., 1997; Hobert et al., 1998; Hobert et al., 1999).

After I started my own lab at Columbia University in 1999, a string of students and postdocs (Zeynep Altun, Adam Wenick, Ephraim Tsalik, Feifan Zhang, Pat Gordon, Vincent Bertrand, Maria Doitsidou, Nuria Flames, Rich Poole, Paschalis Kratsios, Marie Gendrel, Esther Serrano-Saiz, Laura Pereira, among others) continued to work on a small number of specific homeobox genes, digging much deeper into what these genes did in the nervous system. One theme that continued to emerge throughout this analysis was that not only the classic unc-86 and mec-3 genes, studied in impressive depth by Marty Chalfie over the years (Chalfie, 1995), but other homeobox genes as well had a remarkably broad effect on the differentiation of specific neuron types. Rather than regulating only some subset of specific identity features in a neuron, several homeobox genes fulfilled a “master regulatory” role in controlling most, if not all, known identity features of a neuron, through direct initiation and maintenance of terminal differentiation gene batteries. This led me to propose the concept of “terminal selectors” of neuronal identity, a term extended from the Drosophila field, where the term “selector genes” was coined for genes that act earlier in development to specify the identity of developing fields and tissues (Hobert, 2016).

This trajectory finally led to the work of Molly Reilly, a graduate student in my lab, who recently set out to achieve the ambitious goal of describing the expression patterns of the entire homeobox gene family across the entire C. elegans nervous system (Reilly et al., 2020). This tremendous leap forward was, as is so often the case, enabled by novel technology. First, gene expression patterns, or even better, protein expression patterns, can now be identified much more reliably than by simply extracting some arbitrary small regulatory region adjacent to the gene of interest to drive a reporter gene. Rather, bacterial recombineering technology enables the reporter tagging of genes in the context of very large genomic intervals containing many genes up- and downstream of the gene of interest (Tursun et al., 2009). Moreover, CRISPR/Cas9 technology now even allows for reporter tagging of an entire locus in its endogenous context (Dickinson et al., 2013). But even with good reagents at hand, identifying sites of expression of a reporter gene across the entire nervous system has traditionally been no small feat, because neurons in C. elegans are tightly packed and their position can be locally variable. Here is where Eviatar Yemini, a postdoc in my lab, came in to solve the long-standing problem of neuronal cell identification. Using multiple distinct fluorophores (excluding GFP), Eviatar built a multicolor landmark strain, NeuroPAL, which, unlike Brainbow-style technology, assigns neurons a strictly deterministic color code (Yemini et al., 2019). Crossing NeuroPAL with a GFP reporter strain enables unambiguous identification of the sites of gene expression anywhere in the nervous system (Figure 1).


Figure 1: Examples of homeobox reporter gene expression patterns. The NeuroPAL transgene (left panel) was crossed to these reporters to unambiguously identify sites of homeobox gene expression. Images courtesy of Molly Reilly and Ev Yemini.


Molly exploited these technological advances to (a) tag all but one of the 102 homeobox genes of C. elegans with a fluorescent reporter and (b) identify their sites of expression throughout the entire nervous system. What she found was something I could barely have dreamed of when starting my postdoc in Gary’s lab: not only are most of the conserved homeobox genes sparsely expressed throughout the nervous system of the worm, but each of the 118 different neuron classes also displays a unique combination of homeobox genes (Figure 2).


Figure 2: Homeobox codes. Shown are all the homeobox gene expression patterns that contribute to neuron class-specific expression. Homeobox genes are on top, neuron classes on the left. Reproduced from Reilly et al., 2020.


Homeobox genes are thus a comprehensive “descriptor” of neuronal diversity throughout an entire nervous system – a homeobox code for all neurons! Furthermore, the mapping of these homeobox genes led another graduate student, Cyril Cros, to find that neurons previously not known to express or require a homeobox gene do indeed require one for their identity specification (Reilly et al., 2020).
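
For readers who like to think in terms of data, the statement that every neuron class has a unique homeobox code can be pictured as a simple check over an expression table. The sketch below is only an illustration of that idea, not the analysis used in Reilly et al., 2020: the function name, the class names and the gene names are all hypothetical placeholders, and the toy table is made up rather than taken from real expression data.

```python
from collections import defaultdict


def find_shared_codes(expression):
    """Group neuron classes by their exact set of expressed genes.

    Returns only the groups containing more than one class, i.e. the
    codes that fail to tell classes apart. An empty result means that
    every class carries a unique combination of genes.
    """
    classes_by_code = defaultdict(list)
    for neuron_class, genes in expression.items():
        # frozenset makes the gene combination hashable and order-independent
        classes_by_code[frozenset(genes)].append(neuron_class)
    return {
        code: sorted(names)
        for code, names in classes_by_code.items()
        if len(names) > 1
    }


# Toy, made-up table: placeholder names, NOT real expression patterns.
toy_expression = {
    "class_1": {"hbx_A", "hbx_B"},
    "class_2": {"hbx_A", "hbx_C"},
    "class_3": {"hbx_B", "hbx_C"},
    "class_4": {"hbx_A", "hbx_B", "hbx_C"},
}

shared = find_shared_codes(toy_expression)
print("all codes unique:", not shared)
```

Note that in this toy table no single gene is restricted to one class; the classes are distinguished only by the combination of genes they express, which is the sense in which the word “code” is used here.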

This is not the end of the road. The lab remains motivated to test whether every single C. elegans neuron indeed not only expresses, but also requires, a homeobox gene for its identity specification. Moreover, the extent to which we can reprogram the identity of neurons by respecifying their homeobox codes remains little explored. I am looking forward to seeing whether work in other systems with more complex brains will also uncover the broad employment of homeobox codes. Recent transcriptome analysis in restricted parts of the fly and mouse CNS indeed provides tantalizing hints of similar specificity and selectivity of homeobox gene expression in more complex nervous systems (Allen et al., 2020; Davis et al., 2020; Sugino et al., 2019).




Allen, A.M., Neville, M.C., Birtles, S., Croset, V., Treiber, C.D., Waddell, S., and Goodwin, S.F. (2020). A single-cell transcriptomic atlas of the adult Drosophila ventral nerve cord. eLife 9.

Blochlinger, K., Bodmer, R., Jack, J., Jan, L.Y., and Jan, Y.N. (1988). Primary structure and expression of a product from cut, a locus involved in specifying sensory organ identity in Drosophila. Nature 333, 629-635.

Burglin, T.R., Finney, M., Coulson, A., and Ruvkun, G. (1989). Caenorhabditis elegans has scores of homoeobox-containing genes. Nature 341, 239-243.

Chalfie, M. (1995). The differentiation and function of the touch receptor neurons of Caenorhabditis elegans. Prog Brain Res 105, 179-182.

Chalfie, M., Tu, Y., Euskirchen, G., Ward, W.W., and Prasher, D.C. (1994). Green fluorescent protein as a marker for gene expression. Science 263, 802-805.

Davis, F.P., Nern, A., Picard, S., Reiser, M.B., Rubin, G.M., Eddy, S.R., and Henry, G.L. (2020). A genetic, genomic, and computational resource for exploring neural circuit function. eLife 9.

Dickinson, D.J., Ward, J.D., Reiner, D.J., and Goldstein, B. (2013). Engineering the Caenorhabditis elegans genome using Cas9-triggered homologous recombination. Nat Methods 10, 1028-1034.

Doe, C.Q., Hiromi, Y., Gehring, W.J., and Goodman, C.S. (1988). Expression and function of the segmentation gene fushi tarazu during Drosophila neurogenesis. Science 239, 170-175.

Finney, M., and Ruvkun, G. (1990). The unc-86 gene product couples cell lineage and cell identity in C. elegans. Cell 63, 895-905.

Finney, M., Ruvkun, G., and Horvitz, H.R. (1988). The C. elegans cell lineage and differentiation gene unc-86 encodes a protein with a homeodomain and extended similarity to transcription factors. Cell 55, 757-769.

Gehring, W.J. (1998). Master Control Genes in Development and Evolution: The Homeobox Story (Yale University Press).

Hobert, O. (2016). Terminal Selectors of Neuronal Identity. Curr Top Dev Biol 116, 455-475.

Hobert, O., D’Alberti, T., Liu, Y., and Ruvkun, G. (1998). Control of neural development and function in a thermoregulatory network by the LIM homeobox gene lin-11. J Neurosci 18, 2084-2096.

Hobert, O., Mori, I., Yamashita, Y., Honda, H., Ohshima, Y., Liu, Y., and Ruvkun, G. (1997). Regulation of interneuron function in the C. elegans thermoregulatory pathway by the ttx-3 LIM homeobox gene. Neuron 19, 345-357.

Hobert, O., Tessmar, K., and Ruvkun, G. (1999). The Caenorhabditis elegans lim-6 LIM homeobox gene regulates neurite outgrowth and function of particular GABAergic neurons. Development 126, 1547-1562.

Reilly, M.B., Cros, C., Varol, E., Yemini, E., and Hobert, O. (2020). Unique homeobox codes delineate all the neuron classes of C. elegans. Nature 584, 595-601.

Sugino, K., Clark, E., Schulmann, A., Shima, Y., Wang, L., Hunt, D.L., Hooks, B.M., Trankner, D., Chandrashekar, J., Picard, S., et al. (2019). Mapping the transcriptional diversity of genetically and anatomically defined cell populations in the mouse brain. eLife 8.

Tsuchida, T., Ensini, M., Morton, S.B., Baldassare, M., Edlund, T., Jessell, T.M., and Pfaff, S.L. (1994). Topographic organization of embryonic motor neurons defined by expression of LIM homeobox genes. Cell 79, 957-970.

Tursun, B., Cochella, L., Carrera, I., and Hobert, O. (2009). A toolkit and robust pipeline for the generation of fosmid-based reporter genes in C. elegans. PLoS ONE 4, e4625.

Way, J.C., and Chalfie, M. (1988). mec-3, a homeobox-containing gene that specifies differentiation of the touch receptor neurons in C. elegans. Cell 54, 5-16.

Yemini, E., Lin, A., Nejatbakhsh, A., Varol, E., Sun, R., Mena, G.E., Samuel, A.D.T., Paninski, L., Venkatachalam, V., and Hobert, O. (2019). NeuroPAL: A Neuronal Polychromatic Atlas of Landmarks for Whole-Brain Imaging in C. elegans. bioRxiv.

