To dump a chunk of trunk

The Mammal has deemed that Hox genes and good old-fashioned feel-good evo-devo are a good way to blink back to life*. Also, tardigrades. Tardigrades are awesome. Here is one viewed from above, from the Goldstein lab via Encyclopedia of Life:


Tardigrades or water bears are also a bit unusual. Their closest living relatives are velvet worms (Onychophora) and arthropods. Exactly who’s closest to whom in that trio of phyla collectively known as the Panarthropoda is not clear, and I don’t have the energy to wade into the debate – besides, it’s not really important for the purposes of this post. What Smith et al. (2016) concluded about these adorably indestructible little creatures holds irrespective of their precise phylogenetic position.

Anyway. I said tardigrades were unusual, and I don’t mean their uncanny ability to survive the apocalypse¬†and pick up random genes in the process (Boothby et al., 2015). (ETA: so apparently there may not be nearly as much foreign gene hoarding as the genome paper suggests – see Sujai Kumar’s comment below! Doesn’t change the fact that tardigrades are tough little buggers, though ūüôā ) The oddity we’re interested in today¬†lies in the fact that all known species are built to the exact same compact body plan.¬†Onychophorans and many arthropods are elongated animals with lots of segments, lots of legs, and often lots of variation in the number and type of such body parts. Tardigrades? A wee head, four chubby pairs of legs, and that’s it.

How does a tardigrade body relate to that of a velvet worm, or a centipede, or a spider? Based solely on anatomy, that’s a hell of a question to answer; even the homology¬†of body parts between different kinds of arthropods can be difficult to determine. I have so far remained stubbornly uneducated on the minutiae of (pan)arthropod segment homologies, although I do see papers purporting to match brain parts, appendages¬†and suchlike between different kinds of creepy-crawlies¬†on a fairly regular basis. Shame on me for not being able to care about the details, I guess – but¬†the frequency with which the subject comes up suggests that the debate is far from over.

Now, when I was first drawn¬†to the evo-devo field, one of the biggest attractions was the notion that the expression of genes as a body part forms¬†can tell us what that body part really is even when anatomical clues are less than clear. That, of course, is too good to be simply true, but sometimes the lure of genes and neat¬†homology stories is just too hard to resist. Smith et al.‘s investigation of¬†tardigrade Hox genes is definitely that kind of story.

Hox genes are generally a good place to look if you’re trying to decipher¬†body regions, since their more or less neat, orderly expression patterns are remarkably conserved between very distantly related animals (they are¬†probably as old as the Bilateria, to be precise). A polychaete worm, a vertebrate and an arthropod show the same general pattern – there is no active Hox gene at the very front of the embryo, then Hoxes 1, 2, 3 and so on appear in roughly that order,¬†all the way to the rear end. There are variations in the pattern – e.g. the expression of a gene¬†can have sharp boundaries or fade in and out gradually; different genes can overlap to different extents, the order isn’t always perfect, etc. – but staggered Hox gene expression domains, with the same genes starting up in the same general area along the main body axis, can be found all across the Bilateria.

Tardigrades are no exception, in a sense – but they are also quite exceptional. First, their complement of Hox genes is a bit of a mess. At long last, we have a tardigrade genome to hand, in which Smith et al. (2016) found good honest¬†Hox genes. What they didn’t find was a Hox cluster, an orderly series of Hox genes sitting like beads on a DNA string. Instead, the Hox genes in Hypsibius dujardini, the sequenced species,¬†are all over the genome, associating with all kinds of dubious fellows¬†who aren’t Hoxes.

What Smith et al.¬†also didn’t find was half of the Hox genes¬†they expected. A typical arthropod has ten or so Hox genes, a pretty standard ballpark for an animal that isn’t a vertebrate. H. dujardini has only seven, three of which are triplicates of Abdominal-B, a gene that normally exists in a single copy in arthropods. So basically, only five kinds of Hox gene – number two and most of the “middle” ones are missing. What’s more, two more tardigrades that aren’t closely related to H. dujardini also appear to have the same five Hox gene types (though¬†only one Abd-B each), so this massive loss is probably a common feature of Tardigrada. (No word on whether the scattering of the Hox ¬†cluster¬†is also shared by the¬†other two species.)

We know that the genes are scattered and decimated, but are their expression patterns similarly disrupted? You don’t actually need an intact Hox cluster for orderly Hox expression, and indeed, tardigrade Hox genes are activated in a perfectly neat and perfectly usual pattern that resembles what you see in their panarthropod cousins. Except for the bit where half the pattern is missing!

Here’s¬†part of Figure 4 from the paper, a schematic comparison of tardigrade Hox expression to that of other panarthropods – a generic arachnid, a millipede and a velvet worm. (otd is a “head” gene that¬†lives in the Hox-free anterior region;¬†lab is the arthropod equivalent of¬†Hox1, Dfd is Hox4, and I’m not sure which of Hox6-8 ftz is currently supposed to be.) The interesting thing about this is that according to Hox genes, the entire body of the tardigrade corresponds to just the front end of arthropods and velvet worms.


In addition, one thing that is not shown on this diagram is that Abdominal-B, which normally marks the butt¬†end of the animal, is still active in the tardigrade, predictably in the last segment (L4, that is). So if you take the Hox data at face value, a tardigrade is the arse end of an arthropod tacked straight onto its head. Weird. It’s like evolution took a perfectly ordinary velvet worm-like creature and chopped out most of its trunk.

The tardigrade data suggest¬†that the original panarthropod was probably more like arthropods and velvet worms than tardigrades – an elongated animal with many segments.¬†The strange tardigrade situation can’t be the ancestral one, since the Hox genes that tardigrades lack long predate the panarthropod ancestor. Now, it¬†might be¬†possible to lose half your Hox genes while keeping your¬†ancestral body plan, but an unusual body plan and an unusual set of Hox genes is a bit of a big coincidence, innit?

Smith et al. point out that the loss of the Hox genes was unlikely to be the cause of the loss of the trunk region¬†– Hox genes only specify what grows on a segment, they don’t have much say in how many segments develop in the first place. Instead, the authors reason, the loss of the trunk in the tardigrade ancestor probably made the relevant¬†Hox genes dispensable.

Damn, this story makes me want to see the Hox genes of all those oddball lobopodians from the Cambrian. Some of them are bound to be tardigrade relatives, right?



Boothby TC et al. (2015) Evidence for extensive horizontal gene transfer from the draft genome of a tardigrade. PNAS 112:15976-15981

Smith FW et al. (2016) The compact body plan of tardigrades evolved by the loss of a large body region. Current Biology 26:224-229


*The Mammal has been pretty depressed lately. As in mired up to her head in weird energy-sucking flu. Unfortunately, writing is one of those things that the damn brain monster has eaten most of the fun out of. Also, I have a shitty normal person job at the moment, and shitty job taking up time + barely enough motivation to crawl out of bed and pretend to be human¬†means I have, at best, one afternoon per week that I actually spend on catching up with science. That is just enough to scroll through my feeds and file away the interesting stuff, but woefully insufficient for the writing of posts, not to mention that my ability to concentrate is, to be terribly technical, absolutely fucked. It’s not an ideal state of affairs by any stretch, and I’m pretty sure that if I made more of an effort to read and write about cool things, it would pay off in the mental health department, but‚Ķ well. That sort of reasonable advice is hard to hear with the oozing fog-grey suckers of that thing clamped onto my brain.

In which a “living fossil’s” genome delights me

I promised myself I wouldn’t go on for thousands and thousands of words about the Lingula genome paper (I’ve got things to do, and there is a LOT of stuff in there), but I¬†had¬†to indulge myself a little bit. Four or five years ago when I was a final year undergrad trying to figure out things about Hox gene evolution, I would have killed for a complete brachiopod genome. Or even a complete brachiopod Hox cluster. A year or two ago, when I was trying to sweat out something resembling a PhD thesis, I would have killed for some information about the genetics of brachiopod shells that amounted to more than tables of amino acid abundances. Too late for my poor dissertations, but a brachiopod genome is finally sequenced! The paper is right here, completely free (Luo et al., 2015). Yay for labs who can afford open-access publishing!

In case you’re not familiar with Lingula, it’s this guy (image from Wikipedia):

In a classic case of looks being deceiving, it’s not a mollusc, although it does look a bit like one except for the weird white stalk sticking out of the back of its shell. Brachiopods, the phylum to which Lingula belongs, are one of those strange groups no one really knows where to place, although nowadays¬†we are pretty sure they are somewhere in the general vicinity of molluscs, annelid worms and their ilk.¬†Unlike bivalve molluscs, whose shell valves are on the left and right sides of the animal, the shells of brachiopods like Lingula have top and bottom¬†valves. Lingula‘s shell is also made of different materials: while bivalve¬†shells contain¬†calcium carbonate deposited into a mesh of chitin and silk-like proteins,* the subgroup of brachiopods Lingula belongs to uses calcium phosphate, the same mineral that dominates our bones, and a lot of collagen (again like bone). But we’ll come back to that in a moment…

One of the reasons the Lingula genome is particularly interesting is that Lingula is a classic “living fossil”. In the Paleobiology Database, there’s even an entry for a Cambrian fossil classified as Lingula, and there are plenty of entries from the next geological period. If the database is to be believed, the genus¬†Lingula has existed for something like 500 million years, which must be some kind of record for an animal.** Is its genome similarly conservative? Or did the DNA hiding under a deceptively conservative shell design evolve as quickly as anyone’s?

In a heroic feat of self-control, I’m not spending all night¬†poring over the paper, but I did give a couple of interesting sections a look. Naturally, the first thing I dug out was the Hox cluster hiding in the rather large supplement. This was the first clue that Lingula‘s genome¬†is definitely “living”¬†and not at all a fossil in any sense of the word. If it were, we’d expect one neat string of Hox genes, all in the order we’re used to from other animals. Instead, what we find is two missing genes, one plucked¬†from the middle of the cluster and tacked onto its “front” end, and two genes totally detached from the rest. It’s not too bad as Hox cluster disintegration goes – six out of nine genes are still neatly ordered – but it certainly doesn’t look like something left over from the dawn of animals.

The bigger clue that caught my eye, though, was this little family tree in Figure 2:


The red numbers on each branch indicate the number of gene families that expanded or first appeared in that lineage, and the green numbers are the families shrunk or lost. Note that our “living fossil” takes the lead in both. What I find funny is that it’s miles ahead of not only the animals generally considered “conservative” in terms of genome evolution, like the limpet Lottia and the lancelet Branchiostoma, but also the sea squirt (Ciona). Squirts are notorious for having incredibly fast-evolving genomes; then again, most of that notoriety was based on the crazily divergent sequences and often wildly scrambled order of its genes. A genome can be conservative in some ways and highly innovative in others. In fact, many of the genes involved in basic cellular functions are very slow-evolving in Lingula. (Note also: humans are pretty slow-evolving as far as gene content goes. This is not the first study to find that.)

So, Lingula, living fossil? Not so much.

The last bit I looked at was the section about shell genetics. Although it’s generally foolish to expect the shell-forming gene sets of two animals from different phyla to be similar (see my first footnote), if there are similarities, they could potentially go at least two different ways. First, brachiopods might be quite close to molluscs, which is the hypothesis Luo et al.‘s own treebuilding efforts support. Like molluscs, brachiopods also have a specialised mantle that secretes shell material, though having the same name doesn’t mean¬†the two “mantles” actually share a common origin. So who knows, some molluscan shell proteins, or shell regulatory genes, might show up in Lingula, too.

On the other hand, the composition of Lingula’s shell is more similar to our skeletons’. So, since they have to capture the same mineral, could the brachiopods share some of our skeletal proteins? The answer to both questions seems to be “mostly no”.

Molluscan shell matrix proteins, those that are actually built into the structure of the shell, are quite variable even within Mollusca. It’s probably not surprising, then, that most of the relevant genes¬†that are even present in Lingula are not specific to the mantle, and those that are are the kinds of genes that are generally involved in the handling of calcium or the building of the stuff around cells¬†in all kinds of contexts. Some of the regulatory mechanisms might be shared – Luo et al. report that BMP signalling seems to be going on around the edge of the mantle in baby Lingula, and this cellular signalling system is also involved in molluscan shell formation. Then again, a handful of similar signalling systems “are involved” in bloody everything in animal development, so how much we can deduce from this similarity is anyone’s guess.

As for “bone genes” – the ones that are most characteristically¬†tied to bone are missing (disappointingly or reassuringly, take your pick). The SCPP protein¬†family is so far known only from vertebrates, and its various members are involved in the mineralisation of bones and teeth. SCPPs originate from an ancient protein called SPARC, which seems to be generally present wherever collagen is (IIRC, it’s thought¬†to help collagen fibres arrange themselves correctly). Lingula has a gene for SPARC all right, but nothing remotely resembling an SCPP gene.

I mentioned that the shell of Lingula is built largely on collagen, but it turns out that it¬†isn’t “our” kind of collagen. “Collagen”¬†is just a protein¬†with a particular kind of repetitive sequence. Three amino acids (glycine-proline-something else, in case you’re interested) are¬†repeated¬†ad nauseam in the collagen chain,¬†and these repetitive regions let¬†the protein twist into characteristic rope-like fibres¬†that make collagen such a wonderfully tough basis for connective tissue. Aside from the repeats they all share, collagens are¬†a large and diverse bunch. The ones that form most of the organic matrix in bone¬†contain a non-repetitive and rather easily recognised domain at one end, but when Luo et al. analysed the genome and the proteins extracted from the Lingula shell, they found that none of the shell collagens possessed this domain. Instead, most of them had EGF domains, which are pretty widespread in all kinds of extracellular proteins. Based on the genome sequence, Lingula has a whole little cluster of these collagens-with-EGF-domains that probably originated from¬†brachiopod-specific gene duplications.

So, to recap: Lingula is not as conservative as its looks would suggest (never judge a living fossil by its cover, right?) We also finally¬†have actual sequences for lots of its shell proteins, which reveal that when it comes to building shells, Lingula does its own thing. Not much of a surprise, but still, knowing is a damn sight better than thinkin’ it’s probably so. We are scientists here, or what.

I am Very Pleased with this genome. (I just wish it was published five years ago ūüėõ )



*This, interestingly, doesn’t seem to be the general case for all molluscs. Jackson et al. (2010) compared the genes building the pearly layer of snail (abalone, to be precise) and bivalve (pearl oyster) shells, and found that the snail¬†showed no sign of the chitin-making enzymes and silk type proteins that were so abundant in its bivalved cousins. It appears that even within molluscs, different groups have found different ways to make often very similar shell structures. However, all molluscs shells regardless of the underlying genetics are predominantly composed of calcium carbonate.

**You often hear about sharks, or crocodiles, or coelacanths, existing “unchanged” for 100 or 200 or whatever million years, but in reality, 200-million-year-old crocodiles aren’t even classified in the same families, let alone the same genera, as any of the living species. Again, the living coelacanth is distinct enough from its relatives in the Cretaceous, when they were last seen, to warrant its own genus in the eyes of taxonomists. I’ve no time to check up on¬†sharks, but I’m willing to bet the situation is similar. Whether Lingula‘s jaw-dropping 500-million-year tenure on earth is a result of taxonomic lumping or the shells genuinely looking that similar, I don’t know.¬†Anyway, rant over.



Jackson DJ et al. (2010) Parallel evolution of nacre building gene sets in molluscs. Molecular Biology and Evolution 27:591-608

Luo Y-J et al. (2015) The Lingula genome provides insights into brachiopod evolution and the origin of phosphate biomineralization. Nature Communications 6:8301

Lamprey Hox clusters and genome duplications, oh my!

What the hell is up with lamprey Hox clusters?

Lampreys are among the few living jawless vertebrates, creatures that parted evolutionary ways with our ancestors somewhere on the order of 500 million years ago. If you want to know where things like jaws, paired fins or our badass adaptive immune systems came from, a vertebrate that doesn’t possess some of these things and may have diverged from the rest of the vertebrates soon after others originated is just what you need for comparison.

The vertebrate fossil record is pretty rich thanks to us having hard tissues, so a lot can be inferred about these things from the wealth of extinct fishes we have at our disposal. However, there are times when comparisons of living creatures are just as useful, if not more, than examinations of fossils. (Fossils, for example, tend not to have immune systems. ;))

One of the things you absolutely need a living animal to study is, of course, genome evolution. Vertebrates – well, at least jawed vertebrates – are now generally accepted to have the remnants of four genomes. Our long-gone ancestors underwent two rounds of whole genome duplication. Afterwards, most of the extra genes were lost, but evidence for the duplications can still be found in the structure of our genomes, where entire recognisable gene neighbourhoods of our close invertebrate relatives often still exist in up to four copies (Putnam et al., 2008).

Among these neighbourhoods are the four clusters of Hox genes most groups of jawed vertebrates possess. A “normal” animal like a snail or a centipede only has one of these. Since Hox genes are involved in the making of body plans, you have to wonder how suddenly having four sets of them and other developmental “master genes” might have influenced the evolution of vertebrate bodies.

Of course, to guess that, you need to know precisely when these duplications happened. That’s where lampreys come in: their lineage branched off from our definitely quadruple-genomed one after the next closest, definitely single-genomed group. But was it before, between, or after, the two rounds of duplication?

A few years ago, a phylogenetic analysis of 55 gene families by Kuraku et al. (2009) suggested that the lamprey-jawed vertebrate split happened after the 2R. Just this year, the genome of the sea lamprey Petromyzon marinus was finally published (Smith et al., 2013), and its authors agreed that yes, lampreys probably split off from us post-2R. (I don’t entirely get all the things they did to arrive at this conclusion. Groups of linked genes show up again, among other approaches.)

However, that isn’t the whole story, the latest lamprey genomics paper argues (Mehta et al., 2013). The P. marinus genome assembly couldn’t stitch all the Hox clusters properly together. There were two that sat on nice big scaffolds with the whole row of Hox genes and a few of their neighbours, and then there were a bunch of “loose” Hox genes that they couldn’t link to anything (diagram comparing humans and P. marinus below from Smith et al., 2013; the really pale blue boxes under the numbers represent Hox genes):


Given that Hox9 genes exist in four copies in this species, it seems like there may be four clusters. However, in hagfish, the other kind of living jawless vertebrate, a study found Hox genes that seemed to have as many as seven copies (Stadler et al., 2004). Another round of duplication? It wouldn’t be unheard of. Most teleosts, which include most of the things we call “fish” in everyday parlance, have seven Hox clusters courtesy of an extra genome duplication and loss of one cluster*. Salmon and kin have thirteen, after yet another duplication. Maybe hagfish also had another one – but did lampreys? How many more clusters do those lonely Hox genes belong to?

Mehta et al. hunted down the Hox clusters of Japanese lampreys (Lethenteron japonicum), hoping to pin down exactly how many there were. They used large chunks of DNA derived partly from the testicles, where sperm cells and their precursors keep the full genome throughout the animal’s life (lampreys throw away large chunks of the genome in most non-reproductive cells [Smith et al., 2009]). They probed these for Hox genes and sequenced the ones that tested positive. Plus they also got about two-thirds of the full genome together in fairly big pieces. Together, these data allowed them to get a better idea of the mess that is lamprey Hox cluster genomics.

They assembled four whole clusters, including their neighbouring genes, and a partial fifth cluster. A bunch of other genes sat on smaller sequence fragments containing only a couple of Hoxes, or a Hox and a non-Hox, but they were tentatively assigned to a total of eight clusters, eight being the number of different Hox4 genes in the data (no known vertebrate Hox cluster contains more than one Hox4 gene). The L. japonicum equivalents of the 31 publicly available Hox sequences from P. marinus spread out over six of these, which indicates that both species have at least six clusters. Seems like lampreys had another round of genome duplication after 2R? (Summary of L. japonicum Hox clusters from Mehta et al. below.)

But wait, that’s not the end of it.

First of all, although there are undoubtedly four complete Hox clusters in there L. japonicum, the relationships of these clusters to our four are terribly confused. Whether you look at the phylogenetic trees of individual genes, or the arrangement of non-Hox genes on either side of the cluster, only a big pile of what the fuck emerges. Phylogenies are problematic because the unusual composition of lamprey genes and proteins (Smith et al., 2013) could easily throw them off. All the complete lamprey clusters have a patchwork of neighbours that look like a mashup of more than one of our Hox clusters. Might it mean that lampreys’ proliferation of Hox clusters occurred independently of ours? Did we split before 2R after all?

Hox genes are not the only interesting things in a Hox cluster. In the long gaps between them, there are all sorts of little DNA switches that regulate their behaviour. Some of these are conserved across the jawed vertebrates. Mehta et al. aligned complete Hox clusters of humans, elephant sharks and lampreys to look for such sequences – called conserved non-coding elements or CNEs – in the lamprey.

They only found a few, but that’s enough for a bit more head-scratching. Most CNEs in, say, the human HoxA cluster are only found in one elephant shark cluster, and vice versa. Humans have a HoxA cluster, elephant sharks have a HoxA cluster, they’re clearly the same thing, pretty straightforward. Not so for lampreys. Homologues of individual CNEs in the complete lamprey clusters are spread out over all four human/elephant shark clusters. More evidence for independent duplications?

Mehta et al. are cautious – they point out that the silly mix of Hox cluster neighbours in lampreys could just be due to independent post-2R losses, which is plausible if the split between lamprey and jawed vertebrate lineages happened not too long after 2R. There’s also the fact that the weird lamprey sequences are phylogenetic minefields – however, that’s a double-edged sword, since the same caveat applies to analyses that support a post-2R divergence. Then, perhaps the same argument that goes for Hox cluster neighbours could also apply to CNEs. And, of course, this is just Hox clusters. Smith et al.‘s (2013) findings about overall genome structure don’t go away just because lamprey Hox clusters are weird.

So, in summary, thanks, lampreys. Fat lot of help you are! ūüėõ


*Actually, two losses of two separate clusters in two different teleost lineages. Because Hox evolution wasn’t already complicated enough.



Kuraku S et al. (2009) Timing of genome duplications relative to the origin of the vertebrates: did cyclostomes diverge before or after? Molecular Biology and Evolution 26:47-59

Mehta TK et al. (2013) Evidence for at least six Hox clusters in the Japanese lamprey (Lethenteron japonicum). PNAS 110:16044-16049

Putnam NH et al. (2008) The amphioxus genome and the evolution of the chordate karyotype. Nature 453:1064-1071

Smith JJ et al. (2009) Programmed loss of millions of base pairs from a vertebrate genome. PNAS 106:11212-11217

Smith JJ et al. (2013) Sequencing of the sea lamprey (Petromyzon marinus) genome provides insights into vertebrate evolution. Nature Genetics 45:415-421

Stadler PF et al. (2004) Evidence for independent Hox gene duplications in the hagfish lineage: a PCR-based gene inventory of Eptatretus stoutii. Molecular Phylogenetics and Evolution 32:686-694