Since the last common ancestor of these two species existed 450 million years ago, the authors reasoned that non-coding sequences conserved between them are likely to be fundamental to vertebrate development. Through sequence alignment with increasingly strict criteria, they identified 1,373 highly conserved non-coding elements (CNEs), with an average length of about 200 base pairs. The average sequence match is 84%: not perfect, but much higher than for coding regions shared by humans and pufferfish. A quick check showed that virtually all the sequences also occurred in rodents, chickens, and zebrafish, but not in the nematode, fruitfly, or even the sea squirt, a primitive non-vertebrate chordate.
CNEs are not spread uniformly throughout the genome. Instead, they are bunched together in fewer than 200 clusters, most of them in close proximity to genes implicated in transcriptional regulation or development. This clustering of CNEs suggests they may not only attract transcription factors, but may also influence the local topology of the DNA, thereby controlling access to their associated gene. Several clusters also appear in regions without any known genes—the identification of these clusters might lead to the discovery of new developmentally significant gene