1.8 M長的嗜血桿菌(Haemophilus influenzae)基因組帶有1471個攝取訊號序列(uptake signal sequence USS) 。完整的嗜血桿菌攝取訊號序列為一個29 base長的序列,當中有9 base長的序列(AAGTGCGGT)是完全保守的。嗜血桿菌中有66%的攝取訊號序列嵌入在1738個已知或猜測的基因中。攝取訊號序列出現在嗜血桿菌基因組中的頻率為一個9 base長序列平均出現在嗜血桿菌基因組頻率的200倍。完整的攝取訊號序列總共大約佔了嗜血桿菌基因組長度的2.4%。基因組裡要存在這麼多的攝取訊號序列必須在維持細胞正常運作及攝取功能的演化上達到一定程度上的利益平衡。在此我們就從擁有這些攝取訊號序列所需付出的代價作定性的研究。我們分析結果是,細菌以各種可能的方法儘量減低它們擁有眾多攝取訊號序列所付出的代價。第二部分的分析之中,我們想要知道攝取訊號序列的演化是否穩定。嗜血桿菌跟巴斯德桿菌的共同祖先出現在兩億七千萬年前。在比對攝取訊號序列之後,我們發現有大約百分之六十的攝取訊號序列是屬於同源的序列,所以我們推測攝取訊號序列的演化是穩定而且緩慢的。 The 1.8 Mbp genome of Haemophilus influenzae carries 1471 copies of its DNA uptake signal sequence (USS), a 29 bases long oligonucleotide including a 100% conserved 9-base core, AAGTGCGGT, 66% of the which are embedded in the 1738 known and putative genes of the genome. The frequency of occurrence of the 9-base (core) USS is about 200 times statistical expectation. Together with its flanks, the USS in H. influenzae take up 2.4% of the otherwise streamlined genome. The embedding of so many USS in the genome is a penalty to the organism that must be balance by benefit to the organism derived from maintaining and evolving the uptake system. As a first step to understanding this balance we want to characterize and determine this cost. We find that at every possible instance the genome tried to minimize the cost. In the second part, we want to characterize the stability of USS evolution. H. influenzae and P. multocida have a common ancestor 270 million years ago. We find that about 60% USSs are homologs and conjecture that the evolution of USS has been stable and slow.