遗传多态性和选择

fly

PDF

  多态性在进化中扮演的角色引起了争论。但是如果多态性从根本上遵循中性进化,那么可以用其作为参照来研究自然选择。生态学家也将多态性用于保护生物学中重建物种历史。

1. 突变,随机漂变和中性进化

  多态性包含突变,这些突变在细胞分裂时逃脱了DNA修复系统。因此突变率是一个生物学变量。在人类和黑猩猩中,每一世代产生的核苷酸核酸的基本元素如DNA或RNA。它包含一个核酸基团(或含氮碱基),和一个五碳原子组成的糖,成为戊糖,它们的结合形成一个核苷,最后是一到三个磷酸基团。突变率约为μ≈10-8。雄性哺乳动物产生大量的精子,这意味着雄性生殖系比雌性生殖系的细胞分裂从干细胞到配子的所有细胞更多:30岁时是380:23(即16倍以上),并且随着雄性的年龄增长会更多(50岁时是840:23,即36倍以上)。这意味着,在这些物种中突变主要涉及到雄性,且取决于父亲的年龄。每次生育都会在每个基因组生物体的遗传物质。它包含编码蛋白质的遗传信息。大多数生物体基因组与DNA相对应。然而一些称为逆转录病毒(例如HIV)的病毒中,其遗传物质是RNA。中产生约100个新的突变,但因为基因组中只有很小一部分在编码描述基因翻译蛋白质的那部分DNA或RNA。只代表基因来源的一部分,以及注册的mRNA。,所以99%的突变不影响生存和繁殖能力。这些突变被称为中性突变。一个新的等位基因对于两个同源基因,当它们具有不同的形状,在给定的观察水平下可以分辨彼此则成为等位基因。因此一个等位基因对应于一条单一序列,或一组虽然是不同的,但在表型水平上是不可分辨的序列(例如蓝色/棕色/绿色的眼睛颜色,但在核苷酸水平上,有更多不同的等位基因,数种颜色)。可以是中性的有害的或者是有益的。中性突变得到最多研究,这是因为其允许编写预测模型来研究种群历史。中性突变的分布也可以作为一个无效假设指的是基本观点,对于给定现象的默认位置。一般来说,反对无效假说的假说有举证责任。,通过比较来解释有害或有益的突变。

  可以认为,在一个只包含中性等位基因的基因组中,等位基因频率的漂移将弥补另一个等位基因的波动,等位基因多样性H将长期保持稳定。但是,这种印象是错误的。多样性将会逐渐遭到侵蚀。这一现象与人类姓氏多样性的丧失非常相似。在与世隔绝的人群中,比如偏远村庄,姓氏多样性的丧失是一种缓慢但重要的现象。一个没有男孩的家庭无法将家族姓氏传承下去。虽然该姓氏可以由亲戚家庭传承下去,但随着这个家族人口减少,该姓氏消亡的可能性就会增大。这显然与男孩出生时携带Y染色体的任何生物学特性无关。概率就足以解释这一点。这一特性反映了一个事实,即亲代群体的子代遵循随机抽取替换在一个装有n个代币的盒子中进行连续的抽签,取第一个代币,读取其价值,将其放回盒子中,取第二个代币,读取其价值,将其放回盒子中,等等,直到第p个代币。这意味着重复(可以多次选择相同的对象)并按顺序(选择对象的顺序很重要)在n个代币中选择p个代币。在n中连续抽出代币的次数为:n×n×n×…×n=np。的原则。

  和Y染色体一样,亲代的一些基因并没有衍生,也没有在子代群体中发现。如果一个固定规模种群的后代基因是随机抽取的,则未抽取的基因的概率由参数为1的泊松分布给出,即q(0) = e-1 = 0.367。这些未抽取的基因(超过三分之一)没有遗传就消失了。它们的缺失被那些碰巧留下了更多后代的亲代基因所补偿。如果情况不是这样,祖先的谱系就会保持平行,但从不相遇。过去的祖先谱系的分型与现在相比,在多样性的丧失上没有区别,在所谓的血缘关系上也没有区别。

测量多样性的公式(1)具有一个有用属性:它取决于样本量估计。当子代“t+1”从亲代“t”中抽取n个基因作为样本,根据公式E(Hs) = Ht (1 – 1/n),子代表现出的变异丢失为1/n。因为子代是随机抽取替换的,即使子代群体规模大于亲代群体,这一特性也同样适用,但是种群越大,多样性受到的侵蚀就越少。所有现实中的种群规模都有限,这就足够了。按照惯例,遗传学家认为,这种变异丢失为1/Ne,其中Ne是指染色体的有效数目[1]。因此从第一代到第二代:

E(H2) = H1 (1 – 1/Ne) (3)

  有效大小几乎总是比染色体数量的实际大小低得多,具体原因将在下文讨论。例如,据估计,在过去的人类谱系中,染色体的有效数量约为10000条。如果没有突变,该公式表明种群在时间T(期望)后将变成单态:

E(T) = 2 Ne (4)

  这将导致两个结果:首先,一个物种的多态性在其持续时间范围内总是“最近的”。因为多态性依赖突变,尽管伴随着等位基因频率的漂移多样性遭到侵蚀,但突变能恢复多态性。第二,多态性水平是两种对立机制的折中,形成了中性突变漂移平衡

多态性随着时间的推移而消失可以反过来表示:我们往回追溯,会发现同一位点基因在染色体上的位置。在群体遗传学中,对于所有同源基因(同源类),如果两条染色体或两个基因在减数分裂时相互匹配并排除对方,则它们是同源的。的两个基因总是有一个最近的共同祖先。这就是约翰·金曼(John Kingman)所说的溯祖过程。不同位点上的祖先不一样,因为性别使祖先的数量倍增,因此基因的共同祖先也不一样。如果上一代拥有共同祖先的概率q = 1/Ne始终不变,那么祖先的分布遵循指数定律t = q . e-qt。这些祖先的期望年龄等于Ne。如果从那以后没有发生突变,两个基因在遗传学上是相似的。但只要在从祖先传承到这两个基因的过程中,其中一个谱系发生了突变,就足以让这两个基因都成为等位基因。由此推断,这两个基因之间核苷酸差异的数量为θ = Ne × 2µ,其中µ中性突变率。用公式θ = 2Neμ来定义的参数θ是群体遗传学的一个基本参数。

环境百科全书-遗传多态性和选择-一些种群结构模型
图1. 一些种群结构模型。椭圆表示种群,实线箭头表示种群间的迁移率,虚线箭头表示种群之间由于随机漂变造成的差异。在星形系统发育和奠基效应模型中,分化是随时间一直持续的。在其他模型中,种群内部的随机漂变和迁移导致的种群同质化之间存在着平衡。观察到的种群对之间的遗传变异和TSFs模式可以表明给定物种的历史是否与上述模型之一或多或少相似。

自然种群的中性进化在保护生物学中非常重要,因为这能帮助重建物种的历史。遗传学家早就知道,随机遗传漂变能帮助推断种群分化和物种结构空间模型(图1)。在20世纪下半叶,研究种群构建亚种群最常用的指标是FST,其公式为:

FST = 1-HS/HT(5)

  其中HS为各亚种群多样性的平均值,HT为总种群多样性[2]

环境百科全书-遗传多态性和选择-溯祖和人口结构变化
图2. 溯祖和人口结构变化。来自同一位点的6个基因样本(白色圆圈)在三个具有不同历史的群体中被用于谱系研究,图中的灰色方框代表了群体的规模:规模不变(左)、瓶颈后的近期扩张(中)和更早的扩张(右)。时间(以代为单位)可以追溯到位于图表顶部的过去。突变用黑色圆形(“内部”分支的突变,和样本中的几个基因是相同的)或黑色星形( “外部”分支的突变,在样本中是独一无二的,因为它们导致样本中单个基因的出现)表示。我们看到,样本中共同祖先的年龄分布因不同条件而有较大差异。系谱的外部(末端)分支与内部分支的长度比是不同的。对于一个恒定的突变率,随着时间的推移,外部和内部突变的相对比例将会不同。样本中罕见/频繁突变的比例是用来重建种群历史的指标之一。其他多态性指标(包括文中看到的H和π)也具有这个属性(temps passé 过去的时间)

  在21世纪数字基因组分析的时代,金曼(Kingman)、哈德逊(Hudson)和田岛(Tajima)在1982-83年独立提出了溯祖理论[3],除了用于研究种群结构,还能确定种群是否保持了稳定或发生了人口统计学上的变化(图2)。

2. 中性模型和生物多样性管理

图1和图2说明了遗传变异全貌如何受到种群历史的影响:空间结构、殖民、迁移、种群变化等,这些事件都赋予了物种分子多态性特定的标识,生态学家从事生态学研究。生态学家的工作是研究生物体与周围世界的关系。不应与生态保护论者混淆,他们发起保护生态的运动。也可以凭此追溯种群历史。在第四纪(即当前的地质时期),全球气候发生了周期性变化,这导致了海岸线、生物群丛和冰川的南北转移,以及各纬度的干湿气候时期的周期性变化。在采取任何自然种群管理措施之前,种群生物学家系统地记录了种群的移动、减少、增加和入侵,以及物种对环境变化的响应指数。目前,群体遗传学主要应用于保护生物学领域。

3. 有害突变

由于基因编码蛋白质,编码区域的大多数突变都会修改蛋白质序列(约占突变的3/4,这个比例根据序列的组成而变化)。在人类谱系中,这些变化中大约40%是有害突变,也就是说,当评估从黑猩猩谱系中分离出来的基因组进化时,这些变化就消失了。如果突变为中性,某一天它将有1/Ne的机会取代在这个位点的其他基因(在有效大小为Ne的种群中,其他基因的总比例是1-1/Ne,每个基因也都有1/Ne的机会取代其他所有基因)。但突变可能有害,会影响携带者的健康或繁殖能力。突变的频率可能通过随机漂移波动几代,然后通过选择而消失(在果蝇中平均为40代)。一个物种的所有成员都是有害突变的携带者。所有成员都不例外。它们几乎都是杂合子描述一个生物体的同一基因在每条同源染色体上的同一位点具有两个不同的等位基因。,因为假如一个突变率为1/1000,那么它在纯合子中的代表数比在杂合子中的代表数就少1000倍。杂合子消除突变的不利影响较为轻微,而纯合子的不利影响往往大得多。由于几个位点上的有害突变的影响累加,突变负担成为一个数量变量,其加性效应可能无法检测,但在长期内仍然有效,可以永久清除基因组。这就解释了为什么蛋白质仍然有功能,且有害突变率仍然较低。这可能是解释维持遗传重组的众多因素之一。这就让组合有害突变以消除它们和在染色体相邻区域限制其消除产生的影响成为可能。

4. 有利突变

60%的突变影响蛋白质,却不产生有害影响。这些突变是什么?就像影响基因组其他区域的突变一样,它们可以是“中性的”,也就是说,在特定环境下,在物种的特定交流系统中,这些突变对健康或繁殖能力没有任何影响。其频率在自然种群中随机波动。但如果条件改变,这些突变可能有利。它们是达尔文(Darwin)设想中的自然选择性别选择的一部分,也是选择这个词的最初意思,即人类对家养物种的选择。所选的多态性分为两类:瞬时多态性和平衡多态性。

环境百科全书-遗传多态性和选择-选择性扫描
图3. 选择性扫描。重组使得基因组上相邻区域的进化有可能被分离。如果没有选择,中性多态性将通过一个非常缓慢的平衡过程沿着染色体达到一个可比值。当一个突变在某个区域具有优势且频率变为1时,这个过程会非常快,并且它会扫除该区域的中性多态性,但不会扫除相邻区域的中性多态性。通过对比扫描区域和中性区域的中性多态性水平,可以肯定确实是选择在前者中起了作用,并排除了认为“适应的就是我们所看到的”的循环推理。本例显示了模拟果蝇X染色体上两个相邻区域的选择性扫描。它们使识别基因的这两个复合体成为可能,这两个复合体同时作用于改变果蝇后代的孟德尔基因比例(参见 [6])(所谓的“自私”基因)。在这两个区域(SR1和SR2),选择消除了中性多态性。
(régions codantes 编码区;déficit de polymorphisme neutre 中性多态性缺乏;polymorphisme neuter(π) 中性多态性(π);position des genes sur le chromosome 基因在染色体上的位置)
  瞬时多态性是指一个有利突变通过消除可选等位基因而逐渐自我“修复”,这个过程可能导致该基因频率为1。例如,蚊子的杀虫剂抗性基因、细菌的抗生素抗性基因和疟疾寄生虫的抗疟药物抗性基因。这些突变在自然条件下也许没有优势,但在施加药物的环境下,这些等位基因的频率就会增加。调节乳糖酶表达的三种等位基因也是如此,乳糖酶不仅能让人类像其他哺乳动物一样,在新生儿时期消化乳糖,在成人时期也能消化乳糖。这些突变在牲畜群体中变得有益,而我们的狩猎-采集祖先只有在成年后才有机会消化果糖(蔗糖)。在所有这些瞬时多态性的案例中,与选择相关的位点遭到基因组中的一个分子标签的“背叛”:其频率迅速扩大,这让染色体上相邻的中性突变消失。这是一个选择性扫描的例子,我们可以确定等位基因的固定不是基于随机漂变,而是基于选择(图3,[4])。

环境百科全书-遗传多态性和选择-平衡多态性
图4. 平衡多态性。当自然选择维持两个等位基因的共存时,它们的序列分歧越来越大,以至于这两个基因之间积累的突变比基因组其余部分的中性模型预测的要多。图中的例子是果蝇tan基因上的两个等位基因(坐标0),这两个等位基因维持了浅色腹部和黑色腹部雌性果蝇大约300万年的共存。这些颜色模式涉及雄性和雌性交配过程之间的交流,但它们互相不能消除彼此,可能是因为它们的选择优势在频率过大时减弱了。黑白横线:染色体间差异的期望值:蓝色:实际观测值(参见 [7])(Divergence 分化程度;femelle noire 黑色雌性;femelle claire 浅色雌性)

  平衡多态性是指两个等位基因共存的情况,因为每一个基因在特定条件下都有优势,但在所有的时间或空间情境下,这两个基因又不能胜过对方。例如基因型的选择优势由于其反向频率而增加。这被称为频率依赖选择。这种平衡多态性的情况在性别选择的案例中经常出现(图4,[5])。

5. 多态性是有用的吗?

  20世纪30年代至60年代,自然种群遗传学家在自然界中发现了越来越多的多态性。他们想要评估多态性的程度和发现其在进化方面的潜在用途。那些认为遗传多样性对其本身有利,而自然选择将其维持在较高水平的研究人员,和那些认为选择导致在一个有限规模种群[6]中形成一个表型一个相当同质的野生个体所有可观察到的特征,其余的变异相当有害。他们中没有一个是对的。法国人 gustave malécot 在1950年代就已经证明,中性多态性是孟德尔定律的结果【Lois关于生物遗传的原则,由捷克僧侣和植物学家孟德尔(1822-1884)提出】的研究人员观点对立。最终在1966年,研究人员发现了极高水平的分子多态性,这无法单独用自然选择解释[7]。于是,日本的木村(Kimura)和太田(Ohta)提出了中性理论[8]。人们意识到,达尔文自然选择理论的替代品不是物种的固定性(这也是达尔文的反对者所持观点),而是中性模型预测的持续基因变化,这类似于物理学扩散现象中的随机漫步。这一观点最终在20世纪80年代得到接受。然而,与繁殖种群的大小相比,所有物种的有效种群大小都非常低,这表明外界力量侵蚀遗传多样性的程度远远超过中性模型的预测。尽管研究人员的估计还不够准确,这种侵蚀在一定程度上是由于自然选择,自然选择消除了有害突变,固定了有利突变,从而增加了漂移对中性变异的影响。虽然研究对物种的未来极其重要,但所选多态性肯定只占了多态性案例的一小部分。

  中性分子多态性为研究群体选择和种群历史提供了基础理论和参考模型。矛盾之处在于,从今以后,研究人员需要应用中性理论,以便在基因组中寻找自然选择的分子标签。

维持重组系统的选择性力量、孟德尔定律(Mendel’s laws)[9],以及性行为的基因混合都是考虑多态性的参数,它们以这种方式维持多态性而多态性在自然种群中具有短期优势。

 


参考资料及说明

[1] 直到2000年,有效大小仍表示的是个体有效大小,而不是染色体有效大小。因此,如果繁殖时雌雄个体数量相同,常染色体的有效大小为2Ne,X染色体的有效大小为1.5Ne,Y染色体和线粒体的有效大小为0.5Ne。这些公式可在手册中找到。

[2] 这个公式在这里非常普遍,根据其所使用的遗传模型有几种形式和名称:Wright的FST(两个等位基因),Nei的GSTFST的推广,上面公式5的变体),ΦSTρST,等等。它可以被类似性质的统计数据所替代:DXY, AMOVA。这种冗余首先表明了“F-统计学”在生态学中的成功。由于估计值依赖于样本量,无偏估计值的使用也必须考虑到观测设计的特殊性。Cf: Weir B.S. & Cockerham C.C. (1984) Estimating F-statistics for the analysis of population structure. Evolution 38:1358-1370

[3] Kingman J.F.C. (1982) On the genealogy of large populations. Journal of Applied Probability 19A:27-43; Hudson R.R. (1983) Properties of a neutral allele model with intragenic recombination. Theoretical Population Biology 23:183-201; Tajima F. (1983) Evolutionary relationship of DNA sequences in finite populations. Genetics 105:437-460.

[4] Derome N., K. Métayer, C. Montchamp-Moreau & M. Veuille (2004) Signature of selective sweep associated with the evolution of sex-ratio drive in Drosophila simulans. Genetics 166: 1357-1366; Derome N., E. Baudry, D. Ogereau, M. Veuille & C. Montchamp-Moreau (2008) Selective sweeps reveal a two-locus model for sex-ratio meiotic drive in Drosophila simulans. Molecular Biology and Evolution, 25:409-416.

[5] Yassin A., Bastide H., Chung H., Veuille M., David J.R. & Pool J.E. (2016) Ancient Balancing Selection at tan Underlies Female Colour Dimorphism in Drosophila erecta. Nature Communications DOI: 10.1038/ncomms10400.

[6] Malecot G. (1948) The mathematics of heredity. Masson et Cie; Nagylaki T. (1989) Gustave Malécot and the transition from classical to modern population genetics. Genetics 122, 253-268.

[7] Lewontin R.C. & Hubby J.L. (1966) Molecular Approach to the Study of Genic Heterozygosity in Natural Populations. II. Amount of Variation and Degree of Heterozygosity in Natural Populations of Drosophila pseudoobscura, Genetics 54: 595-609; Lewontin R.C. (1974) The Genetic Basis of Evolutionary Change. Columbia Univ. Press, New York.

[8] Kimura M. (1969) The Rate of Molecular Evolution Considered from the Standpoint of Population Genetics. Proceedings of the National Academy of Sciences, 63:1181-1188.

[9] http://uel.unisciel.fr/biologie/analgen/analgen_ch01/co/learn_ch1_01_01_01.html


环境百科全书由环境和能源百科全书协会出版 (www.a3e.fr),该协会与格勒诺布尔阿尔卑斯大学和格勒诺布尔INP有合同关系,并由法国科学院赞助。

引用这篇文章: VEUILLE Michel (2024年1月28日), 遗传多态性和选择, 环境百科全书,咨询于 2024年9月15日 [在线ISSN 2555-0950]网址: https://www.encyclopedie-environnement.org/zh/vivant-zh/genetic-polymorphism-and-selection/.

环境百科全书中的文章是根据知识共享BY-NC-SA许可条款提供的,该许可授权复制的条件是:引用来源,不作商业使用,共享相同的初始条件,并且在每次重复使用或分发时复制知识共享BY-NC-SA许可声明。

Genetic polymorphism and selection

fly

Polymorphism has caused controversy about its role in evolution. But if it essentially follows a neutral evolution, it serves as a reference, in contrast, for the study of natural selection. It is also used by ecologists in conservation biology to reconstruct the past history of species.

1. Mutations, random drift and neutral evolution

Polymorphism consists of mutations that escape DNA repair systems over cell divisions. Their rate of appearance is therefore a biological variable. In humans and chimpanzees, it is µ ≈ 10-8 mutations by nucleotideBasic element of a nucleic acid such as DNA or RNA. It is composed of a nucleic base (or nitrogenous base), a ose with five carbon atoms, called pentose, whose association forms a nucleoside, and finally one to three phosphate groups. and by generation. The considerable amount of sperm produced by male mammals means that there is much more cell division in male germ lineAll cells from stem cells to gametes than in female germ line: 380 against 23 at age 30 (i.e. 16 times more), and even more so when men age (840 against 23 at age 50, i.e. 36 times more). This means that in these species the mutations are mainly produced in male lines and depend on the age of the father. Each birth produces about 100 new mutations per genomeGenetic material of a living organism. It contains genetic information encoding proteins. In most organisms, the genome corresponds to DNA. However, in some viruses called retroviruses (e.g. HIV), the genetic material is RNA., but because only a small part of the genome is codingDescribes the part of the DNA or RNA of a gene translated into protein. Represents only a part of the gene from which it originates, as well as the mRNA in which it is registered., 99% of them have no effect on survival or fertility. They are called neutral. A new allelTwo homologous genes are called alleles when they have different shapes, distinguishable at a given level of observation. An allele can therefore correspond to a single sequence, or to a set of sequences that are different but not distinguishable at the phenotype level. (e.g. blue/brown/green eye colour but at the nucleotide level there are many more different alleles, several per colour). can be neutral, harmful or advantageous. Neutral mutations are the most studied, as they allow predictive models to be written to explore population history. Their distribution also serves as a null hypothesisRefers to the basic point of view, to the default position regarding a given phenomenon. In general, hypotheses opposing the null hypothesis have the burden of proof. to interpret, by comparison, that of deleterious or beneficial mutations.

We could think that in a genome comprising only neutral alleles, the drift of allelic frequencies would compensate for one fluctuation on the other and that the allelic diversity H would remain stable in the long term. But this impression is false. Gradually, diversity is eroding. This phenomenon is very similar to the loss of diversity of family names, a slow but significant phenomenon in human isolates such as remote villages. When a family does not have a boy, it does not pass on its surname. The same surname can be transmitted by related families, but the smaller the population, the greater the probability of names being lost. This is obviously not due to any biological property of the Y chromosome, which accompanies male births. Chance is enough to explain it. This property reflects the fact that the constitution of a daughter generation from a parental population follows the principle of a draw with replacementDrawing successively with delivery of tokens in an urn containing n tokens, means taking a first token, reading its value, putting it in the urn, taking a second token, reading its value, putting it in the urn, etc. until the p-th token. This means choosing p objects among n with repetition (you can choose the same object several times) and in order (the order in which you choose the objects is important). The number of successive draws with tokens among n is: n × n × n × … × n = np..

Like the Y chromosomes, some genes of the parental generation are not derived, and are not found in the daughter population. If the genes of the progeny are randomly drawn in a population of constant size, the probability that genes are not drawn is given by a Poisson’s law of parameter 1 as q(0) = e-1 = 0.367. These undrawn genes (more than a third) disappear without offspring. Their absence is compensated by parental genes which, by chance, have left more descendants. If this were not the case, the ancestral lines would remain parallel without ever meeting. The grouping of ancestral lines when going back in time is no different from a loss of diversity when going down to the present, nor is it different from what is called consanguinity.

The measurement of the diversity of formula (1) has a useful property: it depends on the sample size on which it is estimated. When a daughter population “t+1” is sampled by drawing n genes from a parental generation “t” the daughter generation shows a loss of variation equal to 1/n, according to the formula E(Hs) = Ht (1 – 1/n). This is true even if the daughter population is larger than the mother population, since it is a draw with replacement, but the larger the population, the less diversity is eroded. It is sufficient that the population be finite in size, which is what all real populations are. By convention, geneticists refer to this loss of variation as 1/Ne, where Ne is referred to as the effective number of chromosomes [1]. Thus from a generation 1 to a generation 2:

E(H2) = H1 (1 – 1/Ne)   (3)

The effective size is almost always much smaller than the actual size of the chromosomes, for reasons that will be discussed later. For example, it is estimated that in the past of the human lineage, the effective number of chromosomes was in the order of 10,000. If there were no mutations, it is shown that the population would become monomorphic after a time T, of hope:

E(T) = 2 Ne    (4)

There are two consequences to this: first, the polymorphism of a species is always “recent” on the scale of the duration of a species, since it depends on mutations that have restored the polymorphism despite the erosion of diversity that accompanies the drift of allelic frequencies. Second, the level of polymorphism is a compromise between two opposing mechanisms, creating the neutral mutation-drift balance.

The disappearance of polymorphism over time can be expressed in the opposite direction: when we go back in time, there is always a last common ancestor between two genes of the same locusPosition of the gene on the chromosome. In population genetics, all homologous genes (homology class). Two chromosomes or two genes are homologous if they match and exclude each other at meiosis.. This is what John Kingman called the coalescence process. The ancestor is not the same for different locus, because sexuality multiplies the number of ancestors, therefore also the common ancestors of genes. If the probability of having an ancestor common to the previous generation q = 1/Ne, remains constant over time, the distribution of ancestors follows an exponential law t = q. e-qt. The age expectation of these ancestors is equal to Ne. Two genes will be genetically similar if no mutation has occurred since then. But it is enough that a mutation has occurred in one of the lines leading from the ancestor to each of the two genes for both genes to be alleles. It can be deduced that the number of nucleotide differences between these two genes is θ = Ne x 2µ, where µ is the neutral mutation rate. This θ value, defined as θ = 2Neµ, is a fundamental parameter of population genetics.

Encyclopedie environnement - polymorphisme - modele structuration population - polymorphism
Figure 1. Some models of population structure. The continuous arrows represent migration rates between populations, represented by ellipses. The interrupted arrows represent the differentiation between populations due to random drift. In star phylogeny and the founding effect, the divergence is continuous over time. In other models, there is a balance between random drift within populations and homogenization of populations by migration. Observed patterns of genetic variation and TSFs between population pairs indicate whether the history of a given species is more or less similar to one of these scenarios.

The neutral evolution of natural populations is very important in conservation biology, as it allows the history of species to be reconstructed. Geneticists have long known that random genetic drift allows them to infer models of population differentiation and species structure in space (Figure 1). During the second half of the 20th century, the most commonly used indicator to study the structuring of a population into sub-populations was the FST of the formula:

FST = 1-HS/HT     (5)

where HS is the average of the diversities of the sub-populations and HT is the diversity of the total population [2].

Encyclopedie environnement - polymorphisme - Coalescence et changements demographiques - coalescence and demographic changes - polymorphism
Figure 2. Coalescence and demographic changes. The genealogy of a sample of 6 genes (white circles) from the same locus is examined in three populations with different demographic histories where the framing represents the size of the population: constant size (left), recent expansion after a bottleneck (centre) and old expansion (right). Time (in generations) goes back to the past at the top of the diagram. Mutations are represented by black circles (mutations of the “internal” branches, common to several genes in the sample) or by black stars (mutations of the “external” branches, unique in the sample because they lead to a single gene in the sample). We see that the age distribution of the common ancestors in the sample is very different depending on the conditions. The length ratio between the external (terminal) and internal branches of genealogy is different. This will result, for a constant mutation rate over time, in a different relative proportion of external and internal mutations. This ratio of rare/frequent mutations in the sample is one of the indicators used to reconstruct the history of the population. Other sets of polymorphism indicators (including H and π, seen in the text) also have this property.

In the 21st century, the age of numerical genome analysis, the theory of coalescence [3], independently developed by Kingman, Hudson and Tajima in 1982-83, makes it possible, in addition to studying structuring, to determine whether populations have remained stable or have undergone demographic changes (Figure 2).

2. Neutral model and biodiversity management

Figures 1 and 2 illustrate how genetic variation profiles are affected by population history: spatial structuring, colonization, migration, population change are all events that impart a specific signature in the molecular polymorphism of species, and allow ecologistswork in ecology. The job of an ecologist is to study the relationships between organisms and the surrounding world. Should not be confused with the ecologist, who campaigns to protect ecology. to trace its history. During the Quaternary era – the current geological period – the world’s climates changed cyclically, resulting in periodic changes in the coastline, a north-south shift of biological associations and glaciers, and periods of wet or dry climate at all latitudes. The resulting movements, decreases, increases, invasions of populations, indices of species’ responses to changes in their environment, are systematically recorded by population biologists before any natural population management initiative is undertaken. Most of the applications of population genetics today are in conservation biology.

3. Harmful mutations

Because genes code for proteins, most mutations in coding regions modify the protein sequence (about 3/4 of the mutations, a proportion that varies according to the composition of the sequence). In the human lineage, about 40% of these changes are deleterious, i.e. they are missing when the evolution of the genome of this species is assessed since its separation from the chimpanzee lineage. If a mutation were neutral, it would have a 1/Ne chance of replacing the other genes present at this locus one day (in a population of effective size Ne, the other genes taken together are in a 1-1/Ne proportion, and each also has a 1/Ne chance of replacing all the others). But a mutation can be harmful and affect the health or fertility of the individuals who carry it. Its frequency may fluctuate for a few generations by random drift before disappearing by selection (forty generations on average in Drosophila). All members of a species are carriers of deleterious mutations. You and I are. They are almost always in the heterozygousstate. This characterizes an organism that has two different alleles of the same gene at the same locus for each of its homologous chromosomes., because if a mutation has a frequency, for example, of 1/1000, it will have a thousand times fewer representatives in the homozygotestate, characterized by an organism that has two identical alleles of this gene at the same locus for each of its homologous chromosomes. than in the heterozygote state. It is the slight disadvantage of heterozygotes that eliminates the mutation rather than the often much greater disadvantage of the homozygous. Since the effects of deleterious mutations on several locus are cumulative, the mutation burden becomes a quantitative variable like any other whose additive effects may be undetectable, but nevertheless effective over the long term to purge the genome permanently. This explains why proteins remain functional and harmful mutations remain of low frequency. They are probably one of the factors that explain the maintenance of genetic recombination. This makes it possible both to group harmful mutations together to eliminate them and to limit the consequences of their elimination on adjacent regions of the chromosomes.

4. Advantageous mutations

What are the 60% of mutations affecting proteins without deleterious effect? Like mutations affecting other regions of the genome, they can be “neutral”, i.e. without any effect on health or fertility in a particular environment and in a particular communication system of a species. Their frequency fluctuates randomly in natural populations. But if conditions change, they can be advantageous. They are then part of the natural selection and sexual selection imagined by Darwin, but also of the selection in the first sense of the word, i.e. the selection made by man on his domestic species. There are two types of polymorphism selected: transient polymorphism and balanced polymorphism.

Encyclopedie environnement - polymorphisme - Balayage selectif - polymorphism
Figure 3. Selective scanning. Recombination makes it possible to decouple the evolution of adjacent regions of the genome. If there was no selection, neutral polymorphism would reach comparable values along the chromosome through a very slow equilibrium process. When a mutation is advantageous in a region and sets at the frequency of 1, the process is very fast, and it sweeps away the neutral polymorphism of that region, but not that of adjacent regions. The contrast of the level of neutral polymorphism in the scanned regions and in the neutral regions makes it possible to affirm that it is indeed the selection that has acted in the former, and excludes the circular reasoning that would admit that “what is adapted is what we see”. This example shows two contiguous areas of selective scanning on the X chromosome of Drosophila simulans. They make it possible to identify two complexes of genes that act simultaneously to modify for their benefit the Mendelian proportions in the offspring of fruit flies (see ref [6]) (so-called “selfish” genes). In these two zones (SR1 and SR2), selection has eliminated neutral polymorphism.
Transient polymorphism is the case of an advantageous mutation that gradually “fixes” itself by eliminating alternative alleles, which can lead to a frequency of 1. This is the case, for example, of insecticide resistance genes in mosquitoes, antibiotic resistance in bacteria and antimalarial drug resistance in the malaria parasite: these mutations would probably not have had an advantage under natural conditions, but in the environmental context imposed by medicine, these alleles increase in frequency. This is also the case for the three alleles that regulate the expression of lactase, an enzyme that allows humans to digest milk sugar (lactose) not only in the newborn state, as in other mammals, but also in adults. These mutations have become beneficial in livestock populations, while our hunter-gatherer ancestors only had the opportunity to digest fruit sugar (sucrose) as adults. In all these cases of transient polymorphism, the locus to which the selection relates is “betrayed” by a signature in the genome: the rapid expansion of its frequency makes the adjacent neutral variation on the chromosome disappear. This is a case of selective scanning, which makes it possible to affirm that the fixation of an allele is not due to random drift, but to selection (Figure 3, [4]).

Encyclopedie environnement - polymorphisme - Polymorphisme equilibre - polymorphism
Figure 4. Balanced polymorphism. When natural selection maintains the coexistence of two alleles, their sequences diverge more and more, to the point of accumulating more mutations between them than the neutral model for the rest of the genome predicts. This is the case for two alleles of the Drosophila tan gene (coordinate 0), whose two alleles have maintained the coexistence of females with light or black abdomen for about three million years. These patterns of coloration are involved in communication between males and females during mating, but neither can eliminate the other, probably because their selective advantage decreases when they become too frequent. Interrupted line: expected value of the divergence between chromosomes; in blue: value actually observed (see ref [7]).
Balanced polymorphism refers to situations where two alleles coexist because each is favoured under certain conditions, but where neither can prevail over the other in all circumstances of time or space. An example is given by cases where the selective advantage of a genotype increases due to its inverse frequency. This is called frequency-dependent selection. Such situations of balanced polymorphism are frequent in cases of sexual selection (Figure 4, [5]).

5. Is polymorphism useful?

In the 1930s to 1960s, natural population geneticists discovered an increasing number of polymorphisms in nature. They wanted to assess its extent and discover its potential utility in terms of evolution. Debates opposed researchers who considered that genetic diversity conferred an advantage in itself and that selection maintained it at high levels, to researchers who considered that selection led to a phenotypeAll the observable characteristics of a fairly homogeneous wild individual, the remaining variations being rather harmful. None of them were right. The Frenchman Gustave Malécot had already demonstrated in the 1950s that neutral polymorphism was a consequence of Mendel’s {tooltip}laws{ind-text}Lois concerning the principles of biological heredity, set out by the Czech monk and botanist Gregor Mendel (1822-1884). in a finite size population [6]. It was finally the discovery in 1966 of extremely high levels of molecular polymorphism, which could not be explained by natural selection alone [7], that allowed the Japanese Kimura and Ohta to put forward the neutralist theory [8]. It was realized that the alternative to Darwin’s theory of natural selection was not the fixity of species (as thought by Darwin’s opponents, for example) but a continuous genetic change predicted by the neutral model, similar to the random walk of a diffusion phenomenon in physics. This vision was definitively accepted in the 1980s. However, the very low value of the effective population size measured in all species, compared to the reproductive population size, indicates that forces are eroding genetic diversity much more than neutralist models predict. This erosion is due in part, still poorly estimated, to natural selection, which eliminates harmful mutations and fixes advantageous variations, and thus increases the effects of drift on the neutral variation. Although extremely important for the future of the species, the selected polymorphisms certainly represent only a small fraction of the cases of polymorphism.

Neutral molecular polymorphism provides the basic theory, the reference model, from which the selection and history of populations are studied. The paradox is that, from now on, the molecular signatures of natural selection are sought in the genome using neutralist theory.

The existence of selective forces that maintain the recombination system, Mendel’s laws [9], and the genetic mixing of sexuality is an argument for considering that polymorphism, which they maintain in this way, has a short-term advantage in natural populations.

 


References and notes

[1] Until about 2000, the effective size was expressed in individuals and not in chromosomes, so the effective size of the chromosomes was 2Ne for autosomes, 1.5Ne for ,X chromosomes, and 0.5Ne for Y chromosomes and mitochondria, provided that the number of males and females at breeding is the same. These formulas can be found in manuals.

[2] This formula, here very general, takes several forms and denominations according to the genetic model used: Wright’s FST (for two alleles), Nei’s GST (its generalization, of which formula 5 above is a variant), ΦST,ρST, etc. It can be replaced by statistics with similar properties: DXY, AMOVA. This redundancy shows above all the success of “F-statitics” in ecology. Because of the dependence of the estimate on the sample size, the use of unbiased estimators must also take into account the particularities of the observation design. Cf: Weir B.S. & Cockerham C.C. (1984) Estimating F-statistics for the analysis of population structure. Evolution 38:1358-1370

[3] Kingman J.F.C. (1982) On the genealogy of large populations. Journal of Applied Probability 19A:27-43; Hudson R.R. (1983) Properties of a neutral allele model with intragenic recombination. Theoretical Population Biology 23:183-201; Tajima F. (1983) Evolutionary relationship of DNA sequences in finite populations. Genetics 105:437-460.

[4] Derome N., K. Métayer, C. Montchamp-Moreau & M. Veuille (2004) Signature of selective sweep associated with the evolution of sex-ratio drive in Drosophila simulans. Genetics 166: 1357-1366; Derome N., E. Baudry, D. Ogereau, M. Veuille & C. Montchamp-Moreau (2008) Selective sweeps reveal a two-locus model for sex-ratio meiotic drive in Drosophila simulans. Molecular Biology and Evolution, 25:409-416.

[5] Yassin A., Bastide H., Chung H., Veuille M., David J.R. & Pool J.E. (2016) Ancient Balancing Selection at tan Underlies Female Colour Dimorphism in Drosophila erecta. Nature Communications DOI: 10.1038/ncomms10400.

[6] Malecot G. (1948) The mathematics of heredity. Masson et Cie; Nagylaki T. (1989) Gustave Malécot and the transition from classical to modern population genetics. Genetics 122, 253-268.

[7] Lewontin R.C. & Hubby J.L. (1966) Molecular Approach to the Study of Genic Heterozygosity in Natural Populations. II. Amount of Variation and Degree of Heterozygosity in Natural Populations of Drosophila pseudoobscura, Genetics 54: 595-609; Lewontin R.C. (1974) The Genetic Basis of Evolutionary Change. Columbia Univ. Press, New York.

[8] Kimura M. (1969) The Rate of Molecular Evolution Considered from the Standpoint of Population Genetics. Proceedings of the National Academy of Sciences, 63:1181-1188.

[9] http://uel.unisciel.fr/biologie/analgen/analgen_ch01/co/learn_ch1_01_01_01.html


环境百科全书由环境和能源百科全书协会出版 (www.a3e.fr),该协会与格勒诺布尔阿尔卑斯大学和格勒诺布尔INP有合同关系,并由法国科学院赞助。

引用这篇文章: VEUILLE Michel (2019年3月23日), Genetic polymorphism and selection, 环境百科全书,咨询于 2024年9月15日 [在线ISSN 2555-0950]网址: https://www.encyclopedie-environnement.org/en/life/genetic-polymorphism-and-selection/.

环境百科全书中的文章是根据知识共享BY-NC-SA许可条款提供的,该许可授权复制的条件是:引用来源,不作商业使用,共享相同的初始条件,并且在每次重复使用或分发时复制知识共享BY-NC-SA许可声明。