Skip to main content

Molecular evolution of gustatory receptors in the Anopheles gambiae complex

Abstract

Background

Mosquitoes in the Anopheles (An.) gambiae species complex are major vectors of Plasmodium falciparum malaria. One reason for this is the high anthropophily of the constituent species An. coluzzii, An. gambiae sensu stricto, and An. arabiensis. In contrast, their sister species An. quadriannulatus is highly zoophilic. Anopheles mosquitoes largely rely on chemical cues for host-seeking, which are primarily detected by four chemosensory gene families: olfactory receptors (Ors), ionotropic receptors (Irs), gustatory receptors (Grs), and odorant binding proteins (Obps). Genes from these families that have been implicated in host adaptation show evidence of positive selection in other insect species, including other mosquitoes. As such, we analyzed the molecular evolutionary patterns of the gustatory receptors within the Anopheles gambiae complex, with a particular interest in identifying Grs that show evidence of positive selection in highly anthropophilic species.

Results

We identified sixteen Grs that show evidence of potential positive selection using the McDonald-Kreitman test, including four putative sugar receptors and two Grs with unknown ligands that are relatively highly expressed in chemosensory organs of either An. coluzzii or An. quadriannulatus. In addition, we identified twelve Grs that show evidence of potential purifying selection using the McDonald-Kreitman test, and twelve Grs that may have experienced a selective sweep using the DH test, including three putative sugar receptors and the carbon dioxide receptor Gr24. We also identified both positive and purifying selection in the coastal species An. melas (West Africa) and An. merus (East Africa).

Conclusions

Our results, together with transcriptomic data, identify four Grs as possible candidates for involvement in the evolution of vertebrate host preference in the An. gambiae complex, as may have occurred in the An. farauti complex. They also point to sugar receptors as playing a role in recent adaptation of some of these species. As the vast majority of Grs have unknown functions and much is still unknown about the role of Grs in these species, a more complete interpretation of our data necessitates further characterization of these genes.

Peer Review reports

Background

Mosquitoes in the Anopheles (An.) gambiae complex are the major vectors of malaria in sub-Saharan Africa, and consequently are responsible for the deaths of hundreds of thousands of children every year. One reason for this high mortality is the high anthropophily of two complex members (An. coluzzii and An. gambiae s.s.). As such, elucidating the genetic basis of their human host preference is important for understanding a critical aspect of the biology of these species, but may also provide candidate genes for developing novel control methods. Because chemosensory perception plays a crucial role in host seeking and preference, these gene families have been studied extensively in mosquitoes, particularly through RNA-seq studies [1,2,3,4,5] and their ligand binding properties [6, 7].

In insects, the chemosensory genes comprise four major gene families: the gustatory receptors (Grs), ionotropic receptors (Irs), odorant binding proteins (Obps), and olfactory receptors (Ors), in addition to several others that play minor roles. The GRs, ORs and IRs form ligand-activated ion channels [7]. Unlike ORs, which form heterodimers of the obligate olfactory co-receptor (ORCO) and one specific OR [7], GRs can be multimeric and tend to be co-expressed with multiple other GRs [7], with two to four gustatory neurons per gustatory sensillum [8, 9] and two to five GRs per gustatory neuron [10]. GRs mediate perception of a wide variety of chemical cues, the majority of which are concentrated tastants, rather than airborne volatile chemicals [10].

However, three Grs, AgGrs22-24, together encode the carbon dioxide receptor in An. coluzzii [11]. These receptors are highly conserved among the major disease vectors. Recent work on Aedes CO2 receptors suggests that only two of these receptors are necessary to detect CO2, while the third (the AgGr22 homolog AeGr1) also detects other molecules [11]. Other GRs respond to sugars, salts, pheromones, and bitter compounds in complex ways, with different combinations of Grs eliciting specific responses to particular concentrations of chemicals, and sometimes inhibiting one another [8, 9, 12,13,14,15,16].

Studies in Drosophila have localized Gr expression to the taste organs, the brain, olfactory neurons, digestive tract, as well as non-chemosensory neurons, providing evidence of non-gustatory roles [15, 17,18,19,20,21,22,23]. Although many Grs have been deorphanized in Drosophila, the vast majority of anopheline Grs have unknown ligands. The functional characterization of Grs has been more difficult than for other receptors, due to the inability to heterologously express these genes. As such, while the carbon dioxide receptors and some sugar receptors have been identified in Anopheles, the biological function of most anopheline Grs remains unclear.

Grs are an ancient gene family, being already present in such basal animals as placozoans [24], and deriving from a superfamily that is present in basal eukaryotes [25]. The Grs are therefore the basal member of a larger insect chemosensory receptor superfamily that also includes the Ors, which are derived from within the Grs [26]. This superfamily is characterized by large numbers of lineage-specific expansions throughout Arthropoda [24, 27]. The Gr repertoire tends to be smallest in specialists and largest in generalists [26], and can vary substantially within orders and even genera. Although there are numerous pseudogenizations documented in Drosophila (reviewed in Robertson [26]), the Gr repertoires of both Anopheles and another hematophagous fly genus, Glossina, are much more stable [28, 29].

The evolution of the insect chemosensory gene superfamily tends to follow a birth-and-death model with numerous lineage-specific duplications, followed by pseudogenization and eventual gene loss [27]. While Ors are derived from Grs, the evolutionary dynamics of these two gene families are not identical: Grs have increased replacement divergence (i.e. fixed amino acid changes between species) relative to Ors, as well as lower neutrality indices. This difference could stem from either stronger positive selection or weaker purifying selection in Grs vs. Ors [30, 31]. Grs likely underwent gene duplications followed by subsequent differentiation following speciation events, and exhibit low sequence similarities to one another both within and between species [24].

Positive selection during host shifts associated with speciation has been detected in the Grs of several insect taxa, including multiple Drosophila spp. [30, 32, 33], the butterfly Heliconius melpomene [34], the pea aphid Acyrthosiphon pisum [35], and may have combined with gene family expansion to facilitate adaptive radiation throughout Lepidoptera [36]. For example, in the butterfly Vanessa cardui, frequent Gr duplications occurred in the transition from a specialist to a generalist lifestyle [37]. Gr plasticity in responding to environmental change and a role of mutations in Grs in modulating species-specific behaviors is also found in Drosophila spp. [38], the German cockroach Blattella germanica [39], and the butterfly Papilio xuthus [40].

Nonetheless, positive selection on the whole is thought to play a relatively minor role in the evolution of Grs, with strong purifying selection acting as the dominant evolutionary force in Drosophila and Lepidoptera [27, 41] while most diversification, particularly in the Grs, appears to be due to relaxed purifying selection, rather than positive selection [32]. In addition, selection on chemosensory gene expression may be responsible for altering sensitivity to certain odors, rather than changes in the protein structure resulting in changes in ligand binding affinity [42]. Numerous chemosensory genes, including some Grs, are differentially expressed between members of the An. gambiae complex with different host preferences [1,2,3], which could reflect differential sensitivity to odors. Intriguingly, work in Aedes aegypti has demonstrated that differential expression as well as nucleotide substitutions in a single Or can influence host preference between two closely related subspecies [43]. In addition, 22 genes, including six Grs, show evidence of involvement in vertebrate host preference in the An. farauti complex [44]. One of these Grs is a strong candidate for anthropophily [44].

The An. gambiae complex consists of nine cryptic species with varying host preferences and distributions, six of which are included in this study: the highly anthropophilic major vectors An. coluzzii (formerly M form) and An. gambiae sensu stricto (formerly S form) [45, 46]; An. arabiensis, which is also anthropophilic and a major vector, but which exhibits substantial host choice plasticity and prefers dryer habitats [47, 48]; the range-limited locally important vectors An. melas and An. merus, which are found in brackish habitats in West and East Africa, respectively, and are less anthropophilic than the others [48,49,50]; and finally the non-vector An. quadriannulatus, which feeds primarily on non-human animals, particularly cattle [47]. The species complex split into the following major branches approximately 1.85 Ma: (1) An. coluzzii/An. gambiae s.s., (2) An. merus, (3) An. arabiensis/An. melas/An. quadriannulatus. Approximately 1.47 Ma, An. melas diverged from An. arabiensis/An. quadriannulatus, and the latter two species split approximately 1.28 Ma. An. coluzzii and An. gambiae s.s. diverged much more recently, approximately 0.54 Ma [51].

Because An. arabiensis is both a major vector and a more generalist feeder, multiple studies have assessed this species for genes that could be involved in human host preference, and have identified genes within inversions on chromosomes 2R and 3R that are associated with differences in vertebrate host preference [52, 53]. There is also substantial introgression primarily of autosomal regions between An. arabiensis, An. coluzzii, and An. gambiae s.s., as well as between An. merus and An. quadriannulatus [51]. Selection maintains the 2La inversion in An. arabiensis [54], and it is likely that the other genes which have introgressed into An. arabiensis are adaptive for its shift to anthropophily [51].

Previous work has focused on differential expression of chemosensory genes in the major mosquito sensory organs between the anthropophilic An. coluzzii and zoophilic An. quadriannulatus [1,2,3], as well as the physical ablation of said organs [55]. In the present study, we survey the molecular evolutionary patterns of the Grs in the An. gambiae complex in the context of species-specific host-seeking behaviors. We extracted whole Gr sequences from each of our six focal species from the 16 Anopheles genomes project [29], and screened for evidence of selection using the McDonald-Kreitman test [56] and direction of selection (DoS) [57], as well as the DH and E tests [58]. Most Grs in the An. gambiae complex have unknown ligands, with the exception of the CO2 receptors listed above, the putative sugar receptors Grs 14–21 [59], and the sugar receptor Gr25 [60]. We are particularly interested in identifying Grs that diverged between the anthropophilic An. coluzzii/An. gambiae s.s. clade and the zoophilic An. quadriannulatus. Furthermore, we describe signatures of positive selection in other lineages, as well as evidence of purifying selection and recovery from selective sweeps. We focus especially on genes with notable expression levels in the chemosensory organs of An. coluzzii and An. quadriannulatus. We identify potential evidence of selection in four putative sugar receptors, two Grs with unknown functions that are highly expressed in chemosensory organs of male An. quadriannulatus, as well as several Grs with unknown functions and unknown expression patterns. These results point to multiple avenues for further detailed exploration of anopheline Grs.

Methods

Data

Whole genome sequences for six members of the An. gambiae complex (An. arabiensis, An. coluzzii, An. gambiae s.s., An. melas, An. merus, and An. quadriannulatus) from the 16 Anopheles Genomes Project [29, 61] were downloaded from NCBI. No comparable data exist for the remaining constituent species of the complex, none of which has been extensively studied. Genome data for a total of 96 An. arabiensis, 12 An. coluzzii, 26 An. gambiae, 65 An. melas, 72 An. merus, and 72 An. quadriannulatus specimens were used for analysis.

Variant discovery pipeline

Whole genome sequences were processed according to the GATK 4 Best Practices Workflow for DNA Variant Discovery [62]. Reads were aligned to their respective genomes in BWA [63], with the exception of An. coluzzii, which was aligned to the An. gambiae genome due to the lower quality of the An. coluzzii genome and their mostly shared genetic make-up. All genomes were downloaded from VectorBase [64].

Following mapping in BWA, Picard Tools (http://broadinstitute.github.io/picard) was used to add read group information and mark duplicate reads. A bed file was created to mark intervals corresponding to the genomic coordinates of the genes of interest, which were determined by running a local BLAST search [65] using the An. gambiae gene sequences. In cases where more than one nonconsecutive match was recovered, regions were prioritized by length, e-value, and bit score.

This interval file was used to restrict the use of HaplotypeCaller to only the genes of interest, thereby increasing processing speed. Haplotype Caller was run using gVCF mode to improve the accuracy of physical phasing where possible, and all samples from a given species were processed together through the remaining steps. Indels were excluded from further analysis, as were any reads not meeting the following quality standards: QD < 2.0|| FS > 55.0|| MQ < 40.0|| MQRankSum < -12.5|| ReadPosRankSum < -8.0.

A custom bash script incorporating bcftools [66] was used to screen for low coverage sequences by counting the number of variants genotyped in the entire species dataset, and then comparing individual sequences for the presence of those genotypes (whether variant or non-variant). All individuals with an average of at least 50% of variants successfully genotyped across all genes were used for McDonald-Kreitman analysis, while sequences with missing genotypes were excluded from DH test analyses. bcftools was also used to extract two phased fasta sequences for each individual. BEDTools [67] was used to extract individual genes from each genome sequence. Two data sets were generated using the masked gene data sets: one with full gene sequences including introns and another including only coding sequence (CDS). In the case of genes with multiple splice variants, multiple CDS data sets were generated.

Data analysis

Sequences were obtained for all species for AgGrs1-60 with the following exceptions: a complete Gr6 sequence was not found in the An. arabiensis genome; complete Gr2, Gr10, Gr44, Gr56, and Gr57 sequences were not found in the An. melas genome; complete Gr44, Gr49, Gr53 and Gr57 sequences were not found in the An. merus genome. Gr5, Gr9, and Gr10 were not analyzed due to a low number of available sequences and the presence of premature stop codons.

CDS data sets including all individuals were imported into DnaSP version 6.12.03 [68], which was then used to perform a McDonald-Kreitman test [56] between each pair of species to detect signs of positive selection. All sequences were used for this test, as it relies solely on comparisons between polymorphic sites in coding sequences. Different splice variants were treated as unique genes for this test, but not for any others which do not rely on coding sequences only. Grs were considered to show signatures of positive selection if a significant p-value was calculated in addition to a direction of selection (DoS) value > 0.01, whereas they were considered to show signatures of purifying selection if a significant p-value was combined with DoS < -0.01. Grs with DoS between − 0.01 and 0.01 were considered to show signatures of neutral evolution. These conservative cutoffs were chosen based on simulated performance of DoS, which shows that DoS accurately identifies deviations from neutrality within a given gene [57]. DoS is less susceptible to bias than the neutrality index, particularly when estimates are based on relatively few SNPs, as is likely to be the case in closely related species pairs. DoS is calculated as, DoS = Dn/(Dn + Ds) − Pn/(Pn + Ps) where Dn refers to the number of fixed replacement substitutions between species, Ds refers to the number of fixed synonymous substitutions between species, Pn refers to the number of polymorphic replacement substitutions within a species, and Ps refers to the number of polymorphic synonymous substitutions within a species. DnaSP was also used to generate basic population parameters for each species by the use of a Fu and Li D test [69], from which only the population parameters and not the significance values were considered.

Datasets including the full gene sequence were produced only if at least ten sequences with known genotypes for each variant site existed in a given species. In cases of alternatively spliced genes, the longest possible sequence was used for the DH test. As a result of these more stringent criteria, these datasets were primarily produced for An. coluzzii and An. gambiae s.s., which were sequenced at a higher depth of coverage than the other species in this study. These data sets were exported in fasta format with one outgroup sequence. Where possible, datasets were produced with An. coluzzii and An. gambiae s.s. analyzed both as individual species and as a single clade. These fasta files were loaded in the Readms module of DH (available from https://github.com/drkaizeng/publications-and-software/blob/main/dh/dh.zip), where the following tests were implemented with 10,000 coalescent simulations: D [70], normalized H [58, 71], DH [58], and E [58].

While all of these tests can detect directional selection, DH is unique in its insensitivity to other population genetic forces and is designed to detect evidence of selective sweeps. Tajima’s D can detect balancing or purifying selection but is also sensitive to changes in population size. Fay and Wu’s H is used primarily to detect genetic hitchhiking but is also sensitive to reductions in population size and the presence of population structure. The E test identifies the recovery of genetic diversity following its loss (e.g. following a selective sweep) and is robust to population structure, but sensitive to both background selection and increases in population size [58]. Finally, TCS haplotype networks were generated in POPART [72].

Results

A total of 79 to 260 sequences were obtained for most of the AgGrs1-60 for the six species (Supplemental Table 1). All Grs for which data was available were included in our analyses and data tables, but only those Grs which fulfill the following criteria are discussed in the text: a significant or nearly significant test result in one of our focal species (An. coluzzii, An. gambiae, An. quadriannulatus, and to a lesser extent, An. arabiensis), and either (a) a known or suspected function, or (b) high or differential expression in the chemosensory organs of An. coluzzii and/or An. quadriannulatus.

Fixed differences between species

No fixed differences were identified between An. coluzzii and An. gambiae in any Gr coding sequence (Table 1). Similarly, no fixed differences were found in the majority of Grs between either of these species and An. arabiensis (53 out of 75 Grs for An. coluzzii, and 60 out of 75 Grs for An. gambiae). However, the majority of Grs (between 64 and 71) have fixed differences between these three species and An. quadriannulatus. Furthermore, fixed differences are present at almost every Gr locus between every species pair that includes either An. melas or An. merus.

Table 1 Grs with fixed differences (both synonymous and non-synonymous) between species pairs

McDonald-Kreitman test

Next, the Grs were analyzed using the McDonald-Kreitman (MK) test between every species pair (Supplemental Table 2). Because the large number of tests conducted in this study precludes the conclusive identification of positive selection due to the multiple testing problem, in addition to the high variance produced by comparing sequences with few mutations, all significant MK results are suggestive, but not definitive evidence of selection. While the Benjamini-Hochberg procedure can be used to produce an adjusted p-value and reduce type I error rate, it also substantially decreases power. This is particularly problematic in closely related species pairs with low numbers of fixed differences, in which the power of the MK test is low to begin with. Therefore, unadjusted p-values are given here with the caveat noted above.

When comparing the anthropophilic An. arabiensis, An. coluzzii, and An. gambiae s.s. to the zoophilic An. quadriannulatus, most Grs show signatures of purifying selection (49.3%, 60.9%, and 55.4%, respectively) (Fig. 1A; Table 2). By comparison, signatures of positive selection are detected in 45.5%, 37.5%, and 43.1% of Grs, respectively (Fig. 1A; Table 2). As indicated in Table 1, there are few fixed differences between An. arabiensis and either An. coluzzii or An. gambiae s.s. In both comparisons, the Grs with fixed differences are approximately evenly split between showing signatures of positive selection and signatures of purifying selection (Table 2; Fig. 1B).

When comparing An. melas to An. merus, the majority of DoS values are negative, i.e. consistent with purifying selection (75.0%) (Table 2; Fig. 2A). When comparing An. melas to the three anthropophilic species, purifying selection is more common vs. An. arabiensis (56.5% vs. 29.0% with positive DoS), but positive selection is more common vs. both An. coluzzii (50.8% vs. 42.9% with negative DoS) and An. gambiae s.s. (55.6% vs. 39.7% with negative DoS) (Table 2; Fig. 2). An. merus consistently shows a higher proportion of negative DoS values when compared to the three anthropophilic species (53.0% vs. An. arabiensis, 61.2% vs. An. coluzzii, 58.2% vs. An. gambiae s.s.) (Table 2; Fig. 2). Finally, when comparing An. melas and An. merus to An. quadriannulatus, the most Grs again have negative DoS values consistent with purifying selection (49.2% and 55.2%, respectively) (Table 2; Fig. 2).

Table 2 Prevalence of signatures of positive and purifying selection based on direction of selection between species pairs. Genes with no fixed differences are excluded from these counts
Fig. 1
figure 1

Heat map of direction of selection (DoS) between (A) the three major vector species vs. An. quadriannulatus and (B) An. coluzzii and An. gambiae s.s vs. An. arabiensis. Positive values (consistent with positive selection) are red, while negative values (consistent with purifying selection) are blue and neutral values are yellow. Grs where NI could not be computed because of a lack of fixed non-synonymous differences are represented in grey

Fig. 2
figure 2

Heat map of direction of selection (DoS) between (A) An. melas vs. the other complex species and (B) An. merus vs. the other complex species. Positive values (consistent with positive selection) are red, while negative values (consistent with purifying selection) are blue and neutral values are yellow. Grs where NI could not be computed because of a lack of fixed non-synonymous differences are represented in grey

Genes showing signatures of positive selection when comparing the anthropophilic An. coluzzii and An. gambiae s.s. and the zoophilic An. quadriannulatus are of particular interest as these are candidates to play a role in the divergent host preference between these species. We identify signatures of positive selection as an excess of fixed replacement substitutions according to a significant p-value (< 0.05) on the MK test, with the magnitude indicated by a positive DoS value. Only one of 76 Grs showed signatures of positive selection between these species pairs: the sugar receptor Gr18 in the An. coluzzii– An. quadriannulatus comparison (Table 3). Three additional Grs had marginally significant (0.05 < p < 0.07) excesses of fixed replacement substitutions: Gr21, Gr48, and Gr60 (Table 3). An. arabiensis is also of interest as a relatively anthropophilic close relative of An. quadriannulatus with extensive signatures of introgression with An. coluzzii and An. gambiae s.s [51]. Three Grs showed excesses of fixed replacement substitutions between An. arabiensis and An. quadriannulatus: Gr4, Gr18 (both significant, Table 3), and Gr48 (marginally significant).

In addition to the significant excess of fixed replacement substitutions identified when comparing An. quadriannulatus to An. arabiensis and An. coluzzii, Gr18 also has a positive DoS (0.342) between An. gambiae s.s. and An. quadriannulatus, although the uncorrected p-value was not below 0.05 for this comparison. Interestingly, Gr18 also shows evidence of recovery from a selective sweep in An. gambiae s.s. In addition, haplotype diversity (HD) for Gr18 is lower in An. coluzzii than in An. quadriannulatus, although nucleotide diversity (Ï€) is similar. Despite its known ligand, Gr18 is lowly expressed in both An. coluzzii and An. quadriannulatus chemosensory tissues.

Table 3 Grs with significant (or near-significant) excesses of fixed substitutions and positive direction of selection (DoS), suggestive of positive selection

Gr48 has a marginally significant (p = 0.054) excess of fixed replacement substitutions between An. gambiae and An. quadriannulatus, a nonsignificant excess between An. coluzzii and An. quadriannulatus, and a marginally significant excess (p = 0.06) between An. arabiensis and An. quadriannulatus. Gr48 has no known ligand but is relatively highly expressed in male An. quadriannulatus labella (unpublished data). Furthermore, π and HD are both lower in An. quadriannulatus than in An. gambiae s.s.

Gr60 has a marginally significant (p = 0.053) excess of fixed replacement substitutions between An. coluzzii and An. quadriannulatus and a nonsignificant excess between An. gambiae and An. quadriannulatus, but a nonsignificant lack of fixed replacement substitutions between An. arabiensis and An. quadriannulatus. Gr60 has no known ligand but is relatively highly expressed in male An. quadriannulatus maxillary palps [4]. Like Gr48, both π and HD are lower than in An. coluzzii.

Finally, Gr4 has a significant excess of fixed replacement substitutions between An. arabiensis and An. quadriannulatus, but no fixed replacement substitutions between the latter and either An. coluzzii or An. gambiae. This gene is lowly expressed in both male and female An. coluzzii labella, as well as female An. quadriannulatus labella, but is highly expressed in male An. quadriannulatus labella (unpublished data).

Purifying selection

Ten Grs in the An. gambiae complex show signatures of purifying selection, as determined by the MK test identifying a significant lack of fixed replacement substitutions (Table 4). In addition, two other Grs show marginally significant signatures of purifying selection as determined by the MK test. Only one Gr meeting the criteria identified above is significant: Gr19 shows signatures of purifying selection between An. arabiensis and An. quadriannulatus.

Table 4 Grs with significant (or near-significant lack of fixed substitutions and negative direction of selection (DoS), suggestive of purifying selection

DH test

Selective sweeps can be detected by significantly negative Tajima’s D values, as well as significantly negative Fay and Wu’s H values. However, since both tests are subject to biases from demographic forces, we only consider genes with significantly negative values on the DH test, which is a combination of the two tests and was designed to be robust to the influence of demographic factors [58], to show strong evidence of sweep. As the DH test was only run on fully genotyped sequences, far fewer sequences were available for analysis than for the MK test. For An. coluzzii and An. gambiae s.s., data were available for most Grs: 50 and 53, respectively. An. quadriannulatus data was available for 36 Grs, while much less data was available for the other species: three Grs in An. arabiensis, seven in An. melas, and six in An. merus. As above, An. melas and An. merus are included in data tables but not discussed in the text. Results of all tests are shown in Supplemental Tables 3 and 4.

Table 5 Grs with significant (or near significant) DH values, suggestive of a selective sweep

A total of five Grs show significant evidence of a potential selective sweep based on the DH test, while an additional three Grs show nearly significant (0.05 ≥ p ≥ 0.06) evidence thereof (Table 5). In An. coluzzii, the highly expressed sugar receptor, Gr17, is the only Gr showing significant evidence of sweep. The TCS network of Gr17 in An. coluzzii is somewhat star-shaped, but there is no central high-frequency haplotype, which could be a consequence of the length of time following the sweep (Fig. 3). In An. quadriannulatus, the lowly expressed sugar receptor Gr19 shows significant evidence of a potential selective sweep. The TCS network features a clear star shape with central high-frequency haplotype surrounded by lower-frequency haplotypes, which is a hallmark of sweep (Fig. 4).

Fig. 3
figure 3

TCS Haplotype Network of the sugar receptor Gr17 in An. coluzzii. Each hashed line represents one nucleotide substitution and haplotype nodes are weighted by frequency. There are 17 haplotypes among 24 sequences (π = 0.008 and HD = 0.938)

Fig. 4
figure 4

TCS Haplotype Network of the sugar receptor Gr19 in An. quadriannulatus. Each hashed line represents one nucleotide substitution and haplotype nodes are weighted by frequency. There are 16 haplotypes among 44 sequences (π = 0.002 and HD = 0.943)

E test (Recovery from selective sweep or background Selection)

Two Grs in An. gambiae s.s. show significant E test values, which detect an excess of low-frequency polymorphisms suggestive of recovery from a selective sweep or background selection: Gr18, and Gr24 (Table 6). Gr18 encodes a lowly-expressed sugar receptor, but is adjacent to Gr17, which is a very highly expressed sugar receptor. In addition to its excess of low-frequency variants in An. gambiae (Supplemental Fig. 1), it has an excess of fixed replacement substitutions between An. quadriannulatus and both of the other major vectors, An. arabiensis and An. coluzzii. The CO2 receptor Gr24 also has a significant excess of low-frequent variants (Supplemental Fig. 2), which is surprising due to the high sequence conservation of the CO2 receptors and the importance of CO2 as a host cue.

Table 6 Grs with significant (or near significant) Etest values, indicative of recovery from a selective sweep

Discussion

In this study, we examined the selective forces acting on Grs in the An. gambiae complex to determine whether they are likely to play a role in differing vertebrate host preference between constituent species. Our analyses suggest that sixteen Grs, six of which either have known functions or are highly expressed in chemosensory organs, primarily show signatures of positive selection based on comparisons between An. arabiensis, An. coluzzii, An. gambiae s.s., An. melas, An. merus, and An. quadriannulatus using the McDonald-Kreitman test. Our analyses further suggest purifying selection in twelve Grs. Furthermore, the DH test indicates that eight Grs are under the influence of sweep and the E test indicates that four Grs are either recovering from either sweep or subject to background selection. While the evolution of Drosophila Grs and its relationship to ecological adaptations has been extensively studied [27, 30,31,32,33], and there have been several studies on the evolution of Grs in Lepidoptera [34, 36, 37, 41], the mosquito literature on the evolution of chemosensory genes is much more limited [43, 44]. As such, we can compare what we know of Gr evolution in other lineages while incorporating the evolutionary and ecological contexts of the six An. gambiae s.l. species. There is evidence that Grs are under both positive and purifying selection in the An. gambiae complex, which may have contributed to the development of species-specific behavioral ecology, as is seen in other taxa, most notably the An. farauti complex [44]. As in other taxa [27, 31, 41], in most species comparisons within the An. gambiae complex, most Grs have negative DoS values, consistent with purifying selection.

Selection tests, such as branch tests, that can test for selection on specific lineages within a phylogeny are available [73]. These tests examine if an excess of replacement substitutions is found along specific, pre-defined lineages of interest, of even if specific sites within a lineage have such an excess. We applied the branch test and branch-site test to the chemosensory genes within the An. gambiae complex, but found that the signal was biased by the presence of ancestral polymorphisms, and that no reliable inference could be made. Therefore, these tests are not included in the present paper. MK results are most clearly interpreted in the absence of introgression and when the species involved are less closely related than those in the An. gambiae complex. As such, we present these results to identify candidate genes for further analyses, and do not claim to conclusively identify positive selection in any given lineage based on an MK result. Furthermore, detecting positive selection with the MK test is difficult due to the preponderance of purifying selection on protein-coding genes.

In those Grs that show signatures of positive selection (i.e. a significant p-value and a positive DoS value), biological significance is mostly unclear, as they have unknown ligands, excluding two sugar receptors (Grs 18 and 21). However, homologs of three others (Gr4, Gr59, Gr60) were also identified as candidates for being correlated with host preference in the An. farauti complex [44]. Both sugar receptors are expressed in both male and female labella [5] (unpublished data). Two of the Grs exhibiting signatures of positive selection between An. quadriannulatus and An. coluzzii (Gr60) and An. gambiae s.s. (Gr48) are male-biased and An. quadriannulatus-biased in the maxillary palps [4] and labella (unpublished data), respectively. While females engage in several sex-specific behaviors such as blood-feeding, host-seeking, and oviposition, the only male-specific behaviors are swarming [74] and mating. The antennal fibrillae are known to play an important role in male detection of auditory cues in both swarming and close-range mating behaviors in the An. gambiae complex, and the male claspers recognize if females have mated, but other organs have not been implicated in male mating [74, 75]. While neither the maxillary palps nor labella have been established as playing a role in mating biology in Anopheles, there is evidence that the maxillary palp detects female inhibitory cues in Drosophila [76], which raises the possibility that a similar phenomenon might occur in Anopheles, particularly as Drosophila Grs expressed in the labellum and tarsi are known to play a role in inhibiting male-male courtship [77]. The labellum is well-established in a male mating role in Drosophila [77]. While work on the role (if any) of the tarsi and male mouthparts in anopheline mating is ongoing, there is evidence that females in mixed-species swarms mate assortatively [78], although males are not thought to do so [75], despite the ability to detect females who have already mated.

Aside from Grs 18, 21, 48, and 60, most other Grs with MK test results consistent with positive selection are lowly expressed in chemosensory organs or show signatures of selection in species with uncharacterized transcriptomes. As such, it is more difficult to explain their biological relevance, but they may be expressed in other organs [15, 17,18,19,20,21,22,23]. With the exception of the CO2 and sugar receptors, the ligands of anopheline Grs are unknown. This is in contrast to Drosophila, where they are known to also perceive bitter compounds and cuticular hydrocarbons, as well as mediate light and heat avoidance [8, 9, 12,13,14,15,16, 22, 23]. Furthermore, Grs are difficult to deorphanize, and most anopheline Grs do not have known Drosophila homologs.

With respect to the Grs that show signatures of purifying selection (i.e. a significant p-value and a negative DoS value), one (Gr19) is a sugar receptor, one (Gr22) is a CO2 receptor, and homologs of three are candidates for a host preference association in the An. farauti complex (Gr13, Gr22, and Gr39). Since only Gr19 and Gr22 have known ligands, it is not currently possible to explain why deleterious mutations would be purged in the other Grs under purifying selection. It is intuitive that deleterious mutations in Gr22 would result in a fitness cost, but less clear why this was only detected in the comparison between An. melas and An. merus.

Eight Grs have DH test results consistent with recent selective sweeps (Table 5). As discussed above, for Grs that are expressed in the mouthparts of either An. coluzzii or An. quadriannulatus, this could mean that these Grs are involved in the adaptation to hosts. However, of these eight Grs, only the sugar receptor Gr17 is highly expressed. Its expression, like that of Gr18, is not sex-biased, so the adaptation of these genes presumably does not underly any sex-biased behaviors. An obvious role for Gr17 is nectar-feeding. Nectar is the only resource fed on by adult males, but also prolongs the lives and reduces blood-feeding frequency of adult females [79]. While the other Grs with significant DH test results consistent with selective sweeps are lowly expressed, many of them are located near genes encoding critical cellular functions or other Grs which are more highly expressed and also show evidence of positive selection. These selective signatures could therefore be due to hitchhiking. However, interestingly, four of these Grs (Gr12, Gr36, Gr41, Gr59) have homologs that are candidates for an association with vertebrate host preference in the An. farauti complex, with the strongest evidence for such an association in Gr36 (based on phylogenetic analysis) and Gr41 (which shows evidence of intensified selection in all zoophilic lineages within this complex) [44]. Since Gr41 shows significant evidence of sweep in An. quadriannulatus, it is possible that this Gr plays a role in zoophily in both species complexes.

Four Grs show E test results consistent with recovery from a selective sweep, although the E test is also sensitive to background selection [58]. Gr57, the Gr that may play a role in anthropophily in both the An. farauti and An. gambiae complexes [44], has a near-significant result in An. coluzzii. Besides Gr18, the CO2 receptor Gr24 has a significant E test value in An gambiae s.s. The CO2 receptors are highly conserved across insects [11], and there are no fixed differences in Gr24 between An. arabiensis, An. coluzzii, An. gambiae s.s., and An. quadriannulatus, although there are some when compared to An. melas and An. merus. CO2 is thought to matter less as a host-seeking cue in the anthropophilic members of the complex than in more zoophilic or generalist species in the complex [47]. This lack of fixed differences suggests background selection against deleterious mutations as an explanation for this significant E test result.

While we have high-quality variant data from An. coluzzii and An. gambiae s.s. for these 57 Grs, data for the other four species are of a much lower quality, as indicated by the overall sequencing coverage, the number of variants detected, and the number of complete sequences. While we have sufficient data to draw conclusions about evolutionary patterns, particularly with respect to the zoophilic non-vector An. quadriannulatus, a more comprehensive analysis of evolutionary patterns that differentiate the other species in the complex from one another would require either deeper coverage of the whole genome sequences of the remaining species, or a more targeted sequencing approach, such as amplicon sequencing or molecular inversion probes, which would allow precise sequencing of the Grs at a much greater depth. The state of the other genomes also inhibits our ability to conclusively assess the size of the Gr repertoire within different members of the species complex, as we cannot be sure whether they are actually absent from the genome, or simply not covered/assembled.

Of the Grs showing significant signatures of selection, Gr41, Gr57, Gr59, and Gr60 are the most attractive targets for further study, given that they have previously been identified as candidates for association with vertebrate host preference in An. farauti s.l. and significantly differ from neutral expectations in this analysis when comparing anthropophilic to zoophilic species (Gr60), within anthropophilic species (Gr57, Gr59), or within zoophilic species (Gr41). Gr59 is expressed in both male and female antennae and labella of both An. coluzzii and An. quadriannulatus [2,3,4,5], so could be a viable knockdown target for behavioral or electrophysiological assays to better elucidate its biological role. Similarly, Gr60 is An. quadriannulatus-biased in both male and female maxillary palps [2, 4], and could be treated in the same way. Neither Gr41 nor Gr57 have been detected in An. coluzzii or An. quadriannulatus mouthparts, so determining their functional significance will first require studies to determine if they are expressed in other organs or life stages before any more targeted work can occur.

The sugar receptors Grs 14–21 are closely clustered on the 2R chromosome, and three of them show signatures of directional selection in either an anthropophilic species or An. quadriannulatus. Though their function is not understood to the same degree as in Drosophila, the potential presence of positive selection in them merits further study, as these may be involved in the recent adaptive divergence between species in this complex, including potentially divergence in host preference or preferred habitat, as it could relate to the ability to successfully exploit preferred nectar resources, or to evaluate sugar sources which are found in greater abundance near preferred vertebrate hosts.

There are several limitations to our current knowledge that preclude more specific hypotheses for the biological importance of most of the Grs that show significant signatures of selection in this study. First, transcriptomic data is lacking for species other than An. coluzzii and An. quadriannulatus, as well as for other organs that are known to express Grs in Drosphila, including the tarsi, wings, and internal organs such as the brain and midgut [15, 17,18,19,20,21,22,23]. There is a similar dearth of transcriptomic data on Grs in larval Anopheles, though studies have characterized their Or and Ir repertoires, as well as behavioral responses to odorants [80]. Research on Aedes aegypti larvae has shown that they rely on chemokinesis to navigate chemical gradients, and has further suggested that they likely rely on Grs and Irs rather than Ors to do so [81]. As such, characterization of Gr expression profiles in larval Anopheles could potentially illuminate biological meaning for some lowly expressed Grs in adults.

Even within Grs that have known ligands or expression profiles, signatures of selection do not necessarily reflect selection on the Gr in question, and may instead reflect hitchhiking due to other linked genes that are unrelated to vertebrate host preference. Therefore, functional studies of these Grs are needed to determine what role, if any, they play in determining host preference. In addition, GRs are frequently co-expressed [8,9,10], meaning that it is difficult to disentangle their roles from one another.

Conclusions

In this study, we have presented the first analysis of the molecular evolution of gustatory receptors within the Anopheles gambiae complex of malaria vectors, including six species with genomic data available. We have identified 16 Grs that with signatures suggesting positive selection within this complex, six of which either have known functions or are highly expressed in the chemosensory organs of either the highly anthropophilic An. coluzzii or the zoophilic An. quadriannulatus. In addition, we have identified twelve Grs that may be undergoing purifying selection and twelve Grs that may be under the influence of sweep. Based on signatures of selection and gene expression data, this study identifies four Grs as possible candidates in the adaptation to distinct vertebrate hosts in the An. gambiae complex, as may have occurred with homologous genes in the An. farauti complex. However, further elucidating this question will require additional study.

Data availability

All sequences used in this study are publicly available from NCBI. A full list of accession numbers can be found in Supplementary Table 5.

References

  1. Rinker DC, Zhou X, Pitts RJ, Rokas A, Zwiebel LJ. Antennal transcriptome profiles of anopheline mosquitoes reveal human host olfactory specialization in Anopheles gambiae. BMC Genomics. 2013;14:749–749.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  2. Athrey G, Cosme LV, Popkin-Hall Z, Pathikonda S, Takken W, Slotman MA. Chemosensory gene expression in olfactory organs of the anthropophilic Anopheles coluzzii and zoophilic Anopheles quadriannulatus. BMC Genomics. 2017;18:751–751.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  3. Athrey G, Popkin-Hall Z, Cosme LV, Takken W, Slotman MA. Species and sex-specific chemosensory gene expression in Anopheles coluzzii and An. quadriannulatus antennae. Parasit Vectors. 2020;13:212–212.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  4. Athrey G, Popkin-Hall ZR, Takken W, Slotman MA. The expression of chemosensory genes in male maxillary palps of Anopheles coluzzii (Diptera: Culicidae) and An. quadriannulatus. J Med Entomol. 2021;58:1012–20.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Saveer AM, Pitts RJ, Ferguson ST, Zwiebel LJ. Characterization of chemosensory responses on the labellum of the malaria vector mosquito, Anopheles coluzzii. Sci Rep. 2018;8:5656–5656.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Pitts RJ, Derryberry SL, Zhang Z, Zwiebel LJ. Variant ionotropic receptors in the malaria vector mosquito Anopheles gambiae tuned to amines and carboxylic acids. Sci Rep. 2017;7:40297–40297.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  7. Joseph RM, Carlson JR, Chemoreceptors. A Molecular Interface Between the Chemical World and the Brain. Trends Genet. 2015;31:683–95.

  8. Weiss LA, Dahanukar A, Kwon JY, Banerjee D, Carlson JR. The molecular and cellular basis of bitter taste in Drosophila. Neuron. 2011;69:258–72.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Dweck HKM, Carlson JR. Molecular logic and evolution of bitter taste in Drosophila. Curr Biol. 2020;30:17–e303.

    Article  CAS  PubMed  Google Scholar 

  10. Amrein H. Mechanism of taste perception in Drosophila. In: Zufall F, Munger SD, editors. Chemosensory transduction: the detection of odors, tastes, and other chemostimuli. Elsevier; 2016. pp. 245–69.

  11. Kumar A, Tauxe GM, Perry S, Scott CA, Dahanukar A, Ray A. Contributions of the conserved insect carbon dioxide receptor subunits to odor detection. Cell Rep. 2020;31:107510–107510.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  12. Bray S, Amrein H. A putative Drosophila pheromone receptor expressed in male-specific taste neurons is required for efficient courtship. Neuron. 2003;39:1019–29.

    Article  CAS  PubMed  Google Scholar 

  13. Miyamoto T, Amrein H. Suppression of male courtship by a Drosophila pheromone receptor. Nat Neurosci. 2008;11:874–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Delventhal R, Carlson JR. Bitter taste receptors confer diverse functions to neurons. eLife. 2016;5:1–23.

    Article  Google Scholar 

  15. Fujii S, Yavuz A, Slone J, Jagge C, Song X, Amrein H. Drosophila sugar receptors in sweet taste perception, olfaction, and internal nutrient sensing. Curr Biol. 2015;25:621–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Sung HY, Jeong YT, Lim JY, Kim H, Oh SM, Hwang SW, et al. Heterogeneity in the Drosophila gustatory receptor complexes that detect aversive compounds. Nat Commun. 2017;8:1484–1484.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Dunipace L, Meister S, Mcnealy C, Amrein H. Spatially restricted expression of candidate taste receptors in the Drosophila gustatory system. Curr Biol. 2001;:822–35.

  18. Miyamoto T, Slone J, Song X, Amrein H. A Fructose receptor functions as a nutrient sensor in the Drosophila brain. Cell. 2012;151:1113–25.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Scott K, Brady R, Cravchik A, Morozov P, Rzhetsky A, Zuker C, et al. A chemosensory gene family encoding candidate gustatory and olfactory receptors in Drosophila. Cell. 2001;104:661–73.

    Article  CAS  PubMed  Google Scholar 

  20. Park J-H, Kwon JY. Heterogeneous expression of Drosophila gustatory receptors in enteroendocrine cells. PLoS ONE. 2011;6:e29022–29022.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  21. Thorne N, Amrein H. Atypical expression of Drosophila gustatory receptor genes in sensory and central neurons. J Comp Neurol. 2008;506:548–68.

    Article  CAS  PubMed  Google Scholar 

  22. Ni L, Bronk P, Chang EC, Lowell AM, Flam JO, Panzano VC, et al. A gustatory receptor paralogue controls rapid warmth avoidance in Drosophila. Nature. 2013;500:580–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Xiang Y, Yuan Q, Vogt N, Looger LL, Jan LY, Jan YN. Light-avoidance-mediating photoreceptors tile the Drosophila larval body wall. Nature. 2010;468:921–6.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Eyun S-I, Soh HY, Posavi M, Munro JB, Hughes DST, Murali SC, et al. Evolutionary history of Chemosensory-Related gene families across the Arthropoda. Mol Biol Evol. 2017;34:1838–62.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Benton R, Himmel NJ. Structural screens identify candidate human homologs of insect chemoreceptors and cryptic Drosophila gustatory receptor-like proteins. Elife. 2023;12:e85537.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Robertson HM. Molecular evolution of the major arthropod chemoreceptor gene families. Annu Rev Entomol. 2019;64:227–42.

    Article  CAS  PubMed  Google Scholar 

  27. Gardiner A, Barker D, Butlin RK, Jordan WC, Ritchie MG. Drosophila chemoreceptor gene evolution: selection, specialization and genome size. Mol Ecol. 2008;17:1648–57.

    Article  CAS  PubMed  Google Scholar 

  28. Macharia R, Mireji P, Murungi E, Murilla G, Christoffels A, Aksoy S, et al. Genome-Wide comparative analysis of chemosensory gene families in five Tsetse fly species. PLoS Negl Trop Dis. 2016;10:1–30.

    Google Scholar 

  29. Neafsey DE, Waterhouse RM, Abai MR, Aganezov SS, Alekseyev MA, Allen JE, et al. Highly evolvable malaria vectors: the genomes of 16 Anopheles mosquitoes. Science. 2014;347:1258522.

    Article  PubMed  PubMed Central  Google Scholar 

  30. McBride CS, Arguello JR. Five Drosophila Genomes Reveal Nonneutral Evolution and the Signature of Host Specialization in the Chemoreceptor Superfamily. Genetics. 2007;177 vember 2007:1395–416.

  31. Gardiner A, Butlin RK, Jordan WC, Ritchie MG. Sites of evolutionary divergence differ between olfactory and gustatory receptors of Drosophila. Biol Lett. 2009;5:244–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  32. McBride CS. Rapid evolution of smell and taste receptor genes during host specialization in Drosophila Sechellia. Proc Natl Acad Sci U S A. 2007;104:4996–5001.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Diaz F, Allan CW, Matzkin LM. Positive selection at sites of chemosensory genes is associated with the recent divergence and local ecological adaptation in cactophilic Drosophila. BMC Evol Biol. 2018;18:144–144.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Briscoe AD, Macias-Muñoz A, Kozak KM, Walters JR, Yuan F, Jamie GA et al. Female behaviour drives expression and evolution of gustatory receptors in butterflies. PLoS Genet. 2013;9.

  35. Smadja C, Shi P, Butlin RK, Robertson HM. Large gene family expansions and adaptive evolution for odorant and gustatory receptors in the pea aphid, Acyrthosiphon pisum. Mol Biol Evol. 2009;26:2073–86.

    Article  CAS  PubMed  Google Scholar 

  36. Engsontia P, Sangket U, Chotigeat W, Satasook C. Molecular evolution of the odorant and gustatory receptor genes in lepidopteran insects: implications for their adaptation and speciation. J Mol Evol. 2014;79:21–39.

    Article  CAS  PubMed  Google Scholar 

  37. Suzuki HC, Ozaki K, Makino T, Uchiyama H, Yajima S, Kawata M. Evolution of gustatory receptor gene family provides insights into adaptation to diverse host plants in nymphalid butterflies. Genome Biol Evol. 2018;10:1351–62.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Wisotsky Z, Medina A, Freeman E, Dahanukar A. Evolutionary differences in food preference rely on Gr64e, a receptor for glycerol. Nat Neurosci. 2011;14:1534–41.

    Article  CAS  PubMed  Google Scholar 

  39. Wada-Katsumata A, Silverman J, Schal C. Changes in taste neurons support the emergence of an adaptive behavior in cockroaches. Science. 2013;340:972–5.

    Article  CAS  PubMed  Google Scholar 

  40. Ozaki K, Ryuda M, Yamada A, Utoguchi A, Ishimoto H, Calas D et al. A gustatory receptor involved in host plant recognition for oviposition of a swallowtail butterfly. Nat Commun. 2011;2.

  41. Wanner KW, Robertson HM. The gustatory receptor family in the silkworm moth Bombyx mori is characterized by a large expansion of a single lineage of putative bitter receptors. Insect Mol Biol. 2008;17:621–9.

    Article  CAS  PubMed  Google Scholar 

  42. Nourmohammad A, Rambeau J, Held T, Kovacova V, Berg J, Lässig M. Adaptive evolution of gene expression in Drosophila. Cell Rep. 2017;20:1385–95.

    Article  CAS  PubMed  Google Scholar 

  43. McBride CS, Baier F, Omondi AB, Spitzer SA, Lutomiah J, Sang R, et al. Evolution of mosquito preference for humans linked to an odorant receptor. Nature. 2014;515:222–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  44. Ambrose L, Popovic I, Hereward J, Ortiz-Barrientos D, Beebe NW. Comparisons of chemosensory gene repertoires in human and non-human feeding Anopheles mosquitoes link olfactory genes to anthropophily. iScience. 2022;25:104521–104521.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Pates HV, Takken W, Stuke K, Curtis CF. Differential behaviour of Anopheles gambiae sensu stricto (Diptera: Culicidae) to human and cow odours in the laboratory. Bull Entomol Res. 2001;91:289–96.

    Article  CAS  PubMed  Google Scholar 

  46. Coetzee M, Hunt RH, Wilkerson R, della Torre A, Coulibaly MB, Besansky NJ. Anopheles coluzzii and Anopheles amharicus, new members of the Anopheles gambiae complex. Zootaxa. 2013;3619:246–74.

    Article  PubMed  Google Scholar 

  47. Dekker T, Takken W. Differential responses of mosquito sibling species Anopheles arabiensis and An. quadriannulatus to carbon dioxide, a man or a calf. Med Vet Entomol. 1998;12:136–40.

    Article  CAS  PubMed  Google Scholar 

  48. Sinka ME, Bangs MJ, Manguin S, Coetzee M, Mbogo CM, Hemingway J, et al. The dominant Anopheles vectors of human malaria in Africa, Europe and the middle East: occurrence data, distribution maps and bionomic précis. Parasit Vectors. 2010;3:117–117.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Bryan JH. Anopheles gambiae and A. melas at Brefet, the Gambia, and their role in malaria transmission. Ann Trop Med Parasitol. 1983;77:1–12.

    Article  CAS  PubMed  Google Scholar 

  50. Mutero CM, Mosha FW, Subra R. Biting activity and resting behaviour of Anopheles merus Donitz (Diptera: Gulicidae) on the Kenya Coast. Ann Trop Med Parasitol. 1984;78:43–7.

    Article  CAS  PubMed  Google Scholar 

  51. Fontaine MC, Pease JB, Steele A, Waterhouse RM, Neafsey DE, Sharakhov IV, et al. Extensive introgression in a malaria vector species complex revealed by phylogenomics. Science. 2015;347:1258524–1258524.

    Article  PubMed  Google Scholar 

  52. Petrarca V, Beier JC. Intraspecific chromosomal polymorphism in the Anopheles gambiae complex as a factor affecting malaria transmission in the Kisumu area of Kenya. Am J Trop Med Hyg. 1992;46:229–37.

    Article  CAS  PubMed  Google Scholar 

  53. Main BJ, Lee Y, Ferguson HM, Kreppel KS, Kihonda A, Govella NJ, et al. The genetic basis of host preference and resting behavior in the major African malaria vector, Anopheles arabiensis. PLoS Genet. 2016;12:e1006303.

    Article  PubMed  PubMed Central  Google Scholar 

  54. White BJ, Hahn MW, Pombi M, Cassone BJ, Lobo NF, Simard F, et al. Localization of candidate regions maintaining a common polymorphic inversion (2La) in Anopheles gambiae. PLoS Genet. 2007;3:e217–217.

    Article  PubMed  PubMed Central  Google Scholar 

  55. Popkin-Hall ZR, Slotman MA. The role of the major chemosensory organs in the host-seeking activity of Anopheles coluzzii (Diptera: Culicidae). J Med Entomol. 2024. https://doiorg.publicaciones.saludcastillayleon.es/10.1093/jme/tjae062

    Article  PubMed  Google Scholar 

  56. McDonald JH, Kreitman M. Adaptive protein evolution at the Adh locus in Drosophila. Nature. 1991;351:652–4.

    Article  CAS  PubMed  Google Scholar 

  57. Stoletzki N, Eyre-Walker A. Estimation of the neutrality index. Mol Biol Evol. 2011;28:63–70.

    Article  CAS  PubMed  Google Scholar 

  58. Zeng K, Fu YX, Shi S, Wu CI. Statistical tests for detecting positive selection by utilizing high-frequency variants. Genetics. 2006;174:1431–9.

    Article  PubMed  PubMed Central  Google Scholar 

  59. Kent LB, Walden KKO, Robertson HM. The Gr family of candidate gustatory and olfactory receptors in the yellow-fever mosquito Aedes aegypti. Chem Senses. 2008;33:79–93.

    Article  CAS  PubMed  Google Scholar 

  60. Freeman EG, Wisotsky Z, Dahanukar A. Detection of sweet tastants by a conserved group of insect gustatory receptors. Proc Natl Acad Sci U S A. 2014;111:1598–603.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  61. Miles A, Harding NJ, Bottà G, Clarkson CS, Antão T, Kozak K, et al. Genetic diversity of the African malaria vector Anopheles gambiae. Nature. 2017;552:96–100.

    Article  Google Scholar 

  62. Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, del Angel G, Levy-Moonshine A et al. From FastQ Data to High-Confidence Variant Calls: The Genome Analysis Toolkit Best Practices Pipeline. 2013.

  63. Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–60.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  64. Giraldo-Calderón GI, Emrich SJ, MacCallum RM, Maslen G, Emrich S, Collins F, et al. VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases. Nucleic Acids Res. 2015;43:D707–13.

    Article  PubMed  Google Scholar 

  65. Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990;215:403–10.

    Article  CAS  PubMed  Google Scholar 

  66. Li H. A statistical framework for SNP calling, mutation discovery, association mapping and population genetical parameter Estimation from sequencing data. Bioinformatics. 2011;27:2987–93.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  67. Quinlan AR, Hall IM. BEDTools: A flexible suite of utilities for comparing genomic features. Bioinformatics. 2010;26:841–2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Rozas J, Ferrer-Mata A, Sánchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, et al. Mol Biol Evol. 2017;34:3299–302. DnaSP 6: DNA Sequence Polymorphism Analysis of Large Data Sets.

  69. Fu YX, Li WH. Statistical tests of neutrality of mutations. Genetics. 1993;133:693–709.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  70. Tajima F. Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics. 1989;123:585–95.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  71. Fay JC, Wu CI. Hitchhiking under positive darwinian selection. Genetics. 2000;155:1405–13.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  72. Leigh JW, Bryant D. POPART: Full-feature software for haplotype network construction. Methods Ecol Evol. 2015;6:1110–6.

    Article  Google Scholar 

  73. Yang Z. PAML 4: phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24:1586–91.

    Article  CAS  PubMed  Google Scholar 

  74. Charlwood JD, Jones MDR. Mating in the mosquito, Anopheles gambiae S.l. II. Swarming behaviour. Physiol Entomol. 1980;5:315–20.

    Article  Google Scholar 

  75. Charlwood JD, Jones MDR. Mating behaviour in the mosquito, Anopheles gambiae S.l. I. Close range and contact behaviour. Physiol Entomol. 1979;4:111–20.

    Article  Google Scholar 

  76. Stocker RF, Gendre N. Courtship behavior of Drosophila genetically or surgically deprived of basiconic sensilla. Behav Genet. 1989;19:371–85.

    Article  CAS  PubMed  Google Scholar 

  77. Moon SJ, Lee Y, Jiao Y, Montell C. A Drosophila gustatory receptor essential for aversive taste and inhibiting Male-to-Male courtship. Curr Biol. 2009;19:1623–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  78. Dabire KR, Sawadodgo S, Diabate A, Toe KH, Kengne P, Ouari A, et al. Assortative mating in mixed swarms of the mosquito Anopheles gambiae s.s. M and S molecular forms, in Burkina Faso, West Africa. Med Vet Entomol. 2013;27:298–312.

    Article  CAS  PubMed  Google Scholar 

  79. Impoinvil DE, Kongere JO, Foster WA, Njiru BN, Killeen GF, Githure JI, et al. Feeding and survival of the malaria vector Anopheles gambiae on plants growing in Kenya. Med Vet Entomol. 2004;18:108–15.

    Article  CAS  PubMed  Google Scholar 

  80. Sun H, Liu F, Baker AP, Honegger HW, Raiser G, Zwiebel LJ. Neuronal odor coding in the larval sensory cone of Anopheles coluzzii: complex responses from a simple system. Cell Rep. 2021;36:109555–109555.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  81. Lutz EK, Grewal TS, Riffell JA. Computational and experimental insights into the chemosensory navigation of Aedes aegypti mosquito larvae. Proc Biol Sci. 2019;286:20191495–20191495.

    CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

Giridhar Athrey provided helpful advice on the variant calling pipeline. Adrian Castellanos, Curtis Cooper, and Jenna Hull helped diagnose coding errors. This work was conducted as part of ZRPH’s doctoral dissertation, which was supervised by a committee including Giridhar Athrey, J. Spencer Johnston, Christine Merlin, and Aaron M. Tarone.

Funding

This manuscript was not supported by any particular funding.

Author information

Authors and Affiliations

Authors

Contributions

The project was conceived by MAS. ZRPH and MAS designed the study, analyzed the data, and wrote the final manuscript. Bioinformatic pipelines were developed by ZRPH. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Zachary R. Popkin-Hall.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License, which permits any non-commercial use, sharing, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if you modified the licensed material. You do not have permission under this licence to share adapted material derived from this article or parts of it. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc-nd/4.0/.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Popkin-Hall, Z.R., Slotman, M.A. Molecular evolution of gustatory receptors in the Anopheles gambiae complex. BMC Ecol Evo 25, 22 (2025). https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12862-025-02359-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doiorg.publicaciones.saludcastillayleon.es/10.1186/s12862-025-02359-x

Keywords