Abstract

Coexistence and cooperation between dogs and humans over thousands of years have supported convergent evolutionary processes in the two species. Previous studies found that Eurasian dogs evolved into a distinct geographic cluster. In this study, we used the genomes of 242 European dogs, 38 Southeast Asian indigenous (SEAI) dogs, and 41 gray wolves to identify adaptation of European dogs . We report 86 unique positively selected genes in European dogs, among which is LCT (lactase). LCT encodes lactase, which is fundamental for the digestion of lactose. We found that an A-to-G mutation (chr19:38,609,592) is almost fixed in Middle Eastern and European dogs. The results of two-dimensional site frequency spectrum (2D SFS) support that the mutation is under soft sweep . We inferred that the onset of positive selection of the mutation is shorter than 6,535 years and behind the well-developed dairy economy in central Europe. It increases the expression of LCT by reducing its binding with ZEB1, which would enhance dog's ability to digest milk-based diets. Our study uncovers the genetic basis of convergent evolution between humans and dogs with respect to diet, emphasizing the import of the dog as a biomedical model for studying mechanisms of the digestive system.

Dogs were domesticated between 15,000 and 40,000 years ago and spread around the world alongside humans (Vilà et al. 1997; Germonpré et al. 2009; Ostrander et al. 2019). They have adapted to both natural and human environments through natural and artificial selection (Wang et al. 2013, 2014, 2020; Gou et al. 2014; Li et al. 2014; Freedman et al. 2016; Liu et al. 2018; Wu et al. 2020). For example, AMY2B (amylase-alpha-2B) and MGAM (maltase-glucoamylase) which are under positive selection in dogs are known to play an important role in starch digestion (Axelsson et al. 2013; Wang et al. 2013). Thus, adaptation to a starch-rich diet during the agricultural revolution was crucial to the domestication of dogs. Meanwhile, a new copy of AKR1B1 (aldo-keto reductase family 1 member B) transcript is identified in the dog genome but absent in gray wolf and dhole genomes (Wang, Shao, et al. 2019b). That may enhance de novo fatty acid synthesis and antioxidant capacity in dogs. These genetic changes are indicative of the dog's adaptability to dietary changes during the spread of prehistoric agriculture (Arendt et al. 2016). It is undeniable that human culture heavily impacts the evolution of dogs (Ollivier et al. 2016).

Based on whole-genome sequencing of canids, Eurasian dogs bifurcate into two major genetic groups: European dogs and Southeast Asian indigenous (SEAI) dogs (Frantz et al. 2016; Wang et al. 2016). Another study revealed that Eurasian dogs split into four distinct geographic clusters: Southeast Asia, India, Middle East, and Europe (Botigué et al. 2017). This population structure implies that Eurasian dogs underwent divergent evolution. Recently, 722 canine whole-genome sequences were published (Plassais et al. 2019), expanding the capacity for in-depth investigation of the adaptive evolution in European dogs.

Genetic basis of adaptation to milk-based diets in Europeans has been reported (Hollox et al. 2001; Enattah et al. 2002). Human has a great impact on dogs, including the diets (Axelsson et al. 2013; Arendt et al. 2016; Wang, Larson, et al. 2019a). Here, we have studied the genomes of 242 European dogs, 38 SEAI dogs, and 41 gray wolves to understand the adaptation of European dogs. We discuss how dogs were influenced by the dietary cultures of the European people.

Results and Discussion

Sample Information and Population Structure

Previous study confirmed that the best cost/benefit argument of whole-genome sequencing is two dogs per breed (Dreger et al. 2016). To avoid sampling bias (from 1 to 44 individuals per breed), we picked 5 individuals with the highest genome coverage for every breed with more than 5 individuals. We used all individuals for breeds in which sample size is less than 5. In total, we have 242 European dog sequences including 239 samples from 96 European breeds and 3 Portugal village dogs, 38 SEAI dogs, and 41 gray wolves from the public 722 canine genomes data set ( supplementary table S1, Supplementary Material online) (Plassais et al. 2019). Principal component analysis (PCA) explores the consistency of the genetic structure and the origin information assigned in the Federation Cynologique Internationale (FCI). PC1 and PC2 showed clear three groups: SEAI dogs, European dogs, and gray wolves ( supplementary fig. S1, Supplementary Material online).

PSGs in European Dogs

We performed extended number of segregating sites by length (XP-nSL) (Garud et al. 2015) and cross-population extended haplotype homozygosity (XP-EHH) tests (Sabeti et al. 2007) to scan for positively selected signals in the autosomes. Using empirical P< 0.01 as threshold ( supplementary note, Supplementary Material online), 104 positively selected genes (PSGs) were commonly identified by XP-nSL (363 PSGs) and XP-EHH (429 PSGs) in European dogs. To identify the unique PSGs in European breeds, XP-nSL and XP-EHH were also carried out to detect the PSGs in SEAI dogs. There are 137 PSGs commonly identifying in SEAI dogs by both methods. Out of the 104 PSGs in European dogs, 86 genes ( supplementary table S2, Supplementary Material online and fig. 1) were retained after excluding 137 PSGs detected in SEAI dogs.

Fig. 1

The positive selection analysis of European dogs. Genome scans with XP-EHH (A) and XP-nSL (B). Red dotted line marks empirical P = 0.001. (C) iSAFE analysis across 1.2 Mb around chr19:38,609,592 (red).

The positive selection analysis of European dogs. Genome scans with XP-EHH (A) and XP-nSL (B). Red dotted line marks empirical P = 0.001. (C) iSAFE analysis across 1.2 Mb around chr19:38,609,592 (red).

Among the 86 PSGs, three genes show strong positively selected signals (P < 0.001) in XP-EHH and XP-nSL (fig. 1), including lactase (LCT), minichromosome maintenance complex component 6 (MCM6), and LIM homeobox 8 (LHX8). LCT is a hydrolase that hydrolyze lactose into galactose and glucose. After weaning, most mammals reduced expression of LCT in the intestinal tissues and cannot digest milk (Sebastio et al. 1989; Büller et al. 1990; Lacey et al. 1994). However, lactase persistence (LP) is common in adult humans who live at northern and western Europe, as well as in African and Middle Eastern pastoralist groups, providing for by mutations in LCT and MCM6 (Hollox et al. 2001; Enattah et al. 2002; Ingram et al. 2009). A Steppe-associated expansion during the early Bronze Age contributed to advance LP in South Asians (Satta and Takahata 2020). Positive selection always creates long linkage disequilibrium (LD). LD analysis using Haploview (Barrett et al. 2005) shows that LCT and MCM6 are tightly linked in European dogs ( supplementary fig. S2, Supplementary Material online). It is consistent with the finding in Finnish pedigrees (Enattah et al. 2002).

The Convergent Distribution of LP in Humans and LCT-G SNP in Dogs

To identify candidate SNPs, Fst by site between European dogs and gray wolves were calculated across whole genome and the top 1% sites taken for gene annotation (Danecek et al. 2011). One SNP (chr19:38,609,592, A-to-G) showed high allele frequency in European dogs (91.7%) compared with the SEAI dogs (61.8%) and wolves (6.1%). Thus, we used integrated Selection of Allele Favored by Evolution (iSAFE) analysis to search for candidate SNPs in a 1.2-Mb around chr19:38,609,592 (fig. 1C) (Akbari et al. 2018). To exclude effects of the demographic history, we simulated 100,000 regions with 1.2 Mb to calculate iSAFE under the demographic history (Liu et al. 2018). The result of iSAFE shows that the A-to-G mutation is significantly under positive selection (P = 1.09E-7). We performed 2D SFS to infer the core region under selection (Fujito et al. 2018; Satta et al. 2020). There are 69 SNPs located at the core region with strong LD (r 2 > 3/4). Thus, we performed the simulations for 20,000 regions with 11,630 bp containing 69 SNPs and calculated 2D SFS under the demographic history (Liu et al. 2018). The significantly small values of Fc , Lc 0, and large imax, γ*(10) and G*c0 support that LCT is under soft sweep in European dogs ( supplementary table S3, Supplementary Material online). It suggests that more than one derived allelic lineages have been undergoing the selective sweep. The time to the most recent common ancestor (t coa) of A-to-G mutation is 6,535 ± 180 years ago. This time is longer than 4,000 years that LP-associated allele earliest appeared in ancient Europeans (Gamba et al. 2014; Mathieson et al. 2015), and longer than the t coa (3,280 ± 480 years ago) of T at rs4988235 in Europeans inferred by 2D SFS (Satta et al. 2020; Satta and Takahata 2020). Because T at rs4988235 in Europeans is under hard sweep, its onset time of positive selection (t SEL) is longer than t coa (3,280 years) (Satta and Takahata 2020). On the contrary, the t SEL of A-to-G mutation in European dogs is younger than t coa (6,535 years) due to its soft sweep. Considering the presence of A-to-G mutation in wolves and gene flows between dogs and wolves (Wang et al. 2016), it is plausible that A-to-G mutation had been existing in European dogs before the onset of positive selection. The t SEL of A-to-G mutation is shorter than the time that the earliest milk consumption in the Near East and southeastern Europe appeared around 6,500 BC (Evershed et al. 2008). Dairy economy was well developed in central Europe by 6,500 years ago (Curry 2013).

To further explore the global distribution of LCT-G in dogs, we calculated its allele frequency from 737 individuals (fig. 2) (Plassais et al. 2019). The LCT-G allele is almost fixed in Middle Eastern dogs (92.2%). A similar pattern has been reported in Middle Eastern human populations (Swallow 2003; Ingram et al. 2009). Because milk consumption emerged in the Near East and southeastern Europe 7,000–8,500 years ago (Evershed et al. 2008). On the contrary, most Chinese adult humans are lactose intolerant (Bolin et al. 1970; Bolin and Davis 1970; Bryant et al. 1970; Wang et al. 1984). Therefore, it is clearly plausible that the increased expression of LCT helps European dogs to adapt to a milk-based diet. The allele frequency is also high in Indian dogs (90.0%). The LP among Indian human populations is complex, with LP high in the North Indians but low in South Indians (Tandon et al. 1981; Enattah et al. 2002). In Africa, the allele frequency is low among Congolese (basenji, 12.5%) and Nigerian indigenous dogs (31.6%), but high in Namibian village dogs (83.3%) and Moroccan (Sloughi, 87.5%). Notably, central Namibian dogs are genetically closest to American dogs, which implies predominantly non-African origins (Boyko et al. 2009). Their high allele frequency might be caused by the nonindigenous lineage. For African human populations, pastoralist populations predominantly exhibit high LP in contrast to nonpastoralists (Mulcare et al. 2004; Tishkoff et al. 2007).

Fig. 2

The distribution SNPs of LCT in the global panel. Red portions of the pies represent the ratio of LCT SNP (G) whereas the blue represents the ratio of LCT SNP (A). The breed origin was obtained from FCI.

The distribution SNPs of LCT in the global panel. Red portions of the pies represent the ratio of LCT SNP (G) whereas the blue represents the ratio of LCT SNP (A). The breed origin was obtained from FCI.

LCT SNP (A > G) Increase the LCT Expression

The A-to-G mutation is located in intron 2 of LCT. The intron 2 is highly conserved across mammals and its deletion significantly reduces the expression of LCT in mice (Labrie et al. 2016). Based on JASPAR database (Khan et al. 2017), transcription factor ZEB1 (zinc finger E-box-binding homeobox 1) potentially binds to this SNP position ( supplementary table S4, Supplementary Material online). The A-to-G mutation in LCT reduces its consistency with ZEB1 (relative score 0.987 vs. 0.853). Additionally, the base in ZEB1 which binds to this SNP in LCT is highly conserved in Homo sapiens (sequence logo = 2, supplementary fig. S3, Supplementary Material online). It is plausible that the A-to-G mutation may change LCT expression by modifying ZEB1 binding. To verify this, LCT-G and LCT-A luciferase reporter constructs were engineered (fig. 3A and supplementary note, Supplementary Material online) and cotransfected into HEK-293T cells with ZEB1 expression vector (fig. 3B). Luciferase activity shows that LCT-G has a higher expression than LCT-A (fig. 3C). When ZEB1 was cotransfected, the luciferase activity of LCT-A and LCT-G luciferase reporter constructs were significantly decreased. These results suggest that A-to-G mutation in LCT increases the expression of LCT by weakening the binding of ZEB1. Thus, we infer that the mutation enhances the function of LCT, resulting in LP of European dogs. The similar pattern is found in humans. For Europeans, the SNP C/T-13910 located ∼14 kb upstream of LCT is associated with the LP and under strong positive selection (Enattah et al. 2002). The region surrounding C/T-13910 increases the expression of LCT as a strong enhancer (Enattah et al. 2002; Olds and Sibley 2003; Troelsen et al. 2003). The LCT intron 2 is a regulatory element for the development of LP in humans and mice (Labrie et al. 2016). We therefore infer that an elevated expression of LCT in European dogs confers their adaptation to milk-based diets.

Fig. 3

LCT SNP (G > A) influences the suppression of LCT expression with the involvement of ZEB1. (A) A schematic representation of the LCT construct. The DNA sequence including exon2, intron2, and exon3 of LCT are linked by a promoter and luciferase. (B) A schematic representation of the ZEB1 construct. ZEB1 is linked by a promoter, flag and puro. (C) HEK-293T cells were cotransfected with pCMV-Renilla (control), LCT SNP (G) or LCT SNP (A) Luciferase reporter construct as well as the ZEB1 expressing vector in sextuplicate. Two days after transfection, the cells were collected for the dual-luciferase reporter assays. Data are means ± SD. ** Mean P < 0.05 (t-test), *** Mean P < 0.001. RFP, red fluorescent protein.

LCT SNP (G > A) influences the suppression of LCT expression with the involvement of ZEB1. (A) A schematic representation of the LCT construct. The DNA sequence including exon2, intron2, and exon3 of LCT are linked by a promoter and luciferase. (B) A schematic representation of the ZEB1 construct. ZEB1 is linked by a promoter, flag and puro. (C) HEK-293T cells were cotransfected with pCMV-Renilla (control), LCT SNP (G) or LCT SNP (A) Luciferase reporter construct as well as the ZEB1 expressing vector in sextuplicate. Two days after transfection, the cells were collected for the dual-luciferase reporter assays. Data are means ± SD. ** Mean P < 0.05 (t-test), *** Mean P < 0.001. RFP, red fluorescent protein.

Conclusion

Genes for LP coevolved over time with variations in human dietary preferences and milk-ingestion cultures (Beja-Pereira et al. 2003) . Of all domestic animals, dogs have had one of the longest mutual coexistence with humans, sharing among other things like food and living environments. There is evidence for convergent evolution between humans and dogs regarding several factors (Perry et al. 2007; Axelsson et al. 2013; Wang et al. 2014; Liu et al. 2018). Here, we describe the coevolution between dogs and human dietary culture at the genome level. Based on whole-genome analyses and gene expression assays, we outline the mutational change in LCT gene which increases its expression to confer adaptability to milk-based diets. This study expands our understanding of the genetic basis of dogs' adaptation to human diets. It is imperative that the dog provides a suitable large animal model for studying human diseases and medicines, especially those of the digestive tract.

Materials and Methods

Sample Information

The raw SNPs files of 722 individuals were downloaded from NCBI (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA448733, last accessed July 17, 2021) (Plassais et al. 2019). SNPs in autosomes marked by PASS were used for analysis. We created a metadata of countries of origins of dog breeds from the FCI (http://www.fci.be, last accessed July 17, 2021). PCA was carried out using smartpca in EIGENSOFT (v7.2.1). Chr19:38,609,592 SNP information of 15 Nigerian dogs were obtained from Liu et al. (2018).

Positive Selection

Based on the genetic map downloaded from https://github.com/auton1/dog_recomb/tree/master/canFam3.1/maps (last accessed July 17, 2021)(Auton et al. 2013), genotypes were phased by SHAPEIT (v2.r904) with 0.5 Mb windows and an effective population size of 83,600 (Delaneau et al. 2012). Subsequently, XP-EHH and XP-nSL were calculated using selscan (v1.3.0) (Sabeti et al. 2007; Szpiech and Hernandez 2014). The empirical P value of XP-EHH and XP-nSL were calculated following the method previously reported ( supplementary note, Supplementary Material online) (Lee et al. 2014). The genomic regions with P < 0.01 were considered for gene annotation. Gene set was download from Ensembl (version CanFam3.1.101). iSAFE was performed for 1.2 Mb around the chr19:38,609,592 (chr19:36,109,592-41,109,592) with gray wolves as the control. 2D SFS was carried out for 69 SNPs located at the core region with strong LD (chr19: 38,600,610-38,612,240, r 2 > 3/4). P values for iSAFE and 2D SFS were got based on the simulations ( supplementary note, Supplementary Material online) of population history from Liu et al (2018) by ms (Hudson 2002). LD was carried out for LCT and MCM6 (19:38,572,058-38,660,280) by haploview (Barrett et al. 2005). SNPs with minor allele frequency ≥0.01 were used.

Generation of Constructs and Dual-Luciferase Assays

The 5,643-bp partial genomic DNA sequences of LCT genes, including exon 2, intron 2, and exon 3, were amplified. The PCR products were cloned into pGL3-basic Luciferase Reporter Vector (Promega, Madison, WI, USA) in XbaI and BamHI (NEB, USA) digestion sites to generate wild-type LCT luciferase constructs. It is formed the template for making mutated LCT luciferase constructs. The 3,378-bp full-length CDS (Coding DNA Sequence) of dog ZEB1 was amplified by primers. The PCR products were cloned into LentiV2-RFP vector in XbaI and XhoI digestion sites to generate ZEB1 expressing vector ( supplementary note, Supplementary Material online). The HEK-293T cells were seeded into 24-well plates at 1 × 105 cells per well ( supplementary note, Supplementary Material online). On the following day, the cells were transfected with SNP (G) or SNP (A) luciferase reporter construct (500 ng per well), and an internal control pCMV-Renilla control (25 ng per well) as well as the ZEB1 expressing vector or negative control LentiV2-RFP vector (25 ng per well) in sextuplicate using Lipofectamine 3000 reagent (Invitrogen, Carlsbad, CA, USA). Two days after transfection, cells were collected to measure the luciferase activity by the Dual-Luciferase Reporter Assay System (Promega), and luciferase expression was normalized to renilla luciferase expression. Student's two-tailed t-test was used to analyze the statistical significance of data.

Supplementary Material

Supplementary data are available at Molecular Biology and Evolution online.

Acknowledgments

The authors are thankful to Laurent A. F. Frantz (School of Biological and Chemical Sciences, Queen Mary University of London) for helpful feedback and discussions related to this work. This work was supported by National Natural Science Foundation of China (32000298), and the Innovative Research Team (in Science and Technology) of Yunnan Province (201905E160019), the National Key R&D Program of China (2019YFA0707101), Strategic Priority Research Program (XDPB17) of the Chinese Academy of Sciences (CAS), and Key Research Program of Frontier Sciences of the CAS (ZDBS-LY-SM011). G.-D.W. was supported by the National Youth Talent Support Program. They also acknowledge the following for funding: the grant (2018KF004) from State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University. This work was supported by the Animal Branch of the Germplasm Bank of Wild Species, Chinese Academy of Sciences (the Large Research Infrastructure Funding).

Data Availability

The data underlying this article are available in FigShare (https://doi.org/10.6084/m9.figshare.14411024, last accessed July 17, 2021). The VCF file containing 91 million variants and 722 genomes was published on NCBI (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA448733, last accessed July 17, 2021) (Plassais et al. 2019) and at https://doi.org/10.1038/s41467-019-09373-w (last accessed July 17, 2021). No new genome sequencing data were yielded.

References

Akbari

A

,

Vitti

JJ

,

Iranmehr

A

,

Bakhtiari

M

,

Sabeti

PC

,

Mirarab

S

,

Bafna

V.

2018

.

Identifying the favored mutation in a positive selective sweep

.

Nat Methods

.

15

(

4

):

279

282

.

Arendt

M

,

Cairns

KM

,

Ballard

JW

,

Savolainen

P

,

Axelsson

E.

2016

.

Diet adaptation in dog reflects spread of prehistoric agriculture

.

Heredity

117

(

5

):

301

306

.

Auton

A

,

Rui

LY

,

Kidd

J

,

Oliveira

K

,

Nadel

J

,

Holloway

JK

,

Hayward

JJ

,

Cohen

PE

,

Greally

JM

,

Wang

J

, et al.

2013

.

Genetic recombination is targeted towards gene promoter regions in dogs

.

PLoS Genet

.

9

(

12

):

e1003984

.

Axelsson

E

,

Ratnakumar

A

,

Arendt

M-L

,

Maqbool

K

,

Webster

MT

,

Perloski

M

,

Liberg

O

,

Arnemo

JM

,

Hedhammar

Å

,

Lindblad-Toh

K.

2013

.

The genomic signature of dog domestication reveals adaptation to a starch-rich diet

.

Nature

495

(

7441

):

360

364

.

Büller

HA

,

Kothe

MJ

,

Goldman

DA

,

Grubman

SA

,

Sasak

WV

,

Matsudaira

PT

,

Montgomery

RK

,

Grand

RJ.

1990

.

Coordinate expression of lactase-phlorizin hydrolase mRNA and enzyme levels in rat intestine during development

.

J Biol Chem

.

265

(

12

):

6978

6983

.

Barrett

JC

,

Fry

B

,

Maller

J

,

Daly

MJ.

2005

.

Haploview: analysis and visualization of LD and haplotype maps

.

Bioinformatics

21

(

2

):

263

265

.

Beja-Pereira

A

, ,

Luikart

G

, ,

England

PR

,
,

Bradley

DG

,
,

Jann

OC

,
,

Bertorelle

G

,
,

Chamberlain

AT

,
,

Nunes

TP

,
,

Metodiev

S

,
,

Ferrand

N

, et al. .

2003

.

Gene-culture coevolution between cattle milk protein genes and human lactase genes

.

Nat Genet

.

35

(

4

):

311

313

.

Bolin

TD

,

Davis

AE.

1970

.

Lactose intolerance in Australian-born Chinese

.

Australas Ann Med

.

19

(

1

):

40

41

.

Bolin

TD

,

Davis

AE

,

Seah

CS

,

Chua

KL

,

Yong

V

,

Kho

KM

,

Siak

CL

,

Jacob

E.

1970

.

Lactose intolerance in Singapore

.

Gastroenterology

59

(

1

):

76

84

.

Botigué

LR

,

Song

S

,

Scheu

A

,

Gopalan

S

,

Pendleton

AL

,

Oetjens

M

,

Taravella

AM

,

Seregély

T

,

Zeeb-Lanz

A

,

Arbogast

R-M

, et al.

2017

.

Ancient European dog genomes reveal continuity since the Early Neolithic

.

Nat Commun

.

8

:

16082

.

Boyko

AR

,

Boyko

RH

,

Boyko

CM

,

Parker

HG

,

Castelhano

M

,

Corey

L

,

Degenhardt

JD

,

Auton

A

,

Hedimbi

M

,

Kityo

R

, et al.

2009

.

Complex population structure in African village dogs and its implications for inferring dog domestication history

.

Proc Natl Acad Sci U S A

.

106

(

33

):

13903

13908

.

Bryant

GD

,

Chu

YK

,

Lovitt

R.

1970

.

Incidence and aetiology of lactose intolerance

.

Med J Aust

.

1

(

26

):

1285

1288

.

Curry

A.

2013

.

Archaeology: the milk revolution

.

Nature

500

(

7460

):

20

22

.

Danecek

P

,

Auton

A

,

Abecasis

G

,

Albers

CA

,

Banks

E

,

DePristo

MA

,

Handsaker

RE

,

Lunter

G

,

Marth

GT

,

Sherry

ST

, et al.

2011

.

The variant call format and VCFtools

.

Bioinformatics

27

(

15

):

2156

2158

.

Delaneau

O

,

Marchini

J

,

Zagury

J-F.

2012

.

A linear complexity phasing method for thousands of genomes

.

Nat Methods

.

9

(

2

):

179

181

.

Dreger

DL

,

Rimbault

M

,

Davis

BW

,

Bhatnagar

A

,

Parker

HG

,

Ostrander

EA.

2016

.

Whole-genome sequence, SNP chips and pedigree structure: building demographic profiles in domestic dog breeds to optimize genetic-trait mapping

.

Dis Model Mech

.

9

(

12

):

1445

1460

.

Enattah

NS

,

Timo

S

,

Erkki

S

,

Terwilliger

JD

,

Leena

P

,

Järvelä

I.

2002

.

Identification of a variant associated with adult-type hypolactasia

.

Nat Genet

.

30

(

2

):

233

237

.

Evershed

RP

,

Payne

S

,

Sherratt

AG

,

Copley

MS

,

Coolidge

J

,

Urem-Kotsu

D

,

Kotsakis

K

,

Özdoğan

M

,

Özdoğan

AE

,

Nieuwenhuyse

O

, et al.

2008

.

Earliest date for milk use in the Near East and southeastern Europe linked to cattle herding

.

Nature

455

(

7212

):

528

531

.

Frantz

LA

,

Mullin

VE

,

Pionnier-Capitan

M

,

Lebrasseur

O

,

Ollivier

M

,

Perri

A

,

Linderholm

A

,

Mattiangeli

V

,

Teasdale

MD

,

Dimopoulos

EA

, et al.

2016

.

Genomic and archaeological evidence suggest a dual origin of domestic dogs

.

Science

352

(

6290

):

1228

1231

.

Freedman

AH

,

Schweizer

RM

,

Vecchyo

DO

,

Han

E

,

Davis

BW

,

Gronau

I

,

Silva

PM

,

Galaverni

M

,

Fan

Z

,

Marx

P

, et al.

2016

.

Demographically-based evaluation of genomic regions under selection in domestic dogs

.

PLoS Genet

.

12

(

3

):

e1005851

.

Fujito

NT

,

Satta

Y

,

Hayakawa

T

,

Takahata

N.

2018

.

A new inference method for detecting an ongoing selective sweep

.

Genes Genet Syst

.

93

(

4

):

149

161

.

Gamba

C

,

Jones

ER

,

Teasdale

MD

,

McLaughlin

RL

,

Gonzalez-Fortes

G

,

Mattiangeli

V

,

Domboróczki

L

,

Kővári

I

,

Pap

I

,

Anders

A

, et al.

2014

.

Genome flux and stasis in a five millennium transect of European prehistory

.

Nat Commun

.

5

:

5257

.

Garud

NR

,

Messer

PW

,

Buzbas

EO

,

Petrov

DA.

2015

.

Recent selective sweeps in North American Drosophila melanogaster show signatures of soft sweeps

.

PLoS Genet

.

11

(

2

):

e1005004

.

Germonpré

M

,

Sablin

MV

,

Stevens

RE

,

Hedges

RE

,

Hofreiter

M

,

Stiller

M

,

Després

VR.

2009

.

Fossil dogs and wolves from Palaeolithic sites in Belgium, the Ukraine and Russia: osteometry, ancient DNA and stable isotopes

.

J Archaeol Sci

.

36

(

2

):

473

490

.

Gou

X

,

Wang

Z

,

Li

N

,

Qiu

F

,

Xu

Z

,

Yan

D

,

Yang

S

,

Jia

J

,

Kong

X

,

Wei

Z

, et al.

2014

.

Whole genome sequencing of six dog breeds from continuous altitudes reveals adaption to high-altitude hypoxia

.

Genome Res

.

24

(

8

):

1308

1315

.

Hollox

EJ

,

Poulter

M

,

Zvarik

M

,

Ferak

V

,

Krause

A

,

Jenkins

T

,

Saha

N

,

Kozlov

AI

,

Swallow

DM.

2001

.

Lactase haplotype diversity in the Old World

.

Am J Hum Genet

.

68

(

1

):

160

172

.

Hudson

RR.

2002

.

Generating samples under a Wright-Fisher neutral model of genetic variation

.

Bioinformatics

18

(

2

):

337

338

.

Ingram

CJE

,

Mulcare

CA

,

Yuval

I

,

Thomas

MG

,

Swallow

DM.

2009

.

Lactose digestion and the evolutionary genetics of lactase persistence

.

Hum Genet

.

124

(

6

):

579

591

.

Khan

A

,

Fornes

O

,

Stigliani

A

,

Gheorghe

M

,

Castromondragon

JA

,

Lee

RVD

,

Bessy

A

,

Chèneby

J

,

Kulkarni

SR

,

Tan

G

, et al.

2017

.

JASPAR 2018: update of the open-access database of transcription factor binding profiles and its web framework

.

Nucleic Acids Res

.

77

:

e43

.

Labrie

V

,

Buske

OJ

,

Edward

O

,

Jeremian

R

,

Ptak

C

,

Gasiūnas

G

,

Maleckas

A

,

Petereit

R

,

Žvirbliene

A

,

Adamonis

K

, et al.

2016

.

Lactase nonpersistence is directed by DNA-variation-dependent epigenetic aging

.

Nat Struct Mol Biol

.

23

(

6

):

566

573

.

Lacey

SW

,

Naim

HY

,

Magness

RR

,

Gething

MJ

,

Sambrook

JF.

1994

.

Expression of lactase-phlorizin hydrolase in sheep is regulated at the RNA level

.

Biochem J

.

302

(

3

):

929

935

.

Lee

HJ

,

Kim

J

,

Lee

T

,

Son

JK

,

Yoon

HB

,

Baek

KS

,

Jeong

JY

,

Cho

YM

,

Lee

KT

,

Yang

BC

, et al.

2014

.

Deciphering the genetic blueprint behind Holstein milk proteins and production

.

Genome Biol Evol

.

6

(

6

):

1366

1374

.

Li

Y

,

Wu

D-D

,

Boyko

AR

,

Wang

G-D

,

Wu

S-F

,

Irwin

DM

,

Zhang

Y-P.

2014

.

Population variation revealed high altitude adaptation of Tibetan Mastiffs

.

Mol Biol Evol

.

31

(

5

):

1200

1205

.

Liu

YH

,

Wang

L

,

Xu

T

,

Guo

X

,

Li

Y

,

Yin

TT

,

Yang

HC

,

Yang

H

,

Adeola

AC

,

Sanke

OJ

, et al.

2018

.

Whole-genome sequencing of African dogs provides insights into adaptations against tropical parasites

.

Mol Biol Evol

.

35

(

2

):

287

298

.

Mathieson

I

,

Lazaridis

I

,

Rohland

N

,

Mallick

S

,

Patterson

N

,

Roodenberg

SA

,

Harney

E

,

Stewardson

K

,

Fernandes

D

,

Novak

M

, et al.

2015

.

Genome-wide patterns of selection in 230 ancient Eurasians

.

Nature

528

(

7583

):

499

503

.

Mulcare

CA

,

Weale

ME

,

Jones

AL

,

Connell

B

,

Zeitlyn

D

,

Tarekegn

A

,

Swallow

DM

,

Bradman

N

,

Thomas

MG.

2004

.

The T allele of a single-nucleotide polymorphism 13.9 kb upstream of the lactase gene (LCT) (C−13.9kbT) does not predict or cause the lactase-persistence phenotype in Africans

.

Am J Hum Genet

.

74

(

6

):

1102

1110

.

Olds

LC

,

Sibley

E.

2003

.

Lactase persistence DNA variant enhances lactase promoter activity in vitro: functional role as a cis regulatory element

.

Hum Mol Genet

.

12

(

18

):

2333

2340

.

Ollivier

M

,

Tresset

A

,

Bastian

F

,

Lagoutte

L

,

Axelsson

E

,

Arendt

M

,

Bălăşescu

A

,

Marshour

M

,

Sablin

M

,

Salanova

L

, et al.

2016

.

Amy2B copy number variation reveals starch diet adaptations in ancient European dogs

.

R Soc Open Sci

.

3

(

11

):

160449

.

Ostrander

EA

,

Wang

G-D

,

Larson

G

,

vonHoldt

BM

,

Davis

BW

,

Jagannathan

V

,

Hitte

C

,

Wayne

RK

,

Zhang

Y-P

, Dog10K Consortium,

2019

.

Dog10K: an international sequencing effort to advance studies of canine domestication, phenotypes and health

.

Natl Sci Rev

.

6

(

4

):

810

824

.

Perry

GH

,

Dominy

NJ

,

Claw

KG

,

Lee

AS

,

Fiegler

H

,

Redon

R

,

Werner

J

,

Villanea

FA

,

Mountain

JL

,

Misra

R

, et al.

2007

.

Diet and the evolution of human amylase gene copy number variation

.

Nat Genet

.

39

(

10

):

1256

1260

.

Plassais

J

,

Kim

J

,

Davis

BW

,

Karyadi

DM

,

Hogan

AN

,

Harris

AC

,

Decker

B

,

Parker

HG

,

Ostrander

EA.

2019

.

Whole genome sequencing of canids reveals genomic regions under selection and variants influencing morphology

.

Nat Commun

.

10

(

1

):

1489

.

Sabeti

PC

,

Varilly

P

,

Fry

B

,

Lohmueller

J

,

Hostetter

E

,

Cotsapas

C

,

Xie

X

,

Byrne

EH

,

McCarroll

SA

,

Gaudet

R

, et al.

2007

.

Genome-wide detection and characterization of positive selection in human populations

.

Nature

449

(

7164

):

913

918

.

Satta

Y

,

Takahata

N.

2020

. Population genomics on the origin of lactase persistence in Europe and South Asia. bioRxiv 2020.06.30.179432.

Satta

Y

,

Zheng

W

,

Nishiyama

KV

,

Iwasaki

RL

,

Hayakawa

T

,

Fujito

NT

,

Takahata

N.

2020

.

Two-dimensional site frequency spectrum for detecting, classifying and dating incomplete selective sweeps

.

Genes Genet Syst

.

94

(

6

):

283

300

.

Sebastio

G

,

Villa

M

,

Sartorio

R

,

Guzzetta

V

,

Poggi

V

,

Auricchio

S

,

Boll

W

,

Mantei

N

,

Semenza

G.

1989

.

Control of lactase in human adult-type hypolactasia and in weaning rabbits and rats

.

Am J Hum Genet

.

45

(

4

):

489

497

.

Swallow

DM.

2003

.

Genetics of lactase persistence and lactose intolerance

.

Annu Rev Genet

.

37

:

197

219

.

Szpiech

ZA

,

Hernandez

RD.

2014

.

selscan: an efficient multithreaded program to perform EHH-based scans for positive selection

.

Mol Biol Evol

.

31

(

10

):

2824

2827

.

Tandon

RK

,

Joshi

YK

,

Singh

DS

,

Narendranathan

M

,

Balakrishnan

V

,

Lal

K.

1981

.

Lactose intolerance in North and South Indians

.

Am J Clin Nutr

.

34

(

5

):

943

946

.

Tishkoff

SA

,

Reed

FA

,

Ranciaro

A

,

Voight

BF

,

Babbitt

CC

,

Silverman

JS

,

Powell

K

,

Mortensen

HM

,

Hirbo

JB

,

Osman

M

, et al.

2007

.

Convergent adaptation of human lactase persistence in Africa and Europe

.

Nat Genet

.

39

(

1

):

31

40

.

Troelsen

JT

,

Olsen

J

,

Møller

J

,

Sjöström

H.

2003

.

An upstream polymorphism associated with lactase persistence has increased enhancer activity

.

Gastroenterology

125

(

6

):

1686

1694

.

Vilà

C

,

Savolainen

P

,

Maldonado

JE

,

Amorim

IR

,

Rice

JE

,

Honeycutt

RL

,

Crandall

KA

,

Lundeberg

J

,

Wayne

RK.

1997

.

Multiple and ancient origins of the domestic dog

.

Science

276

(

5319

):

1687

1689

.

Wang

GD

,

Fan

R-X

,

Zhai

W

,

Liu

F

,

Wang

L

,

Zhong

L

,

Wu

H

,

Yang

H-C

,

Wu

S-F

,

Zhu

C-L

, et al.

2014

.

Genetic convergence in the adaptation of dogs and humans to the high-altitude environment of the Tibetan plateau

.

Genome Biol Evol

.

6

(

8

):

2122

2128

.

Wang

GD

,

Larson

G

,

Kidd

JM

,

vonHoldt

BM

,

Ostrander

EA

,

Zhang

YP.

2019

.

Dog10K: the International Consortium of Canine Genome Sequencing

.

Natl Sci Rev

.

6

(

4

):

611

613

.

Wang

GD

,

Shao

X-J

,

Bai

B

,

Wang

J

,

Wang

X

,

Cao

X

,

Liu

Y-H

,

Wang

X

,

Yin

T-T

,

Zhang

S-J

, et al.

2019

.

Structural variation during dog domestication: insights from grey wolf and dhole genomes

.

Natl Sci Rev

.

6

(

1

):

110

122

.

Wang

GD

,

Zhai

W

,

Yang

H-C

,

Fan

R-X

,

Cao

X

,

Zhong

L

,

Wang

L

,

Liu

F

,

Wu

H

,

Cheng

L-G

, et al.

2013

.

The genomics of selection in dogs and the parallel evolution between dogs and humans

.

Nat Commun

.

4

:

1860

.

Wang

GD

,

Zhai

W

,

Yang

H-C

,

Wang

L

,

Zhong

L

,

Liu

Y-H

,

Fan

R-X

,

Yin

T-T

,

Zhu

C-L

,

Poyarkov

AD

, et al.

2016

.

Out of southern East Asia: the natural history of domestic dogs across the world

.

Cell Res

.

26

(

1

):

21

33

.

Wang

M-S

,

Thakur

M

,

Peng

M-S

,

Jiang

Y

,

Frantz

LAF

,

Li

M

,

Zhang

J-J

,

Wang

S

,

Peters

J

,

Otecko

NO

, et al.

2020

.

863 genomes reveal the origin and domestication of chicken

.

Cell Res

.

30

(

8

):

693

701

.

Wang

YG

,

Yan

YS

,

Xu

JJ

,

Du

RF

,

Flatz

SD

,

Kühnau

W

,

Flatz

G.

1984

.

Prevalence of primary adult lactose malabsorption in three populations of northern China

.

Hum Genet

.

67

(

1

):

103

106

.

Wu

D-D

,

Yang

C-P

,

Wang

M-S

,

Dong

K-Z

,

Yan

D-W

,

Hao

Z-Q

,

Fan

S-Q

,

Chu

S-Z

,

Shen

Q-S

,

Jiang

L-P

, et al.

2020

.

Convergent genomic signatures of high-altitude adaptation among domestic mammals

.

Natl Sci Rev

.

7

(

6

):

952

963

.

Author notes

Yan-Hu Liu, Lu Wang and Zhiguo Zhang authors contributed equally to this work.

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com

Associate Editor: Yoko Satta

Yoko Satta

Associate Editor

Search for other works by this author on:


Supplementary data