Abstract
This study was focused on the evolving role of DGAT1 using phylogenetic analysis. DGAT1 was retrieved from available databases of 37 mammalian genomes to detect signatures of positive selection in individual codons. Evolutionary analyses likelihood (REL), fast unconstrained Bayesian approximation (FUBAR), and Mixed-effects model evolution (MEME). REL detected 3 positive selection sites (at 109, 195, and 211 positions), and MEME detected 20 sites. Among them, 211 sites were confirmed as a strong positive selection of the DGAT1 gene. The codon model evolution (CME) model clustered amino acid substitution rates at three class levels (0.06, 0.20, and 0.40). Multiple hit tests showed that 20 of the DGAT1 triplet codes for amino acid formation were changed into new codes. In which, the GAC code for aspartate and GTC for valine had the maximum counts of 14 and 16. Strong evidence of positive selection of the DGAT1 gene in mammals was found.
Key Words
Evolutionary Study, DGAT1, Mammals, Positive Selection Variation
Introduction
Evolutionary biology gives an understanding of genetic improvement happening over species. It helps us to realize the gene functions and variation processes including their mutations, duplication, and genome-wide associations (Arbuckle, 2020; Ball and Balthazart, 2021). Wang et al. (2017) reported that the molecular evolution of some genes with different breeding histories can be used as a unique population structural genes. Which, can explain the genetic variation mechanisms and complex pathways. The primary enzyme that regulates the production of triacylglycerides (TGA) in the endoplasmic reticulum (ER) membrane is called diacylglycerol acyltransferase (DGAT) (Ying et al., 2017). Different organisms have been shown to harbour four DGAT subfamilies: DGAT1, DGAT2, DGAT3, and WAX-DGAT (Liu et al., 2020). whereas the main enzymes for TGA production and storage in animal tissues are DGAT1 and DGAT2 (Chitraju et al., 2019). Nevertheless, there is no evolutionary connection between the two enzymes. Consequently, a number of studies (Canon-Beltran et al., 2020; Hua et al., 2018; Karis et al., 2020) are attempting to determine how these two essentially distinct enzymes contribute to TGA storage in mammalian adipose tissues. As an ER multitopic membrane protein with a luminal side active site, DGAT1 is a member of the O-acyltransferase ER membrane gene family (Canon-Beltran et al., 2020). (Ma et al., 2018). This gene is a potential gene for milk fat and is found in the QTL of chromosome 14 (BTA14) areas (Bovenhuis et al., 2016). The evolution of DGAT1 has been shown to be driven by positive selection, however, the specific codons that are selected are unknown. DGAT1's changing function was investigated by phylogenetic analysis. In addition to diacylglycerol, this enzyme can transfer the fatty acyl moiety to a variety of acceptors, such as retinol or long-chain alcohols, to create retinyl esters or waxes, respectively (Chitraju et al., 2019). Additionally, Maraschin et al. (2019) described the Kennedy pathway and the monoacylglycerol pathway, citing the glycerol phosphate as one of the two primary channels for the synthesis of triacylglycerides. For example, Tabaran et al. (2015) found that the DGAT1 gene is essential for the synthesis of TGA in milk fat since it codes for the DGAT1 enzyme. As a result, the DGAT1 gene is regarded as one of the most significant milk fat candidate genes (Bovenhuis et al., 2016). Numerous bioinformatics models are widely employed to monitor the favourable choices of mammalian genes. Using the Markov Chain Monte Carlo (MCMC) procedure, the MEME model, for example, can combine the fixed effects to find instances of both positive selection and episodic diversifying selection at the individual branch site and rapid unconstrained Bayesian approximation (FUBAR) (Bakiu et al., 2015). According to Bartakova et al. (2021), the advantage of MEME is that it can enable each site to have its own selective history without requiring the branches that are being selected to be partitioned beforehand. Additionally, the evolutionary conservation of amino acid locations is crucial for preserving the structure and functionality of proteins. For instance, Onodera et al. (2019) reported that mammals with serine at 366 positions have more potential to control some cellular metabolism including biosynthesis and mitochondrial oxidation. Therefore, the detection of selected sites may enlighten the selection forces and detect the functionally significant sites for protein interaction. The Mechanistic-Empirical Combination (MEC) model can estimate the selection pressure at particular codons (Onodera et al., 2019). In order to distinguish the positive selection DGAT1 gene among various mammalian species, this study sought to examine selection model markers using the maximum likelihood probability approach. Additionally, it sought to provide information regarding the applicability of these markers assisted selection in the species diversity.
Materials and Methods
The nucleotide and amino
acid sequence-coding DGAT1 gene
Ensembl
(http://useast.ensembl.org/index.html), Uniprot (http://www.uniprot.org), and
NCBI (www.ncbi.nlm.nih.gov/genbank) databases provide DGAT1 gene coding
nucleotides and amino acid sequences. Clustal Omega was used in the MEGA 6.0
programme to align the DGAT1 protein sequences (Tamura et al., 2013). The species was identified using the accession
number in addition to the mRNA and protein accession numbers shown in (Table
1). We used maximum likelihood methods in MEGA 6.0 to generate the phylogenetic
tree of the DGAT1 gene. For taxonomic clustering, 1000 repeats were available
using bootstrapping. Asif and Associates, 2017.
Molecular evolution and positive selection of DGAT1
codon
The molecular basis of
evolution and the relevance of positive selection of DGAT1 was designed by
analysing the codon and sequence of DGAT1 and comparing the dN/dS ratio of ?
for two maximum likelihood methodologies (Ahmad et al., 2018; Ahmad et al., 2019). Data Monkey (http://www.datamonkey.org/) and
the HyPhy package were the software tools used. To discover the values globally
of ?, we employed various likelihood programmes such as internal fixed effect
likelihood (IFEL), rapid unconstrained Bayesian approximation (FUBAR), and
random effect likelihood (REL). For positive site picks, the REL employed a 95%
confidence interval and Bayes factor values greater than 20. According to Ahmad
et al. (2017b), the other
study used p values < 0.05 to evaluate significance. The estimated
parameters were aligned using the multiple EM for motif elicitation (MEME)
model and the equation ? = ?/?. The alternative model includes four factors for
each site: ??, ?+, q?, and ? calculating site-to-site substitution variability
rates. The selective pressure was estimated using two parameters, ?: ?? ? ? and
?+. Based on the ?2 asymptotic distribution, values p < 0.05 were deemed
significant (Pond & Muse, 2005).
Protein prediction by amino acids and nucleic acids
The online database Con
Surf, accessible at http://consurftest.tau.ac.il, was utilised to forecast the
proteins that have retained their amino and nucleic acid compositions over
time. For sequence codon alignment of DGAT1, Yang et al. (2012) employed Selection version 2.2
(http://selecton.tau.ac.il/), which implements the mechanistic-empirical
combination (MEC) model for predicting adaptive selection pressure at distinct
codons. Ramachandran plot is used using http://vadar.wishartlab.com/ to predict
the DGAT1. Furthermore, utilising a genetic algorithm and codon model evolution
based on synonymous and nonsynonymous substitution rates, the evolutionary
fingerprinting of the DGAT1 gene was deduced. The phylogenetic Markov model was
utilised to explain the evolution of the codon model, which included character
frequencies, replacement rates for amino acids, and amino acid substitution
rate clustering.
Table 1
List of species and
accession number of the NCBI gene bank database, which was used for the
hypothesis testing
No |
Species
common name |
Scientific
name |
Accession Number |
Protein Accession Number |
1 |
Human |
Homo sapiens |
NM_005465.4 |
NP_859029.1 |
2 |
House Mouse |
Mus musculus |
NM_001357390.1 |
XP_030109770.1 |
3 |
Norway rat |
Rattus norvegicus |
XM_006250322.3 |
XP_006250383.1 |
4 |
Chimpanzee |
Pan troglodytes |
XM_016934876.1 |
XP_016791361.1 |
5 |
White-tufted-ear
marmoset |
Callithrix jacchus |
XM_017966707.1 |
XP_008983976.1 |
6 |
Feral Cattle |
Bos taurus |
NM_001191309.1 |
XP_024831736.1 |
7 |
Painted turtle |
Chrysemys picta |
XM_008170470.2 |
XP_008168692.1 |
8 |
Sheep |
Ovis aries |
XM_012187897.2 |
XP_027831384.1 |
9 |
Rhesus monkey |
Macaca mulatta |
NM_001266640.1 |
XP_028701012.1 |
10 |
Damara mole-rat |
Fukomys damarensis |
XM_010642598.2 |
XP_010640900.1 |
11 |
Chinese tree shrew |
Tupaia chinensis |
XM_014591111.1 |
XP_014446597.1 |
12 |
Water buffalo |
Bubalus bubalis |
XM_006045843.1 |
XP_006045905.1 |
13 |
Domestic ferret |
Mustela putorius |
XM_004756776.2 |
XP_012913885.1 |
14 |
Chinese hamster |
Cricetulus griseus |
XM_003508130.3 |
XP_016833997.1 |
15 |
Miniopterusnatalensis |
Miniopterus natalensis |
XM_016201398.1 |
XP_016056884.1 |
16 |
Egyptian fruit bat |
Rousettus aegyptiacus |
XM_016151253.1 |
XP_016006738.1 |
17 |
Sooty mangabey |
Cercocebus atys |
XM_012036298.1 |
XP_011891692.1 |
18 |
Chinese soft-shelled
turtle |
Pelodiscus sinensis |
XM_006135202.2 |
XP_014435074.1 |
19 |
Cheetah |
Pelodiscus sinensis |
XM_015072881.1 |
XP_026902031.1 |
20 |
Domestic cat |
Felis catus |
XM_023247428.1 |
XP_023103194.1 |
21 |
Giant panda |
Ailuropoda melanoleuca |
XM_011223567.2 |
XP_011221869.1 |
22 |
Green monkey |
Chlorocebus sabaeus |
XM_007989961.1 |
XP_007988151.1 |
23 |
Gray short-tailed
opossum |
Monodelphis domestica |
XM_016429466.1 |
XP_007481609.1 |
24 |
Long-tailed chinchilla |
Chinchilla lanigera |
XM_005374760.2 |
XP_005374816.1 |
25 |
Naked mole-rat |
Heterocephalus glaber |
XM_004853513.3 |
XP_004853570.1 |
26 |
Northern white-cheeked
gibbon |
Nomascus leucogenys |
XM_012506588.1 |
XP_030668452.1 |
27 |
Przewalski horse |
Equus caballus |
XM_023632779.1 |
XP_008525153.1 |
28 |
Prairie vole |
Microtus ochrogaster |
XM_005369606.2 |
XP_013202436.1 |
29 |
Pacific walrus |
Odobenus rosmarus |
XM_012562385.1 |
XP_012417839.1 |
30 |
Pig-tailed macaque |
Macaca nemestrina |
XM_011729484.1 |
XP_011727791.1 |
31 |
Sumatran orangutan |
Pongo abelii |
XM_009248019.1 |
XP_024089808.1 |
32 |
Wild Bactrian camel |
XM_014559995.1 |
XP_006185623.1 |
|
33 |
Western European
hedgehog |
Erinaceus europaeus |
XM_007534664.1 |
XP_007534733.1 |
34 |
Weddell seal |
Leptonychotes weddellii |
XM_006740006.1 |
XP_006740067.1 |
35 |
American beaver |
Castor canadensis |
XM_020163460.1 |
XP_020019049.1 |
36 |
Australian saltwater
crocodile |
Crocodylus porosus |
XM_019549778.1 |
XP_019405323.1 |
37 |
Koala |
Phascolarctos cinereus |
XM_020982080.1 |
XP_020837737.1 |
38 |
Anubis baboon |
Papio anubis |
XM_017958504.2 |
XP_017813935.1 |
39 |
Zebrafish |
Danio rerio |
NM_001197201.2 |
XP_001923454.3 |
Analysis of the protein-protein interaction network
Using the special linkage
analysis of STRING (version 9.1, http://www.string-db.org/), the
protein-protein interaction network with DGAT1 was predicted (Franceschini et
al., 2013). Protein
connections were identified using the online server data bank of biological
interactions. According to Li et al. (2017), the pooled
score < 0.4 was the cutoff standard value. The conserved motif analysis was
conducted using the MEME online programme. The MEME website
(http://meme-suite.org/tools/meme) provided an explanation of conserved motifs
in the DGAT1.
Results
Phylogenetic
relationship of DGAT1
To determine
the origins of evolutionary alterations in the genes of 37 domesticated and
wild mammalian species, phylogenetic tree analysis was conducted for the DGAT1
gene (Figure 1). which placed species with less significant relationships in
different phylogenetic groups, such as the Western European hedgehog (Erinaceus
europaeus) and the painted turtle (Chrysemys picta), and placed species with
evolutionarily close genes, such as the green monkey (Chlorocebus sabaeus) and
Anubis baboon (Papio anubis), in the same groups. As a result, nine groupings
were created from 37 species. We have determined the sites of the positive
selection codons in these mammalian clades using phylogenetic tree analysis. We
have determined the sites of the positive selection codons in these mammalian
clades using phylogenetic tree analysis.
By estimating global ?
values using random effect likelihood (REL), internal fixed effect likelihood
(IFEL), and fast unconstrained Bayesian approximation (FUBAR) techniques,
evolutionary evidence of positive selection was found. Three positive selection
sites (at locations 109, 195, and 211) were found by the REL. Three positive
selection sites were also found by IFEL (at codons 106, 254, and 319). In the
meanwhile, 11 sites under positive selection were found by FUBAR analysis at
locations 14, 21, 52, 153, 164, 221, 323, 325, 348, 349, and 480 (Table 2).
which, at a 95% confidence interval, FUBAR found more positive locations than
other analyses. Additionally, REL analysis, which found positive selection
values >20, was based on the Bayes factor. When p-values were less than
0.05, every discovered site showed a significant difference (Table 2).
Table 2
Sites under positive selection in DGAT1 genes by using
different approaches
IFEL |
REL |
FUBAR |
Positive sites (p-value) |
Positive sites (Bayes Factor) |
Positive sites (Posterior Probability) |
106(0.02),
254(0.04), 319(0.09) |
109(45.07), 195(44.59),
211(152.35) |
14(0.54), 21(0.54), 52(0.59), 153(0.92), 164(0.53), 221(0.66),
323(0.56), 325(0.55), 348(0.65), 349(0.56), 480(0.81) |
Significant
values (p < 0.05), Bayes factors >20.
Table 3 reports the distribution
of synonymous (?) and non-synonymous (?) substitution rates over sites inferred
by the MEME model, where the proportion of branches with ?>? is
significantly greater than 0. The p-value was derived using a mixture of
?2 distributions. In which, MEME analysis detected 20 sites undergone
episodic diversifying selection (Table 3). Among these sites, 28, 29, 36, 42,
70, 74, 211, 244, 257, 282, and 542 were detected under episodic diversifying
with p-values < 0.05. This model also estimated ? and ? substitution
rates and the sites having values ? > ? were considered significant and
determined these sites under diversifying selection (Ahmad et al., 2017a). For instance, 211 sites were inferred to have
experienced pervasive nonsynonymous substitution throughout the evolutionary
history with a p-value < 0.05. Moreover, this site evolved with ?+
> ?, and it was under positive selection in the other analyses. Therefore,
it has been confirmed that the 211 site was considered as a strong positive
selection of the DGAT1 gene, whereas all other sites were conserved
(Table 3).
Table 3
Mixed-effect
model evolution (MEME) based on the episodic diversifying selection of DGAT1
genes
Codon |
? |
?- |
Pr[?=?-] |
?+ |
Pr[?=?+] |
p-value |
22 |
0.13 |
0.00 |
0.96 |
56.28 |
0.04 |
0.06 |
23 |
0.75 |
0.05 |
0.95 |
14.31 |
0.05 |
0.07 |
28 |
0.15 |
0.00 |
0.80 |
1.78 |
0.20 |
0.02 |
29 |
0.00 |
0.00 |
0.84 |
9.24 |
0.16 |
0.00 |
36 |
0.32 |
0.14 |
0.95 |
31.17 |
0.05 |
0.03 |
42 |
2.01 |
0.13 |
0.93 |
88.60 |
0.07 |
0.02 |
44 |
0.00 |
0.00 |
0.56 |
2.64 |
0.44 |
0.07 |
70 |
1.26 |
0.00 |
0.89 |
149.54 |
0.11 |
0.03 |
74 |
0.53 |
0.53 |
0.93 |
301.72 |
0.07 |
0.00 |
112 |
0.43 |
0.00 |
0.52 |
1.98 |
0.48 |
0.07 |
147 |
0.22 |
0.05 |
0.97 |
4.21 |
0.03 |
0.09 |
211 |
0.00 |
0.00 |
0.00 |
0.37 |
1.00 |
0.03 |
224 |
0.14 |
0.14 |
0.00 |
1.02 |
1.00 |
0.09 |
244 |
0.08 |
0.00 |
0.60 |
0.54 |
0.40 |
0.05 |
254 |
0.14 |
0.14 |
0.18 |
1.62 |
0.82 |
0.06 |
257 |
0.30 |
0.06 |
0.96 |
10.53 |
0.04 |
0.03 |
282 |
0.16 |
0.16 |
0.97 |
11.15 |
0.03 |
0.03 |
310 |
0.30 |
0.07 |
0.92 |
4.33 |
0.08 |
0.10 |
500 |
0.45 |
0.04 |
0.94 |
7.93 |
0.06 |
0.08 |
542 |
0.58 |
0.04 |
0.88 |
78.39 |
0.12 |
0.00 |
Codon Model Selection
The genetic
algorithm was used to generate the codon model, which was used to identify the
evolutionary fingerprint seen in the coding regions of the DGAT1 genes. wherein
both synonymous and nonsynonymous substitution rates were used in the
evolutionary fingerprinting method. Figure 2 shows the evolution of the codon
model using the phylogenetic Markov model (CME). Character frequencies,
substitution rates, and the clustering of amino acid substitution rates at
three class levels (0.06, 0.20, and 0.40) are all included in this model. While
DENQ had 50% substitution, FWY and HKR had less than 50% substitution, and the
substitution pair ACGILMPSTV had 90% substitution. whereas the highest
substitution rate for the various ratio classes was 0.40. Furthermore, among
the different amino acid positions in the DGAT1 genes, the minimum was 0.06
(Figure 2).
In addition, the plot
in Figure 3 used Gaussian-approximated variance to show the
estimated distribution of
?
and ? rate alignments. In
which, the colored pixels show the density of the posterior sample distribution
for the given rate. The diagonal line represented the idealized neutral
evolution regime (? = 1). The points above the line correspond to positive
selection (? > 1), and the points below the line were for negative selection
(? < 1). The
graph showed the neutral evolution and only a few sites under the circle above
the diagonal had positive evolution at the DGAT1 gene. In which, the
probability of site-to-site distribution ratio (? = ?/?) based on the
likelihood log and Akaike information criterion (AIC) of five neutral evolution
classes were identified at DGAT1 gene (Figure 3). And, the likelihood
log was -19009.06134 for five class rates using 37 parameters.
The
modified Bayesian Information Criterion (mBIC) of the DGAT1 in 4032 logs (L)
was used to determine the codon models (Table 4). It was believed that the
model with -19302.3 L was the most useful for studying amino acid substitution.
The greatest estimate of a single rate (dN/dS) substitution in the third class
(Table 4) was 0.40/15, and the distribution of amino acids was classified into
three classes based on these rates (Figure 2). Furthermore, 644.48 mBIC and
342.30 L improvements are provided by this approach, assuming that the rate of
all non-synonymous modifications stays constant.
Table 4
Codon model
selection is based on the modified Bayesian Information Criterion (mBIC) of the
DGAT1 gene from different organisms.
Classes |
Models |
Credible |
mBIC |
?mBIC |
dN/dS (Rates in class) |
||
1 |
1 |
0 |
40974.5 |
|
0.16/75 |
||
2 |
2935 |
0 |
40387.1 |
587.41 |
0.07/50 |
0.32/25 |
|
3 |
1096 |
132 |
40330.0 |
57.07 |
0.06/42 |
0.20/18 |
0.40/15 |
N: number of
rate classes included in models; Models: genetic algorithm models; Credible:
all the models evaluated by a genetic algorithm within 9.21 mBIC unit (the best
model has credible values 0.01 or >1); mBIC: modified Bayesian Information
Criterion; ?mBIC: mBIC for N rate classes compared to N ? 1 rate classes;
dN/dS: maximum likelihood estimates for each rate class.
The maximum likelihood
analysis of the DGAT1 gene's codon by codon positive selection is shown in
Table 5. Onodera et al. (2019) found that when dN-dS values were higher than 1,
locations were significant and codons had experienced positive selection. For instance,
among the twenty-one identified codons, only codons 55 and 185 had dN-dS values
smaller than 1, suggesting that they were not under positive selection.
Table 5
Maximum
likelihood analysis of DGAT1 gene for codon by codon positive selection.
Codon
# |
Codon
Start |
Triplet |
Syn
(s) |
Nonsyn
(n) |
Syn
sites (S) |
Nonsyn
sites (N) |
dS |
dN |
dN-dS |
7 |
670 |
GGG |
2.67 |
14.33 |
0.87 |
2.08 |
3.06 |
6.88 |
3.82 |
8 |
673 |
GTC |
3.00 |
10.00 |
0.99 |
2.01 |
3.03 |
4.98 |
1.94 |
25 |
724 |
GAG |
2.25 |
11.75 |
0.67 |
2.22 |
3.37 |
5.30 |
1.92 |
33 |
748 |
GTG |
0.33 |
10.67 |
0.90 |
2.08 |
0.37 |
5.12 |
4.75 |
34 |
751 |
CTG |
3.00 |
8.00 |
1.27 |
1.72 |
2.37 |
4.66 |
2.29 |
43 |
778 |
TTC |
0.00 |
4.00 |
0.73 |
2.27 |
0.00 |
1.76 |
1.76 |
55 |
814 |
TGG |
0.00 |
2.00 |
0.15 |
2.17 |
0.00 |
0.92 |
0.92 |
56 |
817 |
TGC |
0.00 |
3.00 |
0.70 |
2.16 |
0.00 |
1.39 |
1.39 |
93 |
1093 |
ATG |
0.00 |
3.00 |
0.20 |
2.80 |
0.00 |
1.07 |
1.07 |
113 |
1153 |
CAG |
0.33 |
4.67 |
0.56 |
2.08 |
0.59 |
2.25 |
1.66 |
114 |
1156 |
AAC |
0.00 |
3.00 |
0.67 |
2.33 |
0.00 |
1.29 |
1.29 |
120 |
1174 |
AAG |
1.00 |
7.00 |
0.63 |
2.26 |
1.58 |
3.10 |
1.52 |
122 |
1180 |
ATG |
0.00 |
3.00 |
0.30 |
2.70 |
0.00 |
1.11 |
1.11 |
161 |
1297 |
ATG |
0.50 |
6.50 |
0.36 |
2.64 |
1.39 |
2.46 |
1.07 |
185 |
1375 |
TGG |
0.00 |
2.00 |
0.19 |
2.12 |
0.00 |
0.94 |
0.94 |
194 |
1402 |
ATC |
3.00 |
12.00 |
0.93 |
2.07 |
3.21 |
5.81 |
2.60 |
199 |
1417 |
AAG |
0.33 |
5.67 |
0.62 |
2.25 |
0.53 |
2.52 |
1.99 |
201 |
1423 |
ATG |
0.00 |
8.00 |
0.31 |
2.68 |
0.00 |
2.99 |
2.99 |
206 |
1438 |
AGC |
3.67 |
15.33 |
0.83 |
2.15 |
4.44 |
7.12 |
2.68 |
255 |
1621 |
CTG |
2.50 |
9.50 |
1.14 |
1.86 |
2.19 |
5.12 |
2.93 |
258 |
1630 |
ATC |
0.00 |
3.00 |
0.74 |
2.26 |
0.00 |
1.33 |
1.33 |
dN-dS values
> 1 indicate significantly the codons have undergone positive selection and
positions with their synonymous and non-synonymous substitution site.
Positive selection of amino acid positions
The
Mechanistic-Empirical Combination (MEC)
the model
estimated the selection pressure at particular codons at various codons in
DGAT1 under the positive selection (Figure 4). The indicators of selection
masked codons confirmed the positive selection differences on 11, 21, 191, 221,
and 231 codons.
A map of all
the discovered codon locations' overall performance is presented in Figure 5.
which shows the ambiguous, synonymous, and nonsynonymous codon changes with the
evolutionary time. Both the synonym and nonsynonymous mutant codons performed
consistently up until codon number 20. After that, when the number of codons
climbed to 580, there was a discernible improvement in performance. The
synonymous cumulative rate was lower at codon 471 than the non-synonymous
cumulative rate. Furthermore, the performance of the ambiguous codon increased
as the codon placements were initiated, and it eventually stabilised until
codon 471.
Ramachandran plot
The
energetically permitted regions of the backbone dihedral angle psi (?, x-axis)
against phi (?, y-axis) of the amino acid residues were displayed using a
Ramachandran plot. The energetically permitted portions of the amino acid
residues backbone dihedral angle psi (?, x-axis) versus phi (?, y-axis) that
appeared in the protein structure were displayed using a Ramachandran plot
(Vimala et al., 2021). In Figure
6, the amino acids are represented by a black dot, while the permitted regions
with ?-helical and ?-sheet conformations are shown by a red region. The protein
exhibits more right-handed ?-helix secondary structure than left-handed ?-helix
and ?-sheet conformations, according to the cluster dots displayed on the map.
Thus, phi and
psi, respectively, have established the relative rotational angle of the
protein torsion. The relative rotational angle of the protein torsion by phi
and psi, respectively, was verified by the cluster dots displayed in the
Ramachandran plot. This could explain how certain amino acids and tight atom
interactions play a crucial role in the functionality of produced proteins
(Vimala et al., 2021).
Motif Compositions of DGAT1 protein
To gain a
better understanding of the variations in the development and function of
proteins, as well as to identify the distinct areas of various DGAT1 proteins,
conserved motifs were predicted through the use of the online MEME tool
(Mulaudzi-Masuku et al., 2019). The
analysis of conserved regions in the DGAT1 gene from various animals is
displayed in Figure 7. Using BLAST, ten motifs in all were predicted and their
sequences were confirmed. DGAT1 protein from the same group generally had
comparable motifs. Furthermore, Table 6 shows that the lengths of the motifs
varied from 29 to 50 amino acid residues.
Of the 37
species that were the subject of the study, 29 species had 10 motifs in each
DGAT1 protein, whereas the remaining 8 species had 9 motifs. In eight species
of DGAT1 proteins, motifs in the colours yellow, orange, and light blue were
also absent. For example, neither the Chinese soft-shell turtle (Pelodiscus
sinensis) nor the feral cattle (Bos taurus) have the light blue motif that is
part of the membrane-bound O-acyltransferase family MBOAT and possesses amino
acid residue sequence (RLLEMLFFTQLQVGLIQQWMVPTIQNSMKPFKDMDYSRIIE).
Additionally, Table 6 shows that the Egyptian fruit bat (Rousettus aegyptiacus)
and the Canadian beaver (Castor canadensis) lack the yellow motif, which is an
amino acid residue sequence (APDKDGDVGSGHWELRCHRLQDSLFSS) with no description.
Table 6
The length of motifs ranged and
amino acid residues.
No. |
Motif |
Size |
Description |
1 |
MQFGDREFYRDWWNSESVTYFWQNWNIPVHKWCJRHFYKPM |
41 |
MBOAT, membrane-bound
O-acyltransferase family |
2 |
RLLEMLFFTQLQVGLIQQWMVPTIQNSMKPFKDMDYSRIIE |
41 |
MBOAT, membrane-bound
O-acyltransferase family |
3 |
SNYRGILNWCVVMLILSNARLFLENLIKYGILVDPIQVVSLFLKDPYSWP |
50 |
No Description |
4 |
WAFTGMMAQIPLAWIVGRFFQGNYGNAAVWLTLIIGQPVAVLMYVHDYYV |
50 |
MBOAT, membrane-bound O-acyltransferase
family |
5 |
TVSYPDNLTYRDLYYFLFAPTLCYELNFPRSPRIRKRFLLR |
41 |
No Description |
6 |
VAAFQVEKRLAVGALTEQAGLLLHVANLATILCFPAAVALLVESITPVGS |
50 |
No Description |
7 |
LLKLAVPNHLIWLIFFYWLFHSCLNAVAE |
29 |
No Description |
8 |
SKWMARTGVFLASAFFHEYLVSIPLRMFR |
29 |
MBOAT, membrane-bound
O-acyltransferase family |
9 |
ALMVYTILFLKLFSYRDVNLWCRZRRAKA |
29 |
Cry35Ab1 HTH C-terminal
domain |
10 |
APDKDGDADVGSGHWELRCHRLQDSLFSS |
29 |
No Description |
Multi Hits Likelihood Test
A circus network was
built to investigate the shift in triplet codes on the amino acids, and the
multiple hits test was carried out using the Datamonkey web tool (Figure 8).
The findings demonstrated that while the triplet code has changed, the amino
acids in DGAT1 have remained conserved (Table 7). It led to the alteration of
the amino acids and polypeptide synthesis by changing 20 triplet codes into new
codes. Furthermore, aspartate's GAC code was the most frequently altered target
code to become other amino acids (Table 7). Moreover, GTC for valine had the
highest count of 16 to be changed into AAG and the highest count of 14 for GTC
to be transformed into CAG code. Therefore, it could be concluded that in DGAT1
protein, the aspartic acid and valine were the most sourced amino acids for
different other targeted amino acids.
Table 7
The list of DGAT1 amino
acids remains conserved and also changed due to changes in the triplet code.
No |
Source |
Target |
Source Amino Acid |
Target Amino Acid |
Count |
1 |
GAC |
CCA |
D |
P |
2 |
2 |
GAC |
CCG |
D |
P |
2 |
3 |
GAC |
CCT |
D |
P |
2 |
4 |
GAC |
CTG |
D |
L |
4 |
5 |
GAC |
ATG |
D |
M |
2 |
6 |
GAC |
CGG |
D |
R |
2 |
7 |
GAC |
ACA |
D |
T |
2 |
8 |
GAC |
TTG |
D |
L |
2 |
9 |
GAC |
YCT |
D |
S |
2 |
10 |
GAC |
TGG |
D |
W |
1 |
11 |
GTC |
CGA |
V |
R |
2 |
12 |
GTC |
AAG |
V |
K |
16 |
13 |
GTC |
ACT |
v |
T |
4 |
14 |
GTC |
AGA |
v |
R |
5 |
15 |
GTC |
AAA |
v |
K |
3 |
16 |
GTC |
CCG |
v |
P |
3 |
17 |
GTC |
CAG |
v |
Q |
14 |
18 |
GTC |
YGG |
v |
W |
8 |
19 |
GTC |
AAT |
v |
N |
5 |
20 |
GTC |
TAT |
v |
Y |
5 |
The network of Protein-Protein Interaction (PPI)
Further
information regarding the molecular
function of
DGAT1 could be obtained through the study of the protein-protein interaction
network (Rao et al., 2014). The PPI
linkage in this study consisted of 32 edges, or line networks connecting the
nodes, and 11 nodes, which represent proteins encoded with DGAT1. (Figure 9).
This suggests that DGAT1 can be connected to multiple proteins via the protein
interaction network. It showed that DGAT1 was networked with the other ten
essential genes that are co-expressed from the PPI network. PPAP2A, AGPAT2,
PLD1, SLC27A2, CD36, RPE65, DGAT2, MOGAT3, MOGAT1, and MOGAT2 are some of these
genes (Figure 9).
Phosphatidic
acid phosphatase 2a (PPAP2A) is one of these genes that catalyses the
hydrolysis and uptake of lipids from extracellular space by converting
phosphatidylcholine (PC) to phosphatidic acid and diacylglycerol (Goto et al., 2021). Additionally, the acylglycerol
O-acyltransferase family (DGAT2/MOGAT) includes the genes MOGAT1, MOGAT2, and
the related MOGAT3 genes. These genes are involved in the synthesis of
diacylglycerol (DAG) and triacylglycerol (TAG) from monoacylglycerol (MAG), and
their related pathways include metabolism and glycerolipid metabolism (Agarwal
et al., 2019). In biological
signalling pathways, all of these genes may be involved in the overexpression
of the DGAT1 gene.
Discussion
Given that they contain proteins with biological functions intended to protect the host, mammalian genes are among those that are developing the fastest (Madende & Osthoff, 2019). An attempt was undertaken to determine whether the places under positive selection had any specific functions that contributed to their evolution since these positions are probably correlated with sites of significant activity. For contemporary evolutionary research, identifying the selection in the mammalian genome provides a crucial research platform.
The DGAT1 gene's evolutionary changes among 37 mammalian species were the main focus of this investigation. which separated these 37 species into nine groups according to how similar they were. For example, the phylogenetic tree revealed that the genes of some species, such as P. anubis and C. sabaeus, had evolved closely together. Thus, in order to determine the sites of the positive selection codons, we performed an initial analysis of the phylogenetic tree. Farmanullah et al. (2020) identified the positive selection codon positions in the mammalian AKT3 gene using a phylogenetic tree; REL confirmed that these places comprise 32 legitimate selection sites. The DGAT1 gene in animals has undergone considerable evolutionary positive selection, according to the findings. This has three DGAT1 positive selection sites identified by REL and IFEL.
Additionally, REL analysis, which found positive selection values >20, was based on the Bayes factor. The amino acid substitutions under Bayes Empirical Bayes-based selection analysis demonstrated a positive selection of different amino acid positions in the MKRN3 gene by IFEL and REL models, according to Ahmad et al. (2019). Under positive selection, they found several sites with a Bayes factor > 20 and a p < 0.05. Consequently, REL may be a better model to identify the codon locations of DGAT1 that are subject to positive or negative selection.
Twenty locations that experienced episodic diversifying selection were identified by the MEME model based on the distribution of ? and ?. Of these, p-values < 0.05 were found for 28, 29, 36, 42, 70, 74, 211, 244, 257, 282, and 542 under episodic diversifying. wherein the DGAT1 gene's substantial positive selection at the 211 site was verified. According to Bartakova et al. (2021), the advantage of MEME is that it can enable each site to have its own selective history without requiring the branches that are being selected to be partitioned beforehand. Furthermore, Farmanullah et al. (2020) discovered 20 MEME-based positive selection sites in the AKT3 gene.
The mBIC identified 19 codons that had positive selection with dN-dS values > 1. In which, the positions of amino acids' evolutionary conservation is important for maintaining the protein structure and function. For instance, Onodera et al. (2019) reported that mammals with serine at 366 positions have more potential to control some cellular metabolism including biosynthesis and mitochondrial oxidation. Therefore, the detection of selected sites may enlighten the selection forces and detect the functionally significant sites for protein interaction. Also, MEC confirmed the positive selection differences on 11, 21, 191, 221, and 231 codons. Moreover, based on the similarities between sequences on the phylogenic relationship, there were many conserved amino acids even with the positive selection presence (da et al., 2021). The collective performance showed different stability after codon number 20 with a significant increase till codon 580. Barbour et al. (2013) mentioned that the differences in the cumulative behaviour between the nonsynonymous and synonymous distribution could indicate the differences in positive selection substitution among the mammalian species.
Furthermore, based on the similarities between sequences on the phylogenic relationship, there were highly conserved amino acids even in the presence of positive selection (da et al., 2021). The overall performance showed a discernible improvement from codon 20 and persisted until codon 580. Variations in synonymous and nonsynonymous distributions' cumulative behaviour could indicate differences in positive selection replacement among mammalian species, according to Barbour et al. (2013).
Additionally, the DGAT1 PPI linkage contained 11 nodes, indicating that other proteins are able to link with DGAT1 via the protein interaction network. In the human BMP15 gene, Auclair et al. (2013) found positive selection signals across 24 mammalian species. Because DGAT1 is upregulated, these genes might be connected to biological signalling networks. AGPAT2 and PLD1, for example, are important genes in the route leading to phospholipid production. The class B scavenger receptor family of cell surface proteins includes the integral membrane protein CD36 antigen, which is involved in the import of fatty acids within cells (Jay et al., 2020).
Furthermore, during phototransduction, RPE65, which is expressed in the retinal pigment epithelium, is in charge of converting all-trans-retinyl esters to 11-cis-retinol (Jakobiec et al., 2021). As a result, DGAT1 is an enzyme that accelerates the last stage of triglyceride synthesis in adipose tissue.
Conclusion
The evolutionary investigation of the DGAT1 gene in mammals has a significant potential to control the expression of the related gene in the lipid mechanisms of mammal cells. This can open the scientific field for biological studies and their relations in the medical field. The present work included different bioinformatics methods as a point of clarification and comparison. The major benefit of the used models is that they can identify which specific codons and codes that should focus on in studying the DGAT1 gene for its translation, esterification, or metabolic action. This study emphasized the great importance of the structural differences of DGAT1 between the mammal species on their different functional responses against immune diseases.
Conflict of Interests
Regarding the release of this paper, the authors affirm that they have no conflicts of interest.
Acknowledgements
The authors would like to thank the Guangxi Key Laboratory of Buffalo Genetics, Reproduction, and Breeding project, the Ministry of Agriculture's Research Platform Construction project, and the International Exchange and Cooperation in Science and Technology project (SNKF-2016-01) for their financial support.
References
- Fernström, I., & Johansson, B. (1976). Percutaneous Pyelolithotomy. Scandinavian Journal of Urology and Nephrology, 10(3), 257–259.
- Cracco, C. M., & Scoffone, C. M. (2020). Endoscopic combined intrarenal surgery (ECIRS) - Tips and tricks to improve outcomes: A systematic review. Turkish Journal of Urology, 46(Supp1), S46–S57.
- Jones, P., Pietropaolo, A., Chew, B. H., & Somani, B. K. (2021). Atlas of Scoring Systems, Grading Tools, and Nomograms in Endourology: A Comprehensive Overview from the TOWER Endourological Society Research Group. Journal of Endourology, 35(12), 1863– 1882.
- u, Y., Yuan, Y., Cai, Y., Li, X., Wan, S., & Xu, G. (2019). Use 3D printing technology to enhance stone-free rate in single tract percutaneous nephrolithotomy for the treatment of staghorn stones. Urolithiasis, 48(6), 509– 516.
- Inoue, T., Okada, S., Hamamoto, S., & Fujisawa, M. (2021). Retrograde intrarenal surgery: Past, present, and future. Investigative and Clinical Urology, 62(2), 121.
- Agbo, C. U. (2021). Open Surgery for Urinary Stones in a Resource Poor Setting: A Look at Dalhatu Araf Specialist Hospital, Lafia, Nigeria. Socié té Internationale D’urologie Journal, 2(2), 79–81.
- Sigdel, B., Shrestha, S., & Maskey, P. (2022). Predicting the outcome of mini percutaneous nephrolithotomy using STONE nephrolithometry score—a single- centre experience. Urolithiasis, 51(1),
- Deole, S., Ghagane, S. C., Patel, P., Nerli, R., Patil, S. M., & Dixit, N. S. (2020). The outcome of Percutaneous Nephrolithotomy in a Tertiary Care Center in North Karnataka. World Journal of Nephrology and Urology, 9(2), 35– 39.
- Ergani, B., Karabıçak, M., Türk, H., Yoldaş, M., İşoğlu, C. S., Süelözgen, T., Koç, G., & İlbey, Y. Ö. (2019). Does increased stone-skin distance due to obesity affect outcomes of percutaneous nephrolithotomy? J Urol Surg, 6(4), 283-8
- Tzou, D. T., Tailly, T. O., & Stern, K. L. (2023). Ultrasound-Guided PCNL — Why Are We Still Performing Exclusively Fluoroscopic Access? Current Urology Reports, 24(7), 335–343.
-
Ketsuwan, C., Pimpanit, N., Phengsalae, Y., Leenanupunth, C., Kongchareonsombat, W., & Sangkum, P. (2020).
Peri-Operative Factors Affecting Blood Transfusion Requirements During PCNL: A Retrospective Non-Randomized Study
Research and Reports in Urology, Volume 12, 279–285. - Grønkjær, M., Eliasen, M., Skov-Ettrup, L., Tolstrup, J. S., Christiansen, A. L., Mikkelsen, S. S., Becker, U., & Flensborg- Madsen, T. (2014). Preoperative Smoking Status and Postoperative Complications. Annals of Surgery, 259(1), 52– 71.
- Rosenbluth, E., Liaw, C. W., Bamberger, J. N., Omorogbe, A., Khusid, J. A., Khargi, R., Yaghoubian, A. J., Ricapito, A., Gallante, B., Atallah, W. M., & Gupta, M. (2023). The effects of continuing aspirin on blood loss and postoperative outcomes in percutaneous nephrolithotomy. PubMed, 11(1), 50–58.
- Bolton, D. M., & Hennessey, D. B. (2019). Exit Strategies After Percutaneous Nephrolithotomy. Smith's Textbook of Endourology, 427-440.
- ElSheemy, M. S., Elmarakbi, A. A., Hytham, M. A., Ibrahim, H., Khadgi, S., & Al-Kandari, A. M. (2018). Mini vs standard percutaneous nephrolithotomy for renal stones: a comparative study. Urolithiasis, 47(2), 207– 214.
Cite this article
-
APA : Rehman, B., Nasir, R., & Ikram, H. (2023). Evolutionary Study of DGAT1 Gene Indicates the Presence of Positive Selection in Mammals. Global Drug Design & Development Review, VIII(II), 48-62. https://doi.org/10.31703/gdddr.2023(VIII-II).07
-
CHICAGO : Rehman, Bushra, Rabiya Nasir, and Hira Ikram. 2023. "Evolutionary Study of DGAT1 Gene Indicates the Presence of Positive Selection in Mammals." Global Drug Design & Development Review, VIII (II): 48-62 doi: 10.31703/gdddr.2023(VIII-II).07
-
HARVARD : REHMAN, B., NASIR, R. & IKRAM, H. 2023. Evolutionary Study of DGAT1 Gene Indicates the Presence of Positive Selection in Mammals. Global Drug Design & Development Review, VIII, 48-62.
-
MHRA : Rehman, Bushra, Rabiya Nasir, and Hira Ikram. 2023. "Evolutionary Study of DGAT1 Gene Indicates the Presence of Positive Selection in Mammals." Global Drug Design & Development Review, VIII: 48-62
-
MLA : Rehman, Bushra, Rabiya Nasir, and Hira Ikram. "Evolutionary Study of DGAT1 Gene Indicates the Presence of Positive Selection in Mammals." Global Drug Design & Development Review, VIII.II (2023): 48-62 Print.
-
OXFORD : Rehman, Bushra, Nasir, Rabiya, and Ikram, Hira (2023), "Evolutionary Study of DGAT1 Gene Indicates the Presence of Positive Selection in Mammals", Global Drug Design & Development Review, VIII (II), 48-62
-
TURABIAN : Rehman, Bushra, Rabiya Nasir, and Hira Ikram. "Evolutionary Study of DGAT1 Gene Indicates the Presence of Positive Selection in Mammals." Global Drug Design & Development Review VIII, no. II (2023): 48-62. https://doi.org/10.31703/gdddr.2023(VIII-II).07