Genetic Variation Linked to Body Changes & Abnormal Lipids: Genetic analysis implicates resistin in HIV lipodystrophy
AIDS:Volume 22(13)20 August 2008p 1561-1568
Ranade, Koustubha; Geese, William Ja; Noor, Mustafaa; Flint, Olivera; Tebas, Pablob; Mulligan, Kathleenc; Powderly, Williamd; Grinspoon, Steven Ke; Dube, Michael Pf
aBristol-Myers Squibb R&D, Princeton, New Jersey, USA
bUniversity of Pennsylvania, Philadelphia, Pennsylvania, USA
cUniversity of California, San Francisco, California, USA
dUniversity College Dublin, Dublin, Ireland
eHarvard Medical School, Boston, Massachusetts, USA
fIndiana University School of Medicine, Indianapolis, Indiana, USA.
"By grouping patients with similar metabolic profiles after exposure to HAART using a powerful clustering approach, we identified a subgroup of patients that was especially vulnerable to the metabolic side effects caused by HAART. This subgroup of patients had slightly elevated lipids at baseline, which were made worse after exposure to HAART, suggesting that HAART exacerbated underlying predisposition to dyslipidemia and insulin resistance..... This high-risk subgroup of patients also experienced significant body composition changes, particularly limb fat loss.....Genetic variation in resistin is associated with metabolic complications caused by HAART.....a single nucleotide polymorphism in resistin, a gene previously implicated in obesity and insulin resistance, was associated with this high-risk group (P = 0.0003).....Additional work will be needed to understand how genetic variation in resistin might predispose individuals to HIV lipodystrophy. One possibility is that HIV-positive patients on HAART who are in a chronic inflammatory state  might have increased resistin, as resistin gene expression can be induced by the proinflammatory molecules lipopolysaccharide and TNF-α......Our work suggests a potential basis for managing lipodystrophy in high-risk individuals"
Objectives: To investigate the role of genetic variation in influencing the risk of metabolic complications associated with highly active antiretroviral therapy (HAART).
Methods: Cluster analysis of metabolic traits of 189 patients enrolled in ACTG5005s, the metabolic substudy of ACTG384, a clinical trial of HAART, was performed to identify a subgroup of individuals with increased risk of developing a cluster of metabolic abnormalities after exposure to HAART. Almost 300 single nucleotide polymorphisms in 135 candidate genes were evaluated for their association with this subgroup.
Results: A subgroup of patients was identified that had a normal metabolic profile at baseline but developed significantly elevated lipids and insulin resistance on HAART. This high-risk subgroup of patients also experienced significant body composition changes, particularly limb fat loss. Candidate gene analysis revealed that a single nucleotide polymorphism in resistin, a gene previously implicated in obesity and insulin resistance, was associated with this high-risk group (P = 0.0003).
Conclusion: Genetic variation in resistin is associated with metabolic complications caused by HAART.
Highly active antiretroviral therapy (HAART) regimens can be associated with multiple metabolic abnormalities, including elevated lipids, glucose and insulin resistance, which are accompanied by changes in body composition such as visceral obesity and limb fat loss. Typically these abnormalities cluster together in a syndrome called HIV lipodystrophy [1,2]. With the reduction in morbidity and mortality resulting from HAART, these metabolic side effects are of concern as they are well-known risk factors for future cardiovascular disease . The cause of HIV lipodystrophy is poorly understood, although some protease inhibitors may be more likely to cause metabolic abnormalities [2,4,5]. The genetic basis of HIV lipodystrophy is also unclear. Some studies [6-8] have implicated nucleotide variation in ApoCIII, the β3 adrenergic receptor or tumor necrosis factor (TNF)-α. However, these studies focused only on one trait (e.g. triglycerides or insulin resistance) and examined selected single nucleotide polymorphisms (SNPs) in a single gene.
In this study, we employed a new approach to analyze metabolic data collected in a clinical trial of HAART to identify a subgroup of patients that developed marked lipid elevations and insulin resistance after exposure to HAART. The clustering approach we used to identify this subgroup is a powerful method for finding patterns in multivariable data, and has been used successfully to detect subgroups of cancer patients based on gene expression profiles [9,10]. We reasoned that this approach was particularly suited to the metabolic complications caused by HAART because of the apparently syndromic nature of the side effects [1,2]. Having identified this subgroup of patients, we performed a comprehensive candidate gene analysis to implicate resistin, a gene previously associated with obesity and insulin resistance .
By grouping patients with similar metabolic profiles after exposure to HAART using a powerful clustering approach, we identified a subgroup of patients that was especially vulnerable to the metabolic side effects caused by HAART. This subgroup of patients had slightly elevated lipids at baseline, which were made worse after exposure to HAART, suggesting that HAART exacerbated underlying predisposition to dyslipidemia and insulin resistance.
As has been noted by others , 'the idea that complex data can be grouped into clusters or categories is central to our understanding of the world, and this structure arises in many diverse contexts.' Similar clustering approaches have been used widely in the analysis of whole genome expression arrays [9,10] to identify patterns in data that would not be apparent in univariate analysis. We believe such an approach is particularly suited to complex datasets with limited numbers of patients, such as the current study, to increase statistical power.
By performing a comprehensive candidate gene analysis, we implicated resistin in the susceptibility to HIV lipodystrophy. Resistin was cloned as an adipocyte hormone that links obesity and insulin resistance . Increased resistin resulted in mice with abnormal glucose tolerance . Conversely, resistin knockout mice had lower fasting blood glucose . SNP in resistin have also been variably associated with metabolic traits captured by the clusters described above including BMI, fat mass, insulin resistance and diabetes [27-31]. Engert et al.  examined SNP in the 5' flanking region of the resistin gene and found significant association between SNP rs1862513 and increased BMI in non-diabetic obese individuals. Although this particular SNP was not significantly associated with cluster membership in this study, two flanking SNP, rs321975 and rs3760678, which are 62 and 673 bp, respectively from rs1862513 were significantly associated with the high-risk cluster (Table 3). Wang et al.  demonstrated significant interaction between rs3219177 and BMI and insulin sensitivity, with the common CC genotype demonstrating greater mean insulin sensitivity. The association between this SNP and clusters, which show significant differences in mean insulin resistance is consistent with this observation (Tables 1 and 3). Sentinelli et al.  did not find significant association between one SNP that they tested and diabetes status in a small case-control study. We did not detect this SNP in our population. Conneely et al.  found significant association between SNP rs3219177 and increased weight in elderly non-diabetic individuals, an observation that is completely consistent with the highly significant association we found between the same SNP and high-risk cluster, which also had increased mean BMI (Table 1). Osawa et al.  found significant association between a SNP in the promoter, rs1862513, and diabetes status. As noted above, we did not find significant association with this particular SNP but two SNP in proximity to this one were significantly associated with cluster membership in this study. More recently, the same group also found significant association between this SNP and plasma resistin levels, which in turn, were correlated with C-reactive protein in a Japanese population . Taken together with the highly significant genetic association that we found, these genetic studies in non-HIV individuals support a strong link between genetic variation in resistin and the metabolic side effects observed in the high-risk cluster. Because this SNP is located in non-coding sequence, we hypothesize that it could influence expression of resistin, perhaps by affecting its splicing or transcription. Sequence comparison revealed that this SNP is located in a consensus binding site for the T-cell transcription factor, TCF7L2, which has been previously implicated in susceptibility to Type II diabetes .
Additional work will be needed to understand how genetic variation in resistin might predispose individuals to HIV lipodystrophy. One possibility is that HIV-positive patients on HAART who are in a chronic inflammatory state  might have increased resistin, as resistin gene expression can be induced by the proinflammatory molecules lipopolysaccharide and TNF-α [35,36]. Individuals carrying the associated SNP in resistin might be particularly vulnerable to such changes in resistin gene expression. Given the link between inflammation and resistin levels and genotypes, it will be important in future studies of patients treated with HAART to determine circulating levels of inflammatory molecules including TNF-α, its soluble receptor and C-reactive protein in addition to resistin.
Our work suggests a potential basis for managing lipodystrophy in high-risk individuals. As increased resistin is associated with obesity and insulin resistance, agents that reduce resistin including the marketed peroxisome proliferator-activated receptor agonists pioglitazone and rosiglitazone could potentially have a beneficial impact on HAART-induced lipodystrophy. In fact, consistent with this idea, rosiglitazone can reduce elevated circulating resistin levels in individuals with HIV lipodystrophy . Another study  did not find significant correlation between circulating serum resistin and lipodystrophy; possibly because the different diagnostic criteria used to define lipodystrophy in that study led to individuals with normal lipid levels being classified as lipodystrophic. It is also possible that the paracrine effects of resistin are crucial to HIV lipodystrophy, and circulating resistin levels as measured in plasma in these studies do not accurately capture resistin levels at the site of its action.
In conclusion, we have used a clustering approach to identify a subgroup of patients with increased susceptibility to the metabolic side effects cause by HAART. Genetic analysis revealed highly significant association between SNP in the resistin gene and this subgroup of patients. Additional studies with larger cohorts exposed to other HAART regimens will be needed to determine the general applicability of these findings.
Clustering of metabolic profiles to identify a subgroup of patients at increased risk of developing metabolic abnormalities after exposure to HAART
We analyzed 189 individuals, enrolled in a HAART clinical trial [13,14], who had longitudinal metabolic measurements, including lipids, glucose and insulin, at baseline and up to 64 weeks of treatment . We applied a clustering approach to the analysis of metabolic traits to identify a subgroup of patients with pronounced changes in their metabolic profiles after treatment with HAART. Because of the stronger phenotypic differences displayed by such subgroups, we reasoned that these subgroups might show greater contrast, and thus increase the likelihood of finding a genetic association [18,19]. In addition, because HIV-positive patients on HAART typically present with a cluster of metabolic abnormalities [1,2], it was logical to analyze these traits simultaneously as opposed to sequential genetic analysis of individual metabolic traits. Furthermore, this approach has the potential to reduce noise, as patients with multiple abnormalities are more likely to be truly affected than those with merely sporadic elevations of a single metabolic trait. Finally, as we analyzed all the metabolic traits simultaneously in this multivariable analytic approach, we reduced the number of statistical tests by an order of magnitude, thereby significantly increasing the power of the study, an important consideration given the limited number of samples that were available for genetic analysis.
For each patient, we created a metabolic profile comprised of BMI, total, LDL, HDL and non-HDL cholesterol, triglycerides, glucose and insulin resistance by HOMA-IR, all measured after 32 weeks of HAART. We selected this time point because metabolic abnormalities become apparent at this level of exposure and because later time points had significant numbers of missing values. We then clustered these metabolic profiles such that individuals with similar metabolic profiles were grouped together. This process was repeated iteratively until all individuals were clustered into two groups. This approach is analogous to the widely used method of clustering gene expression profiles to identify subgroups of tumors or cell types [9,10].
Examination of mean values of metabolic traits led to labeling one cluster 'normal' (n = 142) and the other 'high risk' (n = 47; Table 1). At baseline, lipids and insulin resistance index by HOMA-IR were comparable between the two clusters, although the high-risk group was a little heavier and had higher mean lipid levels. After 32 weeks of HAART, the normal cluster (n = 142) continued to exhibit normal mean total cholesterol (180 ± 29 mg/dl), LDL (109 ± 28 mg/dl), triglycerides (142 ± 77 mg/dl), glucose (90 ± 9 mg/dl) and insulin resistance by HOMA-IR (1.5 ± 0.9). Relative to this subgroup, the high-risk cluster (n = 47) experienced significant increases in mean total cholesterol (261 ± 45 mg/dl, P < 0.001), LDL (178 ± 30 mg/dl, P < 0.001), triglycerides (310 ± 236 mg/dl, P < 0.001), glucose (102 ± 23 mg/dl, P < 0.001) and insulin resistance by HOMA-IR (4.3 ± 5.6, P < 0.001). These mean lipid levels and HOMA-IR are considered clinically relevant hyperlipidemia and insulin resistance, respectively . Both clusters experienced only modest and statistically insignificant increases in mean HDL levels after exposure to HAART.
Body composition measurements as determined by DEXA were available for a subset (n = 76) of the patients evaluated above. DEXA measurements were made at 16-week intervals up to 64 weeks because these time points were consistent with patient visits planned in the trial. Moreover, at the time of this clinical trial, prospective data about DEXA measurements and knowledge of how these measurements change over time were unavailable. Since then however, another study  using only 40 individuals found significant changes in both lean and fat masses as measured by DEXA at 12-week intervals, thereby providing reassurance that the changes in DEXA measurements we note below are clinically relevant and unlikely to be simply fluctuations in DEXA values.
Striking differences in body composition over 64 weeks of HAART (Fig. 1a-d) were observed between the normal (n = 54) and high-risk (n = 22) clusters described above. The high-risk group had higher mean total and trunk fat at baseline (Fig. 1a and b), and both groups gained a little over 1 kg of fat up to 32 weeks. Beyond 32 weeks however, the normal cluster stabilized, whereas the high-risk cluster experienced reductions of approximately 5 and 2.5 kg in mean total (P for interaction with cluster 0.01) and trunk fat (P for interaction with cluster 0.1), respectively. Changes in limb fat over time were even more striking (Fig. 1c). Both clusters had similar mean limb fat levels at baseline, and gained approximately 0.5 kg up to 32 weeks. The high-risk group then experienced a precipitous decline in mean limb fat or lipoatrophy of approximately 2.5 kg, but the normal cluster lost only approximately 0.5 kg of mean limb fat more gradually and returned to baseline by 64 weeks (P for interaction with cluster 0.001). The high-risk cluster had greater mean lean mass at baseline than the normal cluster. This mean lean mass increased by approximately 4 kg up to 64 weeks of HAART. In contrast, the normal cluster experienced little gain (0.5 kg) in mean lean mass for the duration of the study (P for interaction with cluster 0.009). This study does not address the mechanistic basis of this interesting difference in lean mass, and future studies will be needed to address this issue.
Taken together, these results demonstrate that the clustering approach successfully classified patients into subgroups that displayed markedly different and clinically relevant metabolic and body composition responses to HAART.
Genetic association with cluster membership
We evaluated age, sex, race, HIV disease severity (baseline HIV copy number and CD4 cell count) and individual drug treatment arm as potential predictors of high-risk cluster membership. None was significantly predictive (Table 2). Hepatitis C coinfection status or plasma markers of inflammation were not available for most patients, so were not evaluated as potential predictors.
We next examined whether nucleotide variation in candidate genes was associated with cluster membership. We genotyped these 189 individuals for 299 SNP in 135 candidate genes (Supplementary Table 1). We selected genes on the basis of their likely involvement in regulating lipid and glucose metabolism, cytokines, drug metabolizing enzymes and transcription factors that regulate expression of these genes. We also selected genes whose expression in cell culture was significantly perturbed after exposure to protease inhibitors . To reduce genotyping effort and because the HapMap project  was not complete when this work was initiated, we concentrated our attention on SNP that are more likely to affect the activity (e.g. coding sequence changes) or expression (promoter or near splice sites) of the gene .
A SNP (dbSNP ID rs3219177) 39 bp downstream of the second exon of the resistin gene was highly significantly associated with cluster membership (Table 3). The frequency of this SNP was 0.16 in the normal cluster and 0.33 in the high-risk group (P = 0.0003; P = 0.04 after adjusting for 135 genes that were tested). Heterozygotes and homozygotes were three [95% confidence interval (CI) 1.3-5.3] and 19 (95% CI 2-183) times more likely than wild type to be classified in the high-risk cluster. The association was consistently observed in whites (P = 0.002) and blacks (P = 0.049), but because of the small number of Hispanics (n = 27), the association was not significant in this race (Table 4); nonetheless heterozygotes were at consistently increased risk in whites (odds ratio, OR = 2.3), blacks (OR = 3.3) and Hispanics (OR = 2.3) and there was no evidence of heterogeneity among the three races (P = 0.9). As noted below in the Discussion, resistin is a strong candidate for the metabolic changes observed in the high-risk cluster. For this reason, we decided to further investigate genetic variation at this locus.
To examine whether other SNPs in the resistin gene were also associated with cluster membership, we determined the sequence of the entire resistin gene in all 47 individuals classified as high-risk and, for comparison, up to 32 individuals from a human diversity panel. In addition, from dbSNP, we selected SNP that are located in phylogenetically conserved regions in the vicinity of the resistin gene. We genotyped the entire cohort for eight SNPs in resistin, including four newly identified by sequencing and two from phylogenetically conserved regions. Although three other SNP were also associated with cluster membership (P < 0.05), none was as significantly associated as the SNP in intron 2 described above (Table 3). Furthermore, haplotype analysis revealed that only haplotypes bearing the intron 2 SNP, rs3219177, were significantly associated with cluster membership (data not shown). Taken together these results indicate that this SNP is potentially causative.
Individuals, genotyping and sequencing
One hundred eighty-nine individuals who participated in ACTG5005s  and who provided written informed consent for genetic analysis were analyzed in this study. Briefly, drug naive HIV-positive patients were randomized to a backbone of nucleoside reverse transcriptase inhibitors (NRTI) comprised of didanosine and stavudine or zidovudine and lamivudine [13,14]. In addition, individuals were randomized to receive a protease inhibitor (nelfinavir), a nonnucleoside reverse transcriptase inhibitor (NNRTI) (efavirenz) or the combination of both. Metabolic traits were measured at baseline and then at 16-week intervals up to 64 weeks. Body fat was measured in all consenting patients using dual-energy X-ray absorptiometry (DEXA) at baseline and every 16 weeks on HAART. Genotyping was done using the TaqMan method . The sequence of the entire resistin gene was determined in 47 high-risk individuals, and for comparison, up to 32 individuals from the diversity panel (M44PDR, Coriell repository) to identify SNP.
Insulin resistance by homeostasis-model assessment (HOMA-IR) was calculated using the formula: fasting insulin (μU/ml) x fasting glucose (mmol/l)/22.5. Traits measured after 32 weeks of HAART were standardized to have a mean of 0 and variance of 1 and then used in the clustering. Traits measured at 32 weeks were selected for clustering because metabolic side effects become apparent by this time point and because later time points had substantial numbers of missing values. For each individual, a profile was made of body mass index (BMI), total, low-density lipoprotein (LDL), high-density lipoprotein (HDL) and non-HDL cholesterol, triglyceride, glucose, and HOMA-IR. These profiles were clustered together using 'two-step' clustering  as implemented in SPSS version 12 (SPSS Inc., Chicago, Illinois, USA). In this clustering method, data are first grouped into subclusters using agglomerative clustering and then cluster assignment is refined to determine the optimal number of clusters. The optimal number of clusters was determined in two steps. First, Bayesian information criterion was calculated for the specified number of clusters to obtain an initial estimate of cluster number. Second, this estimate was refined by finding the greatest change in distance between two clusters in each stage of hierarchical clustering. The distance between two clusters was defined as the decrease in log-likelihood resulting from the two clusters being combined into a single cluster. Up to 20 clusters were permitted, but the algorithm repeatedly determined that a two-cluster solution was optimal. To assess the stability of the cluster solution, multiple runs of clustering (n = 20) were performed using randomly sorted data. All runs produced clusters identical to those described in this study. Differences in means between clusters for metabolic traits and changes in body fat over time were evaluated using Kruskal-Wallis and repeated measures analysis of variance (ANOVA), respectively. Total, trunk and limb fat values were natural log transformed to approximate normality and then used in the repeated measures ANOVA. Genetic association between SNP and clusters was assessed using Fisher's exact test. Linkage disequilibrium between SNP and haplotypes were analyzed using Haploview .