Skip to main content

Main menu

  • Authors
  • Home
  • Content
    • Current Issue
    • Archive
    • Preview Papers
    • Focus Collections
    • Classics Collection
    • Upcoming Focus Issues
  • Submit a Manuscript
  • Advertisers
  • About
    • About the Journal
    • Editorial Board and Staff
  • Subscribers
  • Librarians
  • More
    • Alerts
    • Contact Us
  • Other Publications
    • Plant Physiology
    • The Plant Cell
    • Plant Direct
    • The Arabidopsis Book
    • Plant Cell Teaching Tools
    • ASPB
    • Plantae

User menu

  • My alerts
  • Log in

Search

  • Advanced search
Plant Physiology
  • Other Publications
    • Plant Physiology
    • The Plant Cell
    • Plant Direct
    • The Arabidopsis Book
    • Plant Cell Teaching Tools
    • ASPB
    • Plantae
  • My alerts
  • Log in
Plant Physiology

Advanced Search

  • Authors
  • Home
  • Content
    • Current Issue
    • Archive
    • Preview Papers
    • Focus Collections
    • Classics Collection
    • Upcoming Focus Issues
  • Submit a Manuscript
  • Advertisers
  • About
    • About the Journal
    • Editorial Board and Staff
  • Subscribers
  • Librarians
  • More
    • Alerts
    • Contact Us
  • Follow plantphysiol on Twitter
  • Visit plantphysiol on Facebook
  • Visit Plantae
Research ArticleBIOINFORMATICS
Open Access

Application of the Gini Correlation Coefficient to Infer Regulatory Relationships in Transcriptome Analysis

Chuang Ma, Xiangfeng Wang
Chuang Ma
School of Plant Sciences, University of Arizona, Tucson, Arizona 85721
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Xiangfeng Wang
School of Plant Sciences, University of Arizona, Tucson, Arizona 85721
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site

Published September 2012. DOI: https://doi.org/10.1104/pp.112.201962

  • Article
  • Figures & Data
  • Info & Metrics
  • PDF
Loading

Article Figures & Data

Figures

  • Additional Files
  • Figure 1.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 1.

    Assessment of the overall performances of the five correlation methods evaluated by ROC analyses. A to C, The ROC curves were plotted for the GCC, PCC, SCC, KCC, and BiWt using the data sets of TF-target (A), TF-cofactor (B), and cofactor-target (C) gene pairs with a 1:1 ratio of positive and negative samples. D, Box plot of AUC values derived from the ROC analysis repeated 2,000 times. cof, Cofactor.

  • Figure 2.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 2.

    The GCC can detect the regulatory relationships missed by the PCC, SCC, and BiWt methods. A, The GCC can detect a linear relationship with similar correlation values to the PCC and SCC correlations. B, The PCC failed to infer the relationship in the samples containing outliers, which was detected by the GCC, SCC, and BiWt. The five outlier samples are represented by the red circles in the gray region in seed. C, The GCC was able to identify transient interactions that were overlooked by the PCC, SCC, and BiWt. The expression values of TF and target are only correlated in nine samples in apex out of the 79 samples (red circles in the gray region). Two correlations, GCC1 and GCC2, are produced by the GCC reciprocally using rank and value information of the two genes’ expression data. In the last two columns, the expression data of genes sorted with their own rank information are displayed as black dashed curves, while the expression data of genes sorted with the other gene’s rank information are shown as blue and red solid curves. The Gini correlation can be explained as the difference between the solid and dashed curves weighted by the rank information. “Value” and “Rank” denote the value and the rank information of the gene expression data, respectively.

  • Figure 3.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 3.

    Assessment of the influences from the outlier data points on the five methods. The performances of the correlation methods were evaluated on the pairs TF-target, TF-cofactor, and cofactor-target gene sets, with influences of zero, one to five, six to 10, and more than 10 outliers by ROC analyses.

  • Figure 4.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 4.

    Assessment of the influences from sample size on the five methods. A and B, A significant linear relationship exists between the gene expression profiles of the TF-target (AGL-Hsp40) gene pair. C, The average correlation of the different correlation methods for 1,000 gene pairs with five to 75 samples randomly selected from the gene expression profiles of the AGL-Hsp40 gene pair. D, The differences between the correlations computed on simulated gene pairs (sCor) and the Pearson correlations on the real gene pairs (rCor), computed with the formula (sCor − rCor)/rCor. “Log2 value” denotes the log2-transformed value of the gene expression data.

  • Figure 5.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 5.

    The compatibility of the five correlation methods on RNA-Seq data. A, The kernel density estimation of the read counts and the RPKM values were generated from the Arabidopsis RNA-Seq data (Gene Expression Omnibus accession nos. GSM838184 and GSM764078). By using the RNA-Seq data without (B) and with (C) log transformation, the average correlation coefficients of each method were calculated from 2,000 random gene pairs with an expected correlation coefficient of 0.70 across the five to 100 simulated samples. “Ave correlation” represents the average correlation coefficients.

  • Figure 6.
    • Download figure
    • Open in new tab
    • Download powerpoint
    Figure 6.

    A screen shot of the GCC-based R package (rsgcc) for correlation and clustering analyses of gene expression data. The rsgcc was applied to cluster approximately 2,800 tissue-specifically expressed genes in maize RNA-Seq data. DAP, Days after pollination; Endo, endosperm; Post-em, postemergence; Pre-em, preemergence; ts-genes, tissue-specific genes.

Additional Files

  • Figures
  • Supplemental Data

    Supplemental Figures and Tables

    Files in this Data Supplement:

    • Supplemental Data - Supplemental Figures 1-5 and Supplemental Tables I-III
PreviousNext
Back to top

Table of Contents

Print
Download PDF
Article Alerts
Sign In to Email Alerts with your Email Address
Email Article

Thank you for your interest in spreading the word on Plant Physiology.

NOTE: We only request your email address so that the person you are recommending the page to knows that you wanted them to see it, and that it is not junk mail. We do not capture any email address.

Enter multiple addresses on separate lines or separate them with commas.
Application of the Gini Correlation Coefficient to Infer Regulatory Relationships in Transcriptome Analysis
(Your Name) has sent you a message from Plant Physiology
(Your Name) thought you would like to see the Plant Physiology web site.
Citation Tools
Application of the Gini Correlation Coefficient to Infer Regulatory Relationships in Transcriptome Analysis
Chuang Ma, Xiangfeng Wang
Plant Physiology Sep 2012, 160 (1) 192-203; DOI: 10.1104/pp.112.201962

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Request Permissions
Share
Application of the Gini Correlation Coefficient to Infer Regulatory Relationships in Transcriptome Analysis
Chuang Ma, Xiangfeng Wang
Plant Physiology Sep 2012, 160 (1) 192-203; DOI: 10.1104/pp.112.201962
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Tweet Widget
  • Facebook Like
  • Google Plus One

Jump to section

  • Article
    • Abstract
    • RESULTS
    • CONCLUSION
    • MATERIALS AND METHODS
    • Footnotes
    • REFERENCES
  • Figures & Data
  • Info & Metrics
  • PDF

In this issue

Plant Physiology: 160 (1)
Plant Physiology
Vol. 160, Issue 1
Sep 2012
  • Table of Contents
  • Table of Contents (PDF)
  • About the Cover
  • Index by author
  • Ed Board (PDF)
  • Front Matter (PDF)
View this article with LENS

More in this TOC Section

  • Systematic Prediction of cis-Regulatory Elements in the Chlamydomonas reinhardtii Genome Using Comparative Genomics
  • ANAP: An Integrated Knowledge Base for Arabidopsis Protein Interaction Network Analysis
Show more Bioinformatics

Similar Articles

Our Content

  • Home
  • Current Issue
  • Plant Physiology Preview
  • Archive
  • Focus Collections
  • Classic Collections
  • The Plant Cell
  • Plant Direct
  • Plantae
  • ASPB

For Authors

  • Instructions
  • Submit a Manuscript
  • Editorial Board and Staff
  • Policies
  • Recognizing our Authors

For Reviewers

  • Instructions
  • Journal Miles
  • Policies

Other Services

  • Permissions
  • Librarian resources
  • Advertise in our journals
  • Alerts
  • RSS Feeds

Copyright © 2019 by The American Society of Plant Biologists

Powered by HighWire