#################
##    DATA     ##  in data/ subdirectory
#################

1] 2 data.frames containing all probe set pairs with Q75 >= 55 for both the 
    Affymetrix and the custom-made CDF.
   Both contain a 'peCC' column which holds the pairs Pearson correlation 
    coefficient calculated on the Tissue dataset (orig from AtgenExpress data).
   The 'alSum' column contains the Q75 value of the alignment scores of X's 
    reporters to the transcript of Y.
   The 'meant2hit' column contains the gene locus the probe set was designed
    to target according to Affymetrix (extracted from ATH1-121501_annot.csv).

    AffysTissue.rda
    ===============
    data.frame with 42826 rows.
    > colnames(AffysTissue)
    [1] "pr"         "target"     "alSum"      "meant2hit"  "PSofTarget"
    [6] "peCC"      
    
    
    CustomsTissue.rda
    =================
    data.frame with 40094 rows.
    > colnames(CustomsTissue)
    [1] "pr"     "target" "alSum"  "peCC"  


2] Same as above but gene pairs that align to each other with BLAST in at 
    least one direction with an E-value smaller than 10^{-10} are omitted 

    AffysTissue.noBl.rda
    ====================
    data.frame with 11361 rows.
    
    CustomsTissue.noBl.rda
    ======================
    data.frame with 10702 rows.
    

3] Same as 1] but a column containing the metacorrelation coefficients added.
   Only those probe set pairs are included for which is this value =! NA (which
    is the case when the alignment scores of X's reporters to Y's transcript are 
    all identical -> no correlation coefficient can be calculated).
       
    AffysTissueMC.rda
    =================
    data.frame with 42815 rows.
    > colnames(AffysTissueMC)
    [1] "pr"         "target"     "alSum"      "meant2hit"  "PSofTarget"
    [6] "peCC"       "Mcor"      

    
    CustomsTissueMC.rda
    ===================
    data.frame with 39424 rows.
    > colnames(CustomsTissueMC)
    [1] "pr"     "target" "alSum"  "peCC"   "Mcor"  


4] Cross-hybridisation examples.   
   All three are S4 objects. To look at the data, must set class first:

   setClass("XhybExamples", representation(X = "character", Y = "character", 
   IVX = "numeric", IVY = "numeric", IVXi = "matrix", ai = "numeric"))

   The slots of these objects contain:
    X    : the name of probe set X
    Y    : the name of probe set Y    
    IVX  : the expression intensities of X in the Tissue dataset  
    IVY  : the expression intensities of Y in the Tissue dataset  
    IVXi : the expression intensities of X's reporters in the Tissue dataset   
    ai   : the alignment scores of X's reporters to Y's transcript sequence  

    ex1.rda
    =======
    X: "245875_at"
    Y: "AT3G28550"
   
    ex2.rda
    =======
    X: "250857_at"
    Y: "AT1G75180"

    ex3.rda
    =======
    X: "258508_at"
    Y: "AT3G06650"



####################
##    SCRIPTS     ##    in R/ subdirectory
####################

1]  An S4 method to plot S4 objects of 'XhybExamples' objects.
    This generates a window with 3 plots: 
      - The left one plots the summarised expression patterns of X and Y.
      - The middle plot shows the background corrected normalised intensity
        values of X's reporters. The colours used correspond to the alignment
        strength (ai) of the reporter to Y's transcript sequence.
      - The plot on the right plots the Pearson correlation coefficient of the
        reporters' expression patterns to Y's expression pattern in function
        of their alignment score, ai.
      

    plotExample.R
    =============
    
2] 'runSimulation' is a function that will run the simulation we ran for our study.
   By modifying the parameters, users can play with it themselves.

    runSimulation.R 
    ===============