NCBI Word Ingestion App Standalone local prototype for DOCX ingestion review
Stakeholder review

Document review and submission decision

This page shows what the system understood from the submitted DOCX file, what it could normalize, and whether the document currently passes review, needs warning-level follow-up, or should be rejected and corrected in Word first.

Document selection

Open another reviewed document

You are currently reviewing document #137. Use this form only if you want to jump to a different review.

Review decision

Warning

This document parsed successfully, but it still needs review. 1 warning detected.

Document metadata

datareporting

ID
137
Parsed title
Basic Guidelines for Reporting Non-Clinical Data
Template profile
NCBI Bookshelf Template
Strict template mode
Yes
Source file
datareporting.docx
Stored path
docx/137/master.docx
Status
parsed
Notes
Parsed 52 headings, 450 blocks, 53 references, 65 citations, 0 citation issues, 1 preflight issues.
Uploaded
2026-06-23 13:29:30
Extracted front matter Metadata candidates
Parsed summary

Block counts

Headings
52
Paragraphs
351
Tables
14
References
53
Citations
65
Matched citations
65
Open issues
0
Blocking pre-flight issues
0
Warning pre-flight issues
1
Validation status

Pre-flight decision

Warning

Blocking issues: 0 | warnings: 1 | total: 1

Pre-flight issues Acceptance and rejection guidance
warning missing_keywords

No keywords metadata group was detected.

If keywords are required for this content type, add them using the expected Word style so they can be promoted into canonical metadata.

Structured blocks Heading, paragraph, and table order

Use this section to confirm heading fidelity, spot spacing artifacts, inspect first-class table content, and understand how nearby paragraphs and tables were sequenced during parsing. Matched in-text citations now jump to their reference entries, while unresolved citations route to open citation issues.

Matched and externally validated or skipped Needs review Failed match
paragraph Order 1 Style Comment word/document.xml:/w:document[1]/w:body[1]/w:p[1]

Use this form to tell us important information about this document, then start the text on the following page. All information you give in this form will appear in the document, or affect the way it is handled online.

paragraph Order 4 Style Comment word/document.xml:/w:document[1]/w:body[1]/w:p[3]

Note: for “Document Type”, choose only one of the following: appendix, chapter, dedication, foreword, front-matter-part, glossary, preface, ref-list, or section.

paragraph Order 7 Style Comment word/document.xml:/w:document[1]/w:body[1]/w:p[5]

Blank or “Yes” is the default value, indicating these features will apply to the document.

paragraph Order 10 Style Comment word/document.xml:/w:document[1]/w:body[1]/w:p[7]

Use one row for each author. List authors in order of appearance in the document. Add rows to add more authors. For institutional author, enter the name of the institution in the Given Name(s) column.

paragraph Order 13 Style Comment word/document.xml:/w:document[1]/w:body[1]/w:p[9]

Use one row for each affiliation. Link affiliations by label to the respective author. Add rows to add more affiliations.

heading Order 23 Level 1 Style Heading1 word/document.xml:/w:document[1]/w:body[1]/w:p[19]

H1. Basic Guidelines for Reporting Non-Clinical Data

paragraph Order 26 Style AbstractHeader word/document.xml:/w:document[1]/w:body[1]/w:p[22]

Abstract

paragraph Order 27 Style Abstract word/document.xml:/w:document[1]/w:body[1]/w:p[23]

Reporting experimental and assay results can occur in many different settings, including informal laboratory meetings, technical reports, collaborative interactions, updates to management groups and presentations at professional conferences. In order to convey the intended message and make a lasting impact, the data presented must be clear to the observer or reader. Understanding key concepts and methods for reporting data is also critical to preserve scientific findings. This chapter describes some basic guidelines for reporting non-clinical data with an emphasis on standard elements of graphs and tables and the use of these tools to describe data most appropriately. Several fundamental statistical and numerical descriptions such as significant digits, replicates, error and correlations are also included, as they constitute an integral part of communicating results. These guidelines form the foundation for non-clinical data reporting mechanisms, such as laboratory notebooks and reports. While these guidelines are general in nature and may not be inclusive of the requirements for publication within specific journals, they should provide a solid basis for reporting non-clinical data, independent of the presentation venue.

heading Order 30 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[26]

H2. Abbreviations

table Order 31 word/document.xml:/w:document[1]/w:body[1]/w:tbl[5]

Table.

AGM

Assay Guidance Manual

AUC

area under the curve

CRC

concentration-response curve

CV

Coefficient of Variation

EC50

half-maximal effective concentration (relative or absolute, see AGM chapters on Assay Operations and Glossary for further definitions) (1-3)

HTS

high-throughput screening

LLOQ

lower limit of quantitation

Log

Log base 10 or Log10

LsA

limits of agreement

MR

mean ratio

n

number of replicates

pEC50

negative log EC50

PK

pharmacokinetics

PMCC

product moment correlation coefficient

r

Pearson’s correlation coefficient (equivalent to linear correlation coefficient)

ρ

correlation coefficient

SD

standard deviation

SE

standard error

SEM

standard error of the mean

ULOQ

upper limit of quantitation

Table footprint: 20 rows, 40 cells

heading Order 34 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[29]

H2. Overview

paragraph Order 35 word/document.xml:/w:document[1]/w:body[1]/w:p[30]

Representation of data in graphs and tables is a key part of any scientific experiment. Creating appropriate figures requires familiarity with constructing them as well as knowledge of the data. The ability to convey results with clarity and accuracy does not require special skills but following some common guidelines can be very helpful in describing the data. This chapter outlines steps to make figures an effective communication tool for the scientist with a focus on assay development and high-throughput screening (HTS) applications. Certain figure types, such as flow charts and process diagrams, are not discussed in this chapter.

paragraph Order 37 word/document.xml:/w:document[1]/w:body[1]/w:p[32]

When considering graphs or tables for publications, grants, regulatory reports, etc. consult with the appropriate journals or technical documentation available for any specific requirements.

paragraph Order 39 word/document.xml:/w:document[1]/w:body[1]/w:p[34]

The main types of graphs for summarizing scientific data that are used for assay development or the HTS field include bar graphs, line graphs, scatter plots, frequency distributions, scatter box plots and heat maps. Other graph types such as spider or radar plots, pie charts, Pareto diagrams, and area charts are used less frequently and are not specifically discussed in this chapter.

paragraph Order 41 word/document.xml:/w:document[1]/w:body[1]/w:p[36]

Allocating most of the “graph ink” to data and minimizing extraneous ”chart junk” should be the goal of any effective graph (4). With that in mind, the main elements of many graphs include a title, axis scale with tick marks, axes labels, data, and a symbol key. When the graph is to be used in a printed format, a caption or legend and footnotes may be added. These graph components, and how to make better graphs, have been described in the literature (5-9). In addition, several books are often cited when describing graphing methods (4,10). At least one paper offers a five principal tutorial on how to visualize data (11). This chapter highlights some of the key graphing basics and considers graphs that may be used during everyday informal presentations of non-clinical data. In addition, tables are also briefly discussed.

paragraph Order 43 word/document.xml:/w:document[1]/w:body[1]/w:p[38]

Proper statistical treatment of the data is essential and consulting with a statistician familiar in assay development and HTS applications is highly recommended. Otherwise it is the author’s responsibility to ensure that the statistical methods support the types of conclusions being drawn and that the statistical methods are appropriate to the data based upon data type, distribution (e.g. normal vs. log-normal), and study design. More details are available in the statistics chapters of the Assay Guidance Manual (12).

paragraph Order 45 word/document.xml:/w:document[1]/w:body[1]/w:p[40]

Before creating graphs and tables, one should understand the numerical and statistical concepts described below. These concepts have a pivotal role in conveying data efficiently and effectively.

heading Order 48 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[43]

H2. Rounding to Decimal Places

paragraph Order 49 word/document.xml:/w:document[1]/w:body[1]/w:p[44]

During the collection and manipulation of primary data, data should not be rounded until the end, when one wishes to summarize and report the findings in a table, etc. When rounding to decimal places, numbers are rounded by removing digits from the rightmost, farthest side, or the value which falsely suggests a high degree of precision. When the digit to be removed (immediately to the right of the rounding digit) is 0, 1, 2, 3 or 4, the rounding digit is “rounded down”. Similarly, when the digit to be removed is 5, 6, 7, 8 or 9, the preceding rounding digit is “rounded up”. When eliminating multiple digits from the right of a rounded digit, round from the rightmost digit to the left, ensuring to capture any influence rounding carryover from preceding digits. See Table 1 for rounding examples which include these carryover conventions.

Immediately before a table block

table Order 51 word/document.xml:/w:document[1]/w:body[1]/w:tbl[6]

Table caption. Table 1. Rounding examples from thousands to three decimal places

Rounding the Whole Number to:

Rounding the Decimal Number to:

Result

Thousand

Hundreds

Tens

Ones

Tenths

Hundredths

Thousandth

1234.1234

1000

1200

1230

1234

1234.1

1234.12

1234.123

2345.2345

2000

2300

2340

2345

2345.2

2345.24

2345.235

3456.3456

3000

3500

3460

3456

3456.4

3456.35

3456.346

4567.4567

5000

4600

4570

4567

4567.5

4567.46

4567.457

5678.5678

6000

5700

5780

5679

5678.6

5678.57

5678.568

Table footprint: 7 rows, 51 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

Precedes a paragraph block

paragraph Order 53 word/document.xml:/w:document[1]/w:body[1]/w:p[47]

Although the authors subscribe to the common rounding rule which “rounds down” for digits less than 5 and “rounds up” for digits greater than 5, there are alternative “odd-even” rounding strategies for summarized values that have been published (13).

Immediately after a table block

heading Order 56 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[50]

H2. Rounding to Significant Digits

paragraph Order 57 word/document.xml:/w:document[1]/w:body[1]/w:p[51]

The number of significant digits or figures that are used to display a value is distinct from the number of decimal places that are used when expressing numbers. Demonstrating numerical expression consistency in tables, figures, legends, etc. is important when describing results and aids in the reporting and understanding of variability.

paragraph Order 59 word/document.xml:/w:document[1]/w:body[1]/w:p[53]

The following basic rules are used for rounding with significant digits:

list Order 60 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[54]
  • Non-zero digits are always significant
list Order 61 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[55]
  • A zero or zeros between two significant digits are significant
list Order 62 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[56]
  • Final or trailing zeros in a number with no decimal are not significant (e.g. 1020)
list Order 63 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[57]
  • Leading zeros in the decimal portion of a number are not significant; whereas final trailing zeros in the decimal portion of a number are significant
paragraph Order 65 word/document.xml:/w:document[1]/w:body[1]/w:p[59]

Expressing numbers using scientific notation can provide a useful method to demonstrate the number of significant digits associated with a value. As shown in Table 2, the defined digits which are located before the exponent part of the scientific notation expression are significant:

Immediately before a table block

table Order 67 word/document.xml:/w:document[1]/w:body[1]/w:tbl[7]

Table caption. Table 2. Three significant digits for numbers viewed with scientific notation

Number

Scientific Notation

# Significant Digits

Significant Digits

1020

1.02 x 103

3

1,0,2

102

1.02 x 102

3

1,0,2

10.2

1.02 x 101

3

1,0,2

1.02

1.02 x 100

3

1,0,2

0.102

1.02 x 10-1

3

1,0,2

0.0102

1.02 x 10-2

3

1,0,2

0.00102

1.02 x 10-3

3

1,0,2

Table footprint: 8 rows, 32 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

Precedes a paragraph block

paragraph Order 69 word/document.xml:/w:document[1]/w:body[1]/w:p[62]

As shown in Table 3, the number of significant digits for result values of different magnitudes (from 1 to 4), is depicted.

Immediately before a table block

Immediately after a table block

table Order 71 word/document.xml:/w:document[1]/w:body[1]/w:tbl[8]

Table caption. Table 3. The number of significant digits from one to four for several examples.

Result with indicated # of Significant Digits

Result

Scientific Notation

1

2

3

4

0.1234

1.234 x 10-1

0.1

0.12

0.123

0.1234

1.234

1.234 x 100

1

1.2

1.23

1.234

12.34

1.234 x 101

10

12

12.3

12.34

123.4

1.234 x 102

100

120

123

123.4

1234

1.234 x 103

1000

1200

1230

1234

Table footprint: 7 rows, 39 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

heading Order 74 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[66]

H2. Presenting Rounded Data in Tables

paragraph Order 75 word/document.xml:/w:document[1]/w:body[1]/w:p[67]

When preparing summarized data tables, all result values should be expressed with the same number of significant digits. Once the desired number of significant digits has been established for the result value, use the same number of decimal places in the variability measurement (e.g. SD, SEM) (Table 4, Data Set 3).

paragraph Order 77 word/document.xml:/w:document[1]/w:body[1]/w:p[69]

Consider the summary presented in Table 4 that represents typical data generated for compound testing results and the associated error. Each data set in the table is discussed below.

paragraph Order 79 word/document.xml:/w:document[1]/w:body[1]/w:p[71]

Data Set 1: Each Result Value and the paired statistical measurement (SEM) are expressed with the same number of decimal places, in this case three. Note that the table is cumbersome when values range several magnitudes and the number of significant digits is variable.

paragraph Order 81 word/document.xml:/w:document[1]/w:body[1]/w:p[73]

Data Set 2: All Result Values have the same number of decimal places (three) with the paired statistical measurement (SEM) also having the same number of decimal places (three). Once again, the number of significant digits expressed for the result value is variable with anywhere from 2-7 significant digits. Summary tables with values and error like those shown are often seen when results are obtained from a database query with a set number of decimal places for the results.

paragraph Order 83 word/document.xml:/w:document[1]/w:body[1]/w:p[75]

Data Set 3: All Result Values are shown with the same number of significant digits (three in this example). The paired statistical measurement (SEM) has the same number of decimal places as the result value. This is the preferred method of expression for results and error. If results are obtained from a database query, there may be some manipulation required with either the result value or the error.

Immediately before a table block

table Order 85 word/document.xml:/w:document[1]/w:body[1]/w:tbl[9]

Table caption. Table 4. Expression of Result Values and Error in a Data Summary Table

Data Set 1

Data Set 2

Data Set 3 (preferred)

Each Result Value and paired SEM have the same number of decimal places

All Result Values and all paired SEM have the same number of decimal places (three)

Same number of Significant Digits for all Result Values; same number of decimal places for paired SEM

Compound

Result Value

SEM

Result Value

SEM

Result Value

SEM

1

0.0410

0.0067

0.041

0.007

0.0410

0.0067

2

2114.4

101.6

2114.437

101.642

2110

102

3

2635

238

2635.146

238.178

2640

238

4

389.3

35.2

389.358

35.261

389

35

5

0.188

0.008

0.188

0.008

0.188

0.008

6

1430.5

100.2

1430.561

100.263

1430

100

Table footprint: 9 rows, 57 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

Precedes a paragraph block

paragraph Order 88 word/document.xml:/w:document[1]/w:body[1]/w:p[79]

It is common practice to express assay results in summarized tables to two or three significant digits, but this can depend on the actual or perceived error associated with the process used in generating the result values. These error statistics (SD and SEM) are discussed further in a subsequent section.

Immediately after a table block

paragraph Order 90 word/document.xml:/w:document[1]/w:body[1]/w:p[81]

Finally, expressing results using negative log transformations (e.g. pIC50, pKi, etc.) provide consistency in keeping all result values to the same number of significant digits. The concept of using negative log-transformed result values is discussed later in this chapter.

heading Order 93 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[84]

H2. Logarithms

paragraph Order 94 word/document.xml:/w:document[1]/w:body[1]/w:p[85]

The logarithm (or log) of a number is the exponent that indicates the power to which another number (the base) is raised to produce the initial number. The “common” log or base-10 log is essentially the only one used in assay and screening applications and will be discussed exclusively in this chapter. The log base-10 (Log10) of a number (n) can be described as shown in the equation below:

paragraph Order 97 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[88]

Equation 1. Base 10 Logarithm

paragraph Order 98 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[89]

log10n= a

paragraph Order 100 word/document.xml:/w:document[1]/w:body[1]/w:p[91]

where 10 is the base and a is the exponent or power to which the base is raised. For example, the base 10 log of 1000 is 3, since 10 raised to the power of 3 (or 103) is 1000. When the base is not indicated it means log base 10, by convention. Table 5 shows some log10 values of several numbers.

Immediately before a table block

table Order 102 word/document.xml:/w:document[1]/w:body[1]/w:tbl[10]

Table caption. Table 5. Log10 values of several numbers

Value

Log10

1

0

10

1

100

2

1000

3

842

2.93

35

1.54

0.1

-1

0.01

-2

0.001

-3

Table footprint: 10 rows, 20 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

Precedes a paragraph block

paragraph Order 105 word/document.xml:/w:document[1]/w:body[1]/w:p[95]

Note that the log can be positive, negative or zero. However, log 0 is undefined since no number raised to the power of another number results in zero.

Immediately after a table block

paragraph Order 107 word/document.xml:/w:document[1]/w:body[1]/w:p[97]

The antilog is the inverse log function. For instance, the antilog of -2 is 10-2 or 0.01. Software programs such as Microsoft Excel and others easily calculate log and antilog values using built in functions.

paragraph Order 109 word/document.xml:/w:document[1]/w:body[1]/w:p[99]

Additional information regarding the use of log values (geomean, pEC50, etc.) can be found elsewhere within this chapter. In addition, the rounding rules described above apply to log values as well.

heading Order 111 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[101]

H2. Statistical Descriptors/Metrics

heading Order 112 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[102]

H3. Arithmetic Mean

paragraph Order 113 word/document.xml:/w:document[1]/w:body[1]/w:p[103]

The mean (referred to as arithmetic mean) is the average of all results. To calculate the mean, add up all the result values in the data set and divide the sum by the number of values (n).

heading Order 116 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[106]

H3. Geometric Mean

paragraph Order 118 word/document.xml:/w:document[1]/w:body[1]/w:p[107]

Use the Geometric Mean or Geomean when averaging data that have been calculated from log-normal values. The most common result types to which this often applies are potencies, affinities or inhibition constants (e.g. EC50, IC50, Ki, etc.) that have been determined from concentration response curves. The equation to calculate the Geomean for an EC50 is shown below:

paragraph Order 120 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[109]

Equation 2: Geometric Mean

paragraph Order 121 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[110]

Geometric Mean=10Log EC501+Log EC502+…Log EC50nn

heading Order 124 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[113]

H3. Median

paragraph Order 125 word/document.xml:/w:document[1]/w:body[1]/w:p[114]

The median is the middle value of all results in a ranked list. Half of the numbers in the data set will be above the median and half the numbers will be below the median. To calculate the median, rank order the result values in the data set (in ascending order) and determine the middle value. When there is an odd number of results, the median is the middle value. When there is an even number of results, the median is the average between the two middle values.

heading Order 128 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[117]

H3. Examples for Arithmetic Mean, Geometric Mean and Median

paragraph Order 129 word/document.xml:/w:document[1]/w:body[1]/w:p[118]

The following examples serve to demonstrate the difference between and use of arithmetic and geometric means as well as median values.

paragraph Order 131 word/document.xml:/w:document[1]/w:body[1]/w:p[120]

The arithmetic mean and the median for three different sets of numbers are compared in Table 6:

Immediately before a table block

table Order 133 word/document.xml:/w:document[1]/w:body[1]/w:tbl[11]

Table caption. Table 6. Arithmetic mean and median for three sets of numbers.

Result Values

Number of Results (n)

Arithmetic Mean

Median

18

3

25

26

26

31

18

4

27.5

28.5

26

31

35

15

5

27

31

18

31

35

36

Table footprint: 13 rows, 52 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

Precedes a paragraph block

paragraph Order 136 word/document.xml:/w:document[1]/w:body[1]/w:p[124]

A key concept is that the mean of a data set can be influenced by outliers, whereas the median is relatively resistant to outliers and is a more robust statistic when several values exist. Consider the example shown in Table 7 where a few values cause the elevated mean.

Immediately before a table block

Immediately after a table block

table Order 138 word/document.xml:/w:document[1]/w:body[1]/w:tbl[12]

Table caption. Table 7. Effect of an outlier on the arithmetic mean and median.

Result Values

Number of Results (n)

Arithmetic Mean

Median

16

7

53

22

18

20

22

45

72

180

Table footprint: 8 rows, 32 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

Precedes a paragraph block

paragraph Order 141 word/document.xml:/w:document[1]/w:body[1]/w:p[128]

Note: many of the parameters discussed in the AGM chapter on Assay Operations for SAR Support (1) refer to using the median instead of the mean.

Immediately after a table block

paragraph Order 143 word/document.xml:/w:document[1]/w:body[1]/w:p[130]

Table 8 shows the geometric and arithmetic mean for several different sets of result values:

Immediately before a table block

table Order 145 word/document.xml:/w:document[1]/w:body[1]/w:tbl[13]

Table caption. Table 8. Geometric mean and arithmetic mean for different sets of result values.

Result Values

Geometric Mean

Arithmetic Mean

4, 6, 9, 10, 12

7.6

8.2

123, 219, 228, 198, 267

201

207

0.25, 0.67, 0.17, 0.46, 0.91

0.41

0.49

2, 2, 2, 2, 2

2

2

1, 10, 100

10

37

Table footprint: 6 rows, 18 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

Precedes a paragraph block

paragraph Order 148 word/document.xml:/w:document[1]/w:body[1]/w:p[134]

The geometric mean is always less than the arithmetic mean unless all the result values are the same.

Immediately after a table block

heading Order 151 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[137]

H3. Standard Deviation and Standard Error of the Mean

paragraph Order 152 word/document.xml:/w:document[1]/w:body[1]/w:p[138]

Two statistical quantities that are often (incorrectly) used interchangeably are SEM and SD (14,15). SD is defined as:

paragraph Order 154 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[140]

Equation 3: Standard Deviation

paragraph Order 155 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[141]

s= (X- X)2n - 1

paragraph Order 157 word/document.xml:/w:document[1]/w:body[1]/w:p[143]

where s represents SD, X represents each data point, X represents the arithmetic mean of the population, and n represents the number of data points

paragraph Order 159 word/document.xml:/w:document[1]/w:body[1]/w:p[145]

The SD describes the variation, or dispersion, in measurements relative to the population mean. By contrast, SEM is defined as:

paragraph Order 161 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[147]

Equation 4: Standard Error of the Mean

paragraph Order 162 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[148]

sx= sn

paragraph Order 164 word/document.xml:/w:document[1]/w:body[1]/w:p[150]

where sx represents SEM, s represents SD, and n represents the number of data points

paragraph Order 166 word/document.xml:/w:document[1]/w:body[1]/w:p[152]

The SEM describes the probability that the measured mean is different from the population mean. With increasing sample size (assuming constant deviation in measurement), the SEM will approach zero.

paragraph Order 168 word/document.xml:/w:document[1]/w:body[1]/w:p[154]

SD and SEM represent very different concepts. The SD describes the distribution of individual data points around a mean, while the SEM describes the precision of the mean estimate.

paragraph Order 170 word/document.xml:/w:document[1]/w:body[1]/w:p[156]

When presenting data, figure legends should explicitly state whether the error bars represent the SD or SEM, as well as the number and type of replicates. Since SEM is always smaller when compared to SD in replicate measurements, we find that SEM is often plotted, presumably to imply less variation in the data. SD should be plotted when trying to convey the variation in the data, while SEM should be plotted when trying to convey the differences in means. In addition, with small data sets, plotting the individual replicates tends to be the best way to demonstrate the variability in the results in a manner universally understood.

paragraph Order 172 word/document.xml:/w:document[1]/w:body[1]/w:p[158]

Consider the following three examples in Figure 1 for a concentration-response curve (CRC) where the same data is plotted with SD, SEM, or individual run values at each concentration:

paragraph Order 174 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[160]

image2.tiff Picture 3
paragraph Order 175 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[161]

A CRC (n = 3 inter-run replicates) is plotted with the error bars representing the SD (A) or SEM (B). In (C), all three independent values at each concentration are shown as data points on the graph. For A and B, the data points on the graphs are shown as the median of the independent data values from the three runs.

figure Order 176 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[162]

Figure. Figure 1. Concentration-response curve plotted with error.

heading Order 178 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[164]

H3. Confidence Interval

paragraph Order 179 word/document.xml:/w:document[1]/w:body[1]/w:p[165]

A confidence interval, computed from the statistics of the observed data, is an estimated range that is likely to contain the unknown parameter. For instance, the 95% confidence interval is a range of values that one can be 95% certain contains the true mean of the estimated parameter. The confidence interval for an estimated parameter is the estimate of that parameter plus or minus the quantile corresponding to the desired confidence level from the appropriate distribution (e.g., normal, t, chi-squared, etc.) times the standard error of the estimate. When the distribution is approximately normal, the confidence interval for a sample mean is given by Equation 5:

paragraph Order 181 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[167]

Equation 5. Confidence Interval

paragraph Order 182 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[168]

x±t*sx

paragraph Order 184 word/document.xml:/w:document[1]/w:body[1]/w:p[170]

where sx is the standard error of the mean, as defined above, and t is the quantile from the Student’s t-distribution corresponding to the desired confidence level and sample size. For example, t=2.26, for 95% confidence and a sample size of 10 (9 degrees of freedom).

heading Order 187 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[173]

H2. Example of Methods to Plot the Same Data

paragraph Order 188 word/document.xml:/w:document[1]/w:body[1]/w:p[174]

An example presented in an article that was published in four separate journals shows a set of data visualized using a scatter plot, box-and-whiskers, median and quartiles, mean ± SD, mean with confidence interval (see section below) and mean with SEM (16-19). To display variability, the author suggests showing raw data, the box-and-whisker plot, median and quartiles, or mean ± SD as the most effective methods. Using these principles, Figure 2 shows all five of these plots representing data for a control compound in an assay performed over a period of time. It demonstrates the impact of each method and the resulting message about variability or error being conveyed. The scatter plot, showing all data points, is typically the preferred format, particularly with many journals.

paragraph Order 190 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[176]

image3.tiff Picture 20
paragraph Order 191 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[177]

The activity for an assay control compound from several different runs is plotted on the y-axis using different methods for expressing the variability. (A) Scatter plot showing all of the data values used, with the median indicated by the red line; (B) Box and whiskers plot, showing the range of data with the median indicated by the line; (C) Median and quartiles; (D) Mean with error bars (1 SD); (E) Mean with 95% confidence interval; (F) Mean with standard error of the mean. Adapted from reference (17).

figure Order 192 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[178]

Figure. Figure 2. Summarized result for an assay control compound using six different methods.

paragraph Order 194 word/document.xml:/w:document[1]/w:body[1]/w:p[180]

Note that the box-and-whisker plot shows the distribution of a set of data by drawing a box (rectangle) that spans from the first to the third quartile, with a line at the median. The whisker on each end extends either to the most extreme data value or to a distance that is 1.5 times the interquartile range (IQR = third quartile – first quartile) from the end of the box, whichever is shorter. Data values that are more extreme than 1.5 times the IQR are plotted as individual points.

heading Order 197 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[183]

H3. Signal-to-Noise (S/N or SNR), Signal-to-Background (S/B) and Z’-Factor

paragraph Order 199 word/document.xml:/w:document[1]/w:body[1]/w:p[185]

The Signal-to-Noise Ratio (SNR or S/N) can be defined as the ratio of the mean signal (Xs) to the standard deviation of that signal (ss). Although the S/N parameter incorporates signal variability, it only evaluates one signal in the assay. Typically, assays are optimized with controls that provide biologically relevant high and low signals that together define the dynamic range. The S/N ratio evaluates only one of the two relevant signals and thus does not allow the overall assay quality to be evaluated. In Figure 3 S/N values are given for high and low assay controls in panels A-D. The S/N values are highest for the two signals in panel A due to the low variability.

paragraph Order 201 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[187]

Equation 6: Signal to Noise Ratio (SNR or S/N)

paragraph Order 202 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[188]

S/N= XSsS

paragraph Order 204 word/document.xml:/w:document[1]/w:body[1]/w:p[190]

The S/N is the reciprocal of the coefficient of variation (CV) which is a measure of precision relative to the mean. The coefficient of variation is often expressed as percent. Percent CV values are indicated for both high and low controls in the panels of Figure 3. The high and low controls of panel A have the lowest %CV values in Figure 3 due to the low variability among data points. Alternatively, and to address the limitations noted above, S/N is sometimes defined as the ratio of the difference between high and low signals of the controls and the total variability of the signal from the controls. The “controls” here refer to whatever reflects the dynamic range of the assay signal; for example, they may be the positive control (at high concentration) and negative control (background), presence and absence of enzyme, etc.

paragraph Order 206 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[192]

Equation 7: Coefficient of Variation (CV)

paragraph Order 207 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[193]

CV= s X

paragraph Order 209 word/document.xml:/w:document[1]/w:body[1]/w:p[195]

The Signal-to-Background Ratio (S/B) is defined as the ratio of the high control mean signal (XHS) to the low control mean signal (XLS). While this metric is useful for describing the assay dynamic range, it does not incorporate the variability of the high (sHS) or low (sLS) signals. An assay with a reasonable S/B value can still be of poor quality if the variability of either signal or both are high. An example is provided in Figure 3D, which shows high variability among both high (%CV = 23) and low (%CV = 37) controls.

paragraph Order 211 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[197]

Equation 8: Signal to Background Ratio (S/B)

paragraph Order 212 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[198]

S/B= XHS XLS

paragraph Order 214 word/document.xml:/w:document[1]/w:body[1]/w:p[200]

The Z’-Factor (20) is considered the best metric to describe assay quality because it incorporates the mean values of both high (XHS) and low (XLS) control signals as well as their variabilities (sHS and sLS, respectively). The Z’-factor value approaches 1 (an ideal assay) as the signal variabilities approach zero or as the dynamic range approaches ∞. An assay with a Z’-factor value above 0.5 is considered excellent. Below 0.5, the assay quality is considered progressively lower quality as the Z’-factor value approaches zero or becomes negative. Screening is essentially impossible when the Z’-factor value is less than zero. The data shown in Figure 3 panels A-D have progressively higher variability as demonstrated by the CV values. Notably, the best quality assay is shown in panel A (Z’ = 0.89), despite the modest dynamic range (S/B = 3.2). The low variability among both high and low controls is the primary driver of the excellent assay quality in this case. At the other extreme is the data in panel D, which has a Z’ value of 0, despite having an S/B value more than 2-fold higher than that of panel A.

paragraph Order 217 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[203]

Equation 9: Z’-Factor

paragraph Order 218 Style Equation word/document.xml:/w:document[1]/w:body[1]/w:p[204]

Z'=1- (3sHS + 3sLS) XHS - XLS

paragraph Order 221 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[207]

Picture 9
image4.jpg Picture 9
paragraph Order 222 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[208]

Assays with varying data quality are shown with calculations of associated parameters. Solid lines represent mean signals and dashed lines represent 3 SD of the mean. (A) A high-quality assay has clear separation between high (XHS) and low (XLS) mean signals as well as low variability (sHS and sLS, respectively). Although the assay dynamic range can be considered modest (S/B = 3.2), the low variability of high and low signals (%CV = 1.4 and %CV = 3.4, respectively) results in an excellent Z’-factor value of 0.89. The low variability of the high signal results in a large S/N value of 73. (B) Despite the greater variability in the high signal (%CV = 6.5) compared to (A) that results in a lower S/N value of 15 (compared to 73), this assay has a larger dynamic range (S/B = 13), resulting in a similarly high Z’-factor value of 0.72. (C) Compared to (B), this assay has approximately 2-fold higher variability for the high signal than (%CV = 11 versus %CV = 6.5), with similar variability in the low signal. This results in lower values of S/N, S/B and Z’-factor. (D) Among the four assays, this assay has the highest variability in both high and low signals. Although the S/B value is not substantially lower than that in (C), the high variability results in low S/N and a Z’-factor near zero.

figure Order 223 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[209]

Figure. Figure 3. Commonly used metrics to describe features of assays, including S/N, CV, S/B and Z’-Factor.

heading Order 226 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[212]

H3. Replicates

paragraph Order 227 word/document.xml:/w:document[1]/w:body[1]/w:p[213]

There has been considerable focus on irreproducible research in science, identifying the issue as a crisis (21,22). Most journals have revised guidelines to address reproducibility within their respective submissions. However, there remains confusion around the definitions of basic scientific terms associated with reproducibility including replicates and repeats (23). Vaux et al. stated several fundamental principles related to statistical design with a focus on replicates and that replicates alone do not provide evidence of reproducibility (24). In addition, different disciplines (biology, chemistry, and statistics) may have different meanings for these terms, which adds to the confusion when involved in multi-functional projects or groups.

paragraph Order 229 word/document.xml:/w:document[1]/w:body[1]/w:p[215]

To minimize confusion and for the purposes of this chapter and the AGM, we define replicates (technical, independent and inter-run) as they apply to in vitro assay development and HTS disciplines. The definitions are followed by some specific examples. Keep in mind that these definitions may differ among various journals or funding agencies.

paragraph Order 231 word/document.xml:/w:document[1]/w:body[1]/w:p[217]

Technical replicates are measurements of the same sample occurring within a single run or experiment. Technical replicates can help to identify within-sample variation but are dependent replicates due to being tested under the same conditions. Technical replicates could be on the same plate or on different plates within the experiment, depending on the variability being assessed.

paragraph Order 233 word/document.xml:/w:document[1]/w:body[1]/w:p[219]

Independent replicates are measurements of distinct preparations of the same samples occurring within a single run or experiment. Comparison of enzyme lots or multiple batches of independently cultured and treated cell preparations are examples of independent replicates for in vitro assays. For the best estimate of between sample variations, independent replicates should be on the same plate to minimize the contribution of additional sources of error (e.g. between plate variation). If sample capacity of the plate is limited and multiple plates are required for the study, multiple independent replicates can be randomized to multiple plates with multiple technical replicates per plate.

paragraph Order 235 word/document.xml:/w:document[1]/w:body[1]/w:p[221]

Inter-run replicates are measurements of the same or different sample(s) across multiple runs or experiments. A compound tested in a CRC on three different days represents inter-run replicates. Each of the runs or experiments should have the same assay conditions and reagents.

paragraph Order 237 word/document.xml:/w:document[1]/w:body[1]/w:p[223]

Depending on the experimental design, the greater the number of measurements, the better the estimate of variation, and the better the estimate of the mean.

paragraph Order 239 word/document.xml:/w:document[1]/w:body[1]/w:p[225]

When presenting data, it is important to provide the number and type of replicates (24-26) and to include them in the figure legend. For written work, scientists should consider providing a statement in the experimental methods defining the nature of their technical, independent or inter-run replicates. In general, sample variation may be greater in magnitude than technical variation. The optimal type and number of replicates depends on the scientific question and the experimental methods.

paragraph Order 241 word/document.xml:/w:document[1]/w:body[1]/w:p[227]

In practice, the distinction between technical, independent and inter-run replicates is not always straightforward. For example, with cell-free/biochemical assays, one approach to modeling variation would be to utilize independently synthesized reagents (e.g., enzymes), though in practice this is either impractical or counter-productive (i.e., in the case of HTS where one is attempting to minimize imprecision). Steps can be taken to mitigate batch-to-batch variation in the context of large scale experiments, for example through the use of batch pooling strategies, which is described further in the AGM Chapter on Validating Identity, Mass Purity and Enzymatic Purity of Enzyme Preparations (27).

paragraph Order 243 word/document.xml:/w:document[1]/w:body[1]/w:p[229]

Consider the following examples describing typical assay or screening experiments and the type of replicates that would result from each format:

list Order 245 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[231]
  • Example 1. A compound is tested for inhibition of enzyme X in a 384-well microplate. Three concentration-response curves for this inhibitor are tested on the same plate, with compounds and reagents derived from the same stock solutions and tested at the same time. Variation in this experimental setup would be random, so the best description would be n = 3 technical replicates. Determine an average value at each concentration (mean or median, depending on the variability) and fit a single concentration-response curve for the entire data set. Preferably, fit all individual replicates using the curve fitting routine.
list Order 247 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[233]
  • Example 2. A compound is tested for inhibition of enzyme X in a 96-well microplate. One concentration-response curve for this inhibitor is tested on the same plate. Each well is measured ten times with a plate reader. Variation in this experimental setup would be random, so the best description would be n = 10 technical replicates. In this case, unless the detection methodology is not very robust, this would probably constitute a poor choice of replication.
list Order 249 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[235]
  • Example 3. A compound is tested for inhibition of enzyme X in a 384-well microplate. Three concentration-response curves for this inhibitor are tested on different plates, with compounds and reagents derived from the same stock solutions and tested at the same time. Variation in this experimental setup would still be random, so the best description would be n = 3 technical replicates. The most appropriate approach for data analysis is to fit each curve independently and then report the average or geomean of the potency values with variability.
list Order 251 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[237]
  • Example 4. A compound is tested for inhibition of enzyme X in a 384-well microplate. Three concentration-response curves for this inhibitor are tested, with compounds and reagents derived from the same sources, but tested on different days. These would be classified as inter-run replicates, since the same samples were tested, but in different runs. Therefore, n = 3 inter-run replicates. The most appropriate approach for data analysis is to fit each curve independently and report the mean and associated error of the results (e.g. EC50)..
list Order 253 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[239]
  • Example 5. A compound is tested for inhibition of enzyme X in a 384-well microplate. Three concentration response curves are prepared for the inhibitor. Each curve is tested with a separate lot of enzyme from Vendor Y. All three curves are run on the same day, on the same plate. This would be an example of independent replicates, n = 3 independent replicates. Including technical replicates (multiple aliquots of the same enzyme) would help to determine if between sample differences are greater than the within sample variation. Determine a single response curve for each enzyme lot.
list Order 255 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[241]
  • Example 6. A compound is tested for inhibition of enzyme X in a 384-well microplate. Three concentration response curves are prepared for the inhibitor. Each curve is prepared with a completely independent synthesis of the compound, i.e. each is a unique lot or batch. All three curves are run on the same day, on the same plate. This would be an example of independent replicates, n = 3 independent replicates. Determine a single response curve for each compound lot.
paragraph Order 257 word/document.xml:/w:document[1]/w:body[1]/w:p[243]

Figure 4 shows example curves that might be associated with data for technical replicates (4A), independent replicates (4B) and inter-run replicates (4C).

paragraph Order 259 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[245]

image5.tif Picture 2
paragraph Order 260 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[246]

<?escape?>(A) Three independently-prepared concentration-response curves (from the same compound stock) were tested on the same plate within a single experiment. This represents technical replicates (n = 3). Data points are the median result values with error bars indicated as SD. (B) Enzyme progress curves for four different lots of enzyme all tested within the same experiment. This represents independent replicates (n = 4). In this example, each lot was tested only once, so there were no technical replicates associated with the data. (C) Three independent saturation-binding experiments were performed on different days. All assay conditions and reagents were identical for each of the three runs. Within each run, there were three replicates for each concentration tested. This example represents inter-run replicates (n = 3) with n = 3 technical replicates within each run. Error bars represent SD.

figure Order 261 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[247]

Figure. Figure 4. Comparison of technical replicates, independent replicates, and inter-run replicates.

paragraph Order 263 word/document.xml:/w:document[1]/w:body[1]/w:p[249]

There are no standard guidelines for how to treat technical versus independent replicates, and in practice it will depend on the scientific question and sources of error. In a system with both technical and independent replicates, if there is significant biological error, then it may be useful to analyze independent replicates (with any associated technical replicates) separately. In such a case of significant biological error, the source should be investigated or explicitly addressed.

paragraph Order 265 word/document.xml:/w:document[1]/w:body[1]/w:p[251]

A related concept is the number of independent experiments or inter-run replicates required to understand the variation of a process or an assay, which has been described for assay validation studies (28). Again, the definition of a truly “independent” experiment may not always be straightforward. Independent experiments are typically performed on separate days using reagents that originate from the earliest possible source (enzyme stock, cell line stock, etc.).

paragraph Order 267 word/document.xml:/w:document[1]/w:body[1]/w:p[253]

There is not a consensus for the number of individual technical/independent replicates or independent experiments (inter-run replicates) required to understand the variation of a process or an assay. Several factors (time, reagent cost, etc.) may limit the number of practical replicates that can be conducted. In addition, journals may have their own guidelines on the number of replicates required for acceptable publications. Consider these factors with any study and consult a statistician to ensure an adequate level of replication.

paragraph Order 269 word/document.xml:/w:document[1]/w:body[1]/w:p[255]

Be cautious when terms such as “duplicate”, “triplicate” or “quadruplicate” measurements are being used and be sure that the meaning of these terms is clearly understood. It can be very confusing when reading figure legends in publications, since there are so many variations on how to write up technical replicates, independent replicates and inter-run replicates, etc. Some examples from a few issues of the same journal are shown below. These examples represent the difficulty in interpretation that can happen when there are no standards or consistency.

list Order 271 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[257]
  • Plotted is the mean of triplicate reactions
list Order 272 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[258]
  • Data points represent the mean of triplicate measurements
list Order 273 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[259]
  • Data are mean ± SEM for 16 replicates
list Order 274 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[260]
  • Data are expressed as means ± SD of a representative experiment performed in quadruplicate out of three independent experiments.
list Order 275 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[261]
  • Data points represent the mean ± SD (n = 3) of three independent experiments.
list Order 276 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[262]
  • Shown are representative figures for n = 3 independent experiments
list Order 277 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[263]
  • Data shown are the mean ± SD; n = minimum of 2 wells
list Order 278 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[264]
  • Data points represent the mean and standard deviation of three independent replicates.
paragraph Order 280 word/document.xml:/w:document[1]/w:body[1]/w:p[266]

These definitions and guidelines presented for technical replicates, independent replicates, and inter-run replicates may help in interpreting and understanding data. It is important to provide additional information on replicates such as whether the replicates came from the same or independent sample stock, plates, experimental runs or from multiple runs. For example, simply stating that the “data represents 16 replicates” is not useful. An example of a representative legend can be found in the Figure Legends section of this chapter. In addition, a couple of direct examples from a journal that has figure legends that fully describe the data points and error in graphs are shown below:

list Order 282 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[268]
  • Concentration-response curves, fitted according to the Hill equation, are shown for three technical replicates from the same assay plate. Error is represented by SD.
list Order 283 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[269]
  • Results are expressed as the geometric mean of 5 independent measurements all made on separate days. Error = SD.
heading Order 285 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[271]

H3. Random and Systematic Error

paragraph Order 286 word/document.xml:/w:document[1]/w:body[1]/w:p[272]

This section describes random and systematic error. Understanding the type of error that may exist in an assay is an important counterpart to experiment replication and the assessment of variability in an assay.

paragraph Order 288 word/document.xml:/w:document[1]/w:body[1]/w:p[274]

Random errors are unpredictable and have no defined pattern. Random errors are fluctuations that are not replicated in subsequent experiments. Sources of random error may include the precision limitations of detection or liquid handing instrumentation, changing temperatures in a laboratory and fluctuations in the assay methods. An example of random error is measuring the same sample in an instrument with three different result values. As described above, technical or biological replicates can provide an estimate of variation for understanding random error within an assay. Another example of random error occurs when measuring radioactive experiments such as scintillation proximity assays or SPA as described in the AGM chapter Receptor Binding Assays for HTS and Drug Discovery (29). The variability associated with random error from radioactivity can be reduced by increasing the read time (30).

paragraph Order 290 word/document.xml:/w:document[1]/w:body[1]/w:p[276]

Systematic errors are persistent (unless addressed) and can be associated with instrumentation, technique or the experimental design itself. An improperly calibrated instrument can lead to constant variation. Often systematic error results in values that are proportional or scaled to the true value. Examples of systematic error in HTS can include an incorrect wavelength setting/emission filter on a detector, faulty liquid and compound dispensing instrumentation, positional effect within a detector (31), time drift in a stack of microplates being read, and edge effects associated with evaporation. Systematic errors may be difficult to identify, but a thorough knowledge of equipment and the experimental methods being used are a critical part of detecting and minimizing the effects. One group has written extensively about minimizing the impact of systematic errors in HTS data (32-35) including those associated with assay-specific and plate-specific spatial biases (36).

paragraph Order 292 word/document.xml:/w:document[1]/w:body[1]/w:p[278]

Some examples of plate drift and spatial effects, which are systematic errors, are shown in the HTS Assay Validation chapter of the AGM (28).

heading Order 295 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[281]

H2. Correlation of Data

heading Order 297 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[283]

H3. Pearson’s Correlation

paragraph Order 298 word/document.xml:/w:document[1]/w:body[1]/w:p[284]

Pearson’s correlation measures the strength of the linear relationship between two sets of variables and is therefore equivalent to a linear correlation. An underlying assumption is that both variables are normally distributed (bell-shaped, symmetrical distribution of data). Only extreme or obvious departures from this assumption are problematic. Other names for Pearson’s correlation include product moment correlation coefficient (PMCC) and Pearson’s r. Pearson’s correlation returns a value (the correlation coefficient, r) between -1 and 1 where:

paragraph Order 300 word/document.xml:/w:document[1]/w:body[1]/w:p[286]

r = -1 indicates a perfect negative linear relationship

paragraph Order 301 word/document.xml:/w:document[1]/w:body[1]/w:p[287]

r = 1 indicates a perfect positive linear relationship

paragraph Order 302 word/document.xml:/w:document[1]/w:body[1]/w:p[288]

r = 0 indicates no linear relationship

paragraph Order 304 word/document.xml:/w:document[1]/w:body[1]/w:p[290]

The larger the absolute value of the correlation coefficient, the stronger the linear relationship. The meaning of the correlation coefficient size varies in the scientific literature, but one suggested range (37) is shown in Table 9.

Immediately before a table block

table Order 306 word/document.xml:/w:document[1]/w:body[1]/w:tbl[14]

Table caption. Table 9. Meanings for Pearson’s correlation coefficient from Reference (37).

Absolute size of correlation

(positive or negative)

Interpretation of correlation

0.90 to 1.00

Very high correlation

0.70 to 0.90

High correlation

0.50 to 0.70

Moderate correlation

0.30 to 0.50

Low correlation

0.00 to 0.30

Negligible correlation

Table footprint: 6 rows, 12 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

Precedes a paragraph block

paragraph Order 310 word/document.xml:/w:document[1]/w:body[1]/w:p[295]

The correlation coefficient number alone is not adequate for demonstration of statistical relevance. Different patterns of Y vs X relationships can have the same correlation coefficient (11,38) as shown in Figure 5. Therefore, it is always important to plot the data for any reported correlation and consult with a statistician if the resulting correlation coefficient does not match an expected visual representation of the data.

Immediately after a table block

paragraph Order 312 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[297]

Picture 7
image6.png Picture 7
paragraph Order 313 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[298]

All the graphs shown have Pearson correlation coefficients equal to 0.7. The example in graph 6 is the one most typically perceived when the correlation coefficient is 0.7. (Reprinted from (38) with permission.)

figure Order 314 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[299]

Figure. Figure 5. Example plots of 8 different data sets with the same Pearson correlation coefficient value (r=0.7).

heading Order 316 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[301]

H3. Spearman’s Correlation

paragraph Order 317 word/document.xml:/w:document[1]/w:body[1]/w:p[302]

Spearman’s correlation is a nonparametric approach to correlation, which means the variables are not assumed to have a normal distribution. It measures the strength of the rank-ordered relationship between two variables. Computationally, Spearman’s correlation is the linear correlation of the ranks of each variable. That is, the values for each variable are separately assigned ranks 1 to n, and Spearman’s correlation is calculated on the ranks. The result is a value (the correlation coefficient, ρ) between -1 and 1 and can be interpreted in a similar fashion to Pearson’s correlation shown in Table 9.

heading Order 319 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[304]

H3. Correlation Example

paragraph Order 320 word/document.xml:/w:document[1]/w:body[1]/w:p[305]

This subsection describes a common analysis made in the drug discovery process using the correlation concepts described in the previous sections.

paragraph Order 322 word/document.xml:/w:document[1]/w:body[1]/w:p[307]

Consider the EC50 results in Table 10 for two cell-based assays associated with different species, Assay 1 and Assay 2. The goal of the analysis is to determine whether there is a correlation between the performance of the two assays, such that one assay could be used for predictive purposes over the other assay, if needed.

Immediately before a table block

table Order 324 word/document.xml:/w:document[1]/w:body[1]/w:tbl[15]

Table caption. Table 10. EC50 values for two cell-based assays used for correlation analysis.

EC50, nM

Compound

Assay 1

Assay 2

1

0.33

0.29

2

0.41

0.74

3

0.52

0.24

4

0.83

0.42

5

1.1

1.2

6

1.2

0.68

7

1.6

1.9

8

2.2

1.2

9

2.5

2

10

3

4

11

6

2

12

7

6

13

11

12

14

16

7

15

19

22

16

25

8

17

26

18

18

30

27

19

35

12

20

40

1

Table footprint: 22 rows, 65 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

Precedes a paragraph block

paragraph Order 327 word/document.xml:/w:document[1]/w:body[1]/w:p[311]

Performing a linear regression using GraphPad Prism software with the data yields the graph in Figure 6 and the Pearson’s correlation coefficient (r = 0.62). The x-axis is a linear scale.

Immediately after a table block

paragraph Order 329 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[313]

image7.tif Picture 8
paragraph Order 330 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[314]

EC50 values in nM (from Table 10) determined from two different CRC assays were plotted on a linear x-axis (Assay 1) and a linear y-axis (Assay 2). Shown is the regression line (solid line) determined from the analysis along with the line of identity (dashed line, defined as unity or x=y). The Pearson’s correlation coefficient is r=0.62.

figure Order 331 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[315]

Figure. Figure 6. Linear correlation using EC50 values.

paragraph Order 333 word/document.xml:/w:document[1]/w:body[1]/w:p[317]

Note that the data points are tightly clustered at the lower end of the regression line. Pearson’s correlation for such highly clustered data would give disproportionately higher weight to a few data points. Since the potency data was originally derived from nonlinear regression of log-transformed concentrations (x-axis), using log transformed potency values would be the correct method for determining correlations. The use of a log transformation on potency values provides a symmetric (approximately normal) distribution, and thus all points would more equally contribute to the calculated value of Pearson’s correlation. It is recommended to perform the log transformations of the potency values using the molar scale to avoid issues with negative and positive log values in the same scale. Furthermore, some scientists prefer to express the data in terms of the negative log10 EC50 using molar units, which is referred to as pEC50 in the literature. The same correlation coefficient will be obtained whether using log-transformed or pEC50 values. The data in Table 10 are now updated in Table 11 to include the results in terms of log10 EC50 (Molar) and pEC50.

Immediately before a table block

table Order 335 word/document.xml:/w:document[1]/w:body[1]/w:tbl[16]

Table caption. Table 11. Results for Assay 1 and Assay 2 in Table 10 converted to Log EC50 and pEC50 values.

EC50, nM

EC50, Molar

Log EC50 Molar

-Log Molar = pEC50

#

Assay 1

Assay 2

Assay 1

Assay 2

Assay 1

Assay 2

Assay 1

Assay 2

1

0.33

0.29

3.3E-10

2.9E-10

-9.48

-9.54

9.48

9.53

2

0.41

0.74

4.1E-10

7.4E-10

-9.39

-9.13

9.38

9.13

3

0.52

0.24

5.2E-10

2.4E-10

-9.28

-9.62

9.28

9.62

4

0.83

0.42

8.3E-10

4.2E-10

-9.08

-9.38

9.08

9.37

5

1.1

1.2

1.1E-09

1.2E-09

-8.96

-8.92

8.95

8.92

6

1.2

0.68

1.2E-09

6.8E-10

-8.92

-9.17

8.92

9.16

7

1.6

1.9

1.6E-09

1.9E-09

-8.80

-8.72

8.79

8.72

8

2.2

1.2

2.2E-09

1.2E-09

-8.66

-8.92

8.65

8.92

9

2.5

2

2.5E-09

2.0E-09

-8.60

-8.70

8.60

8.69

10

3

4

3.0E-09

4.0E-09

-8.52

-8.40

8.52

8.39

11

6

2

6.0E-09

2.0E-09

-8.22

-8.70

8.22

8.69

12

7

6

7.0E-09

6.0E-09

-8.15

-8.22

8.15

8.22

13

11

12

1.1E-08

1.2E-08

-7.96

-7.92

7.95

7.92

14

16

7

1.6E-08

7.0E-09

-7.80

-8.15

7.79

8.15

15

19

22

1.9E-08

2.2E-08

-7.72

-7.66

7.72

7.65

16

25

8

2.5E-08

8.0E-09

-7.60

-8.10

7.60

8.09

17

26

18

2.6E-08

1.8E-08

-7.59

-7.74

7.58

7.74

18

30

27

3.0E-08

2.7E-08

-7.52

-7.57

7.52

7.56

19

35

12

3.5E-08

1.2E-08

-7.46

-7.92

7.45

7.92

20

40

1

4.0E-08

1.0E-09

-7.40

-9.00

7.39

9.00

Table footprint: 22 rows, 194 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

Precedes a paragraph block

paragraph Order 338 word/document.xml:/w:document[1]/w:body[1]/w:p[321]

The linear regressions using the Log10-transformed EC50 values (Molar) or the pEC50 values from Table 11 are shown in Figures 7A and 7B.

Immediately after a table block

paragraph Order 340 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[323]

image8.tif Picture 16
paragraph Order 341 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[324]

<?escape?>(A) EC50 values (Molar) determined from two different CRC assays (Table 11) were Log10-transformed and plotted on a log-transformed linear x-axis (Assay 1) and a log-transformed linear y-axis (Assay 2). (B) A similar plot was created using pEC50 values (-Log Molar) for Assay 1 and Assay 2. Shown in both plots are the linear regression line (solid line) and the line of identity agreement, x=y (dashed line). Note that the correlations are mirror opposites, since the pEC50 (Plot B) is the -Log Molar EC50 (Plot A). The Pearson’s correlation coefficient is 0.83 in both plots.

figure Order 342 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[325]

Figure. Figure 7. Linear correlation using Log10 transformed EC50 and pEC50 values.

paragraph Order 344 word/document.xml:/w:document[1]/w:body[1]/w:p[327]

The Pearson’s correlation coefficient for this example is now 0.83, and adequately displays the linear relationship between the two assays, since the data points are more evenly dispersed across the data range on the log-transformed linear scale.

paragraph Order 346 word/document.xml:/w:document[1]/w:body[1]/w:p[329]

To determine Spearman’s correlation coefficient for this example, the compound result values (either EC50 or log transformed) are ranked for each assay from one-to-n. The analysis can be performed in GraphPad Prism, JMP, or another suitable statistics software. Figure 8 shows a plot of the ranks for each assay. Spearman’s  is 0.80 for this data set.

paragraph Order 348 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[331]

image9.tif Picture 17
paragraph Order 349 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[332]

Spearman’s correlation coefficient for ranked data from Table 10,  = 0.80.

figure Order 350 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[333]

Figure. Figure 8 Plot of the ranked values for each assay in Table 10.

heading Order 352 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[335]

H3. Concordance Correlation Coefficient

paragraph Order 353 word/document.xml:/w:document[1]/w:body[1]/w:p[336]

Another parameter that will be useful to evaluate and report for comparing variables that are expected to yield identical results is the Concordance Correlation coefficient (39). This correlation measures the degree of agreement between the values of two variables in relation to the 45-degree line (line of agreement). Mathematically, it is approximately equivalent to first evaluating the Pearson’s correlation, which measures the degree of closeness of the values to the best straight line (least-squares regression line), and then penalizing this correlation value based on the degree of the departure of this best straight line from the 45-degree line. Therefore, the Concordance Correlation value can never be greater than the Pearson’s correlation value. This calculation is also available in the Replicate-Experiment analysis Excel template referenced below.

heading Order 356 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[339]

H2. Agreement Between Two Variables

paragraph Order 357 word/document.xml:/w:document[1]/w:body[1]/w:p[340]

If the interest is more in assessing the agreement between two variables, then the Bland-Altman method (40) demonstrated in the Replicate-Experiment template available in the Replicate-Experiment Study section of the AGM chapter on HTS Assay Validation (28) could be used, followed by correlation assessments. Examples of such scenarios include comparison of two assays that are expected to give similar results, comparison of results from the same assay from different laboratories or assay reagents. From this analysis, additional information such as the limits of agreement (LsA), mean ratio (MR), minimum significant ratio (MSR), and other statistical parameters are estimated.

paragraph Order 359 word/document.xml:/w:document[1]/w:body[1]/w:p[342]

The Replicate-Experiment analysis for the data in Table 10 is shown in Figure 9.

paragraph Order 361 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[344]

Picture 6
image10.png Picture 6
paragraph Order 362 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[345]

Picture 5
image11.jpg Picture 5
paragraph Order 363 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[346]

Picture 23
image12.jpg Picture 23
paragraph Order 364 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[347]

Picture 13
image13.png Picture 13
paragraph Order 365 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[348]

Picture 25
image14.jpg Picture 25
paragraph Order 367 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[350]

Results from the Replicate-Experiment template available in the AGM for the comparison of Assay 1 and 2 data obtained from Table 10 or 11.

figure Order 368 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[351]

Figure. Figure 9. Agreement between Assay 1 and Assay 2 using data from Table 10.

heading Order 370 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[353]

H2. P-values

paragraph Order 371 word/document.xml:/w:document[1]/w:body[1]/w:p[354]

In 2016, the American Statistical Association published an official statement on p-values (41). A list of the key principles from that statement is shown below:

list Order 373 word/document.xml:/w:document[1]/w:body[1]/w:p[356]
  • P-values can indicate how incompatible the data are with a specified statistical model.
list Order 374 word/document.xml:/w:document[1]/w:body[1]/w:p[357]
  • P-values do not measure the probability that the studied hypothesis is true, or the probability that the data were produced by random chance alone.
list Order 375 word/document.xml:/w:document[1]/w:body[1]/w:p[358]
  • Scientific conclusions and business or policy decisions should not be based only on whether a p-value passes a specific threshold.
list Order 376 word/document.xml:/w:document[1]/w:body[1]/w:p[359]
  • Proper inference requires full reporting and transparency.
list Order 377 word/document.xml:/w:document[1]/w:body[1]/w:p[360]
  • A p-value, or statistical significance, does not measure the size of an effect or the importance of a result.
list Order 378 word/document.xml:/w:document[1]/w:body[1]/w:p[361]
  • By itself, a p-value does not provide a good measure of evidence regarding a model or hypothesis.
paragraph Order 380 word/document.xml:/w:document[1]/w:body[1]/w:p[363]

We refer the reader to the main article and a discussion article for further information (41,42).

paragraph Order 382 word/document.xml:/w:document[1]/w:body[1]/w:p[365]

A key topic related to p-values is multiple comparisons or multiple testing. When multiple groups from the same experiment are compared pairwise or in relation to a control group, the unadjusted p-values can overstate the overall significance. Appropriate adjustments should be made using methods such as Bonferroni, Dunnett, or Tukey depending on what is being compared (i.e., comparisons vs. a control group or all possible pairwise comparisons between groups) (42). In situations with a very large number of comparisons, such as with ‘omics data, the False Discovery Rate (FDR) method is often recommended (43).

heading Order 384 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[367]

H2. Calculation of Upper and Lower Quantitation Limits

paragraph Order 385 word/document.xml:/w:document[1]/w:body[1]/w:p[368]

Assays where the analyte concentration levels are calibrated from a reference standard curve entail special statistical considerations and evaluation of additional parameters during the development, optimization and validation, as explained in the Immunoassay Methods chapter in this manual (44). This includes accuracy, intermediate precision, sensitivity, dynamic range with the lower and upper quantitation limits, dilution linearity, parallelism, stability, etc. The lower limit of quantitation (LLOQ) and upper limit of quantitation (ULOQ) are defined as the lowest and highest concentrations of the analyte that can be reliably measured by the assay based on pre-specified criteria around accuracy, imprecision and sometimes total error (see Table 8 in the Immunoassay Methods chapter of the AGM (44)). Accuracy is typically defined in terms of percent relative bias of the measured analyte concentration from the calibration curve relative to its spiked nominal level. Imprecision (also referred to simply as precision) is defined by the percent coefficient of variation. Total error is defined as the sum of the absolute value of relative bias plus the imprecision (%CV). The criteria for total error, accuracy, and imprecision vary on a case by case basis. For example, accuracy and imprecision are typically expected to be < 15% for PK assays of small molecules, < 20% for PK assays of large molecules, < 25 to 30% for biomarker assays. For biomarker assays, a white paper from the cross-industry working group (45) proposed the criteria for LLOQ and ULOQ to be based on 30% total error, 25% accuracy, and 25% imprecision.

paragraph Order 387 word/document.xml:/w:document[1]/w:body[1]/w:p[370]

As mentioned above, a Microsoft Excel-based template for the analysis of pre-study validation data for these types of assays associated with the calculation of assay performance characteristics such as accuracy, precision, sensitivity, etc., is available in the Replicate-Experiment Study section of the AGM chapter on HTS Assay Validation. An example output from this Excel-based tool is provided in Figure 10. This is a graph of the Bias, Imprecision (%CV) and Total Error for each validation sample. If the LLOQ and ULOQ are defined as the lowest and highest concentrations where the bias and imprecision are less than 25% and the total error is less than 30%, then in this example dataset, the LLOQ is 5.9 pg/mL and the ULOQ is 550 pg/mL.

paragraph Order 389 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[372]

image15.tif Picture 21
paragraph Order 390 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[373]

Profile of the % relative error (bias), % imprecision (coefficient of variation; CV) and % total error for the samples used in a validation experiment. Additional assay parameters such as the sensitivity and dynamic range are derived from this analysis. For practical purposes the useful calibration range is below the 25-30% CV line.

figure Order 391 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[374]

Figure. Figure 10. Profile for Total error, precision and bias.

heading Order 393 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[376]

H2. Area under the Curve

paragraph Order 394 word/document.xml:/w:document[1]/w:body[1]/w:p[377]

The area under a concentration response curve or kinetic time course can be quantified and used as a metric to compare data, which is referred to as the area under the curve (AUC). This metric provides a basis for comparing data when it is not possible to determine the values of EC50 and Emax. However, even when these parameters are determined, the AUC can be preferable because it incorporates both potency and efficacy within a single metric (46). Figure 11 illustrates the concept of AUC.

paragraph Order 396 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[379]

image16.tif Picture 22
paragraph Order 397 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[380]

Area under the curve (AUC) is shown as the green shaded region. The advantage of calculating AUC is that both potency and efficacy are incorporated into a single metric.

figure Order 398 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[381]

Figure. Figure 11. CRC plot illustrating the quantification of area under the curve (AUC).

heading Order 401 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[384]

H2. Graph Elements

heading Order 403 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[386]

H3. Title

paragraph Order 404 word/document.xml:/w:document[1]/w:body[1]/w:p[387]

A graph in written format typically has a figure number and a legend title (see Figure Legends section below) whereas a graph used in a presentation or discussion may have only a title. This title can provide the speaker with a visual cue, but it should be succinct so that it does not distract the listener or reader from the message being delivered, often during a short time period. Examples of succinct titles for a standard CRC like those shown previously in Figure 1 might be:

paragraph Order 406 word/document.xml:/w:document[1]/w:body[1]/w:p[389]

Concentration-Response Curve for Compound X

paragraph Order 407 word/document.xml:/w:document[1]/w:body[1]/w:p[390]

CRC for Compound X

paragraph Order 408 word/document.xml:/w:document[1]/w:body[1]/w:p[391]

or simply, Compound X

paragraph Order 410 word/document.xml:/w:document[1]/w:body[1]/w:p[393]

Much of the context for a graph in a discussion/presentation format can be provided during the presentation.

heading Order 412 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[395]

H3. Figure Legends

paragraph Order 413 word/document.xml:/w:document[1]/w:body[1]/w:p[396]

Figure legends (also referred to as captions) are crucial components of scientific figures in the literature. Remember, figures and their accompanying legends should function as stand-alone material. In other words, readers should be able to interpret the overall message of the figure without having to consult the primary text. High-quality figure legends should contain the following: title, description of techniques used, summary of results, and definitions. Note that depending on the publication source, certain components may be emphasized or de-emphasized. Figure legends typically comprise 100 to 300 words in total.

paragraph Order 415 word/document.xml:/w:document[1]/w:body[1]/w:p[398]

Titles should be brief and either descriptive (“Inhibition of enzyme X by compound Y”) or declarative (“Compound Y is a nanomolar inhibitor of enzyme X”). The choice of descriptive versus declarative titles may depend on journal formats or author preference. For multi-panel figures, the title should encompass a common message for all of the panels.

paragraph Order 417 word/document.xml:/w:document[1]/w:body[1]/w:p[400]

The techniques used should be briefly described (“Inhibition of enzyme X was determined by radiolabeled substrate incorporation”) and should be minimized to only what is necessary to understand the figure. This should include the number and type of replicates as well as independent experiments, whether the results are pooled or representative and any statistical tests utilized (e.g., SD or SEM).

paragraph Order 419 word/document.xml:/w:document[1]/w:body[1]/w:p[402]

Depending on the nature of the data and figure, a brief statement about the key results should be included in the figure legend (“Compound Y is 5-fold more potent than the previously reported Compound Z”).

paragraph Order 421 word/document.xml:/w:document[1]/w:body[1]/w:p[404]

Lastly, figure legends should define any abbreviations, symbols, coloring, or scaling. Uncommon features, such as broken axes, may need to be explicitly noted (see below).

paragraph Order 423 word/document.xml:/w:document[1]/w:body[1]/w:p[406]

Other important miscellaneous notes:

list Order 424 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[407]
  • For multi-panel figures, it may not be possible to describe each panel in detail. In such cases, it may be most effective (and efficient) to summarize several related panels in one statement.
list Order 425 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[408]
  • Utilize consistent verb tense. Past tense is used most often for describing completed experiments (“Inhibition of enzyme X was determined by radiolabeled substrate incorporation”), while present tense can be used for declarative statements (“Compound Y is a nanomolar inhibitor of enzyme X”).
paragraph Order 427 word/document.xml:/w:document[1]/w:body[1]/w:p[410]

To illustrate these concepts, consider the graph in Figure 12 and several examples of a figure legend:

paragraph Order 429 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[412]

image17.tiff Picture 31
paragraph Order 430 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[413]

Lower-quality example figure legend:

paragraph Order 431 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[414]

Figure 12. Effect of compound X versus enzyme Y.

paragraph Order 433 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[416]

Better-quality example figure legend:

paragraph Order 434 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[417]

Figure 12. Inhibition of enzyme Y by compound X. Enzymatic activity was determined using a radiolabeled substrate assay in triplicate. Compound X inhibits enzyme Y with an IC50 value of 10M.

paragraph Order 436 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[419]

High-quality example figure legend:

paragraph Order 437 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[420]

Figure 12. Inhibition of enzyme Y by compound X. Enzymatic activity of enzyme Y was determined using radiolabeled substrate assay. Compound X inhibits enzyme Y with an IC50 value of 10 2M. Data are mean SD from three technical replicates.

figure Order 439 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[422]

Figure. Figure 12. CRC plot with three example figure legends shown below.

heading Order 443 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[426]

H3. Axis Scale

paragraph Order 444 word/document.xml:/w:document[1]/w:body[1]/w:p[427]

Most graphs will have a single horizontal axis (x-axis) which corresponds to the independent variable and a single vertical axis (y-axis) which corresponds to a dependent variable. The axis scale can vary depending on the type of data being displayed and the same data on different scales may not convey the same message. The main types of scales used in assay and screening applications include linear (or arithmetic) or logarithmic (or log). In addition, when data is converted to log values, the data points can be plotted on a linear scale.

list Order 446 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[429]
  • Linear scale – a linear scale will show equal spacing between the scale units or tick marks. With a linear axis, the baseline often begins with zero. Log-transformed data can be plotted on a linear scale as well.
list Order 447 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[430]
  • Log scale – a log scale will have unequal spacing between scale units or tick marks. Major tick marks will have a consistent ratio between them, such as ten. By definition, logarithmic axes do not contain negative numbers. In addition, zero cannot be plotted on a logarithmic axis. To use a logarithmic scale, the actual data values are plotted.
paragraph Order 449 word/document.xml:/w:document[1]/w:body[1]/w:p[432]

Consider the data values from the curves in Figure 2, shown in Table 12 with the concentration (x-axis) listed in the native format (Concentration, [nM]) and in the log-transformed format (Log M).

Immediately before a table block

table Order 451 word/document.xml:/w:document[1]/w:body[1]/w:tbl[17]

Table caption. Table 12. CRC data with x-axis values in native and log-transformed formats.

Concentration, [nM]

Concentration, [Molar]

Log M

Replicate 1

Replicate 2

Replicate 3

100

1.00E-07

-7.00

97.3

90.5

96.4

33.3

3.33E-08

-7.48

94.3

97.9

100.4

11.1

1.11E-08

-7.95

93.3

94.3

96.6

3.70

3.70E-09

-8.43

86.6

86.9

89.2

1.23

1.23E-09

-8.91

72.7

70.3

81.0

0.412

4.12E-10

-9.39

53.1

47.3

55.7

0.137

1.37E-10

-9.86

27.0

31.4

30.0

0.0457

4.57E-11

-10.34

31.4

-3.0

17.2

0.0152

1.52E-11

-10.82

14.1

-4.2

0.5

0.00508

5.08E-12

-11.29

13.7

-1.5

5.8

Table footprint: 11 rows, 66 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

Precedes a paragraph block

paragraph Order 454 word/document.xml:/w:document[1]/w:body[1]/w:p[436]

The data in Table 12 can be plotted on a log scale (Figure 13A), linear scale (Figure 13B) and using log-transformed data on a linear scale (Figure 13C).

Immediately after a table block

paragraph Order 456 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[438]

image18.tiff Picture 32
paragraph Order 457 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[439]

Data from Table 12 was plotted using a log x-axis scale (A), linear x-axis scale (B) or log-transformed data on a linear x-axis scale (C). Data points for all graphs are the mean of three technical replicates. Error bars represent SD. The same nonlinear equation (4-parameter Hill equation (47)) was used for each curve fit.

figure Order 458 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[440]

Figure. Figure 13. CRC data plotted with three different x-axis scales.

paragraph Order 460 word/document.xml:/w:document[1]/w:body[1]/w:p[442]

Comments about each sub-figure are shown below:

paragraph Order 462 word/document.xml:/w:document[1]/w:body[1]/w:p[444]

Figure 13A. This figure uses the actual concentration values (Molar) with an x-axis log scale. The resulting curve fit from GraphPad Prism is ambiguous (compare the curve fit of 13A and 13C, which both use the same nonlinear curve fitting routine, but difference x-axis scales). This type of scale is not typically used with CRC data.

paragraph Order 464 word/document.xml:/w:document[1]/w:body[1]/w:p[446]

Figure 13B. This figure uses a linear scale for the actual concentration values. As a result, the data points on the graph will be clustered at one end of the scale and the curve fit is also ambiguous. This type of scale is seldom used with CRC data that has a concentration range over several log scales.

paragraph Order 466 word/document.xml:/w:document[1]/w:body[1]/w:p[448]

Figure 13C. This figure uses log-transformed concentration values on a linear scale and is the most common method for plotting CRC data. The common sigmoidal curve resulting from the nonlinear regression analysis is shown.

paragraph Order 469 word/document.xml:/w:document[1]/w:body[1]/w:p[450]

In a different example, consider the data in Table 13 from a standard radioligand binding saturation analysis (29). The goal of this type of experiment is to determine a plateau of binding activity that results from varying the concentration of radioligand (x-axis) and measuring the binding response (pmol/mg).

Immediately before a table block

table Order 471 word/document.xml:/w:document[1]/w:body[1]/w:tbl[18]

Table caption. Table 13. Data from a saturation binding analysis.

nM

Log nM

pmol/mg

0.30

-0.516

0.385

0.53

-0.274

0.587

0.98

-0.008

0.908

1.7

0.238

1.38

2.7

0.432

1.93

4.4

0.646

3.13

6.9

0.842

3.70

11

1.05

4.57

18

1.26

5.00

29

1.47

5.35

47

1.67

5.68

75

1.87

5.77

Table footprint: 14 rows, 42 cells

Attached table caption from preceding Word paragraph

Follows a paragraph block

Precedes a paragraph block

paragraph Order 475 word/document.xml:/w:document[1]/w:body[1]/w:p[455]

Like Figure 13, the data in Table 13 was plotted using the concentration vales on the x-axis with a log scale (Figure 14A), a linear scale (Figure 14B) and a linear scale using log-transformed data (Figure 14C).

Immediately after a table block

paragraph Order 477 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[457]

image19.tiff Picture 33
paragraph Order 478 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[458]

Data from Table 13 was plotted using a log x-axis scale (A), linear x-axis scale (B) or log-transformed data on a linear x-axis scale (C). Data points are from a single measurement at each concentration level and were fit in GraphPad Prism with nonlinear regression routines as follows: (A) and (B) using a one-site hyperbolic function; (C) using a four-parameter log inhibitor versus response function.

paragraph Order 480 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[460]

Figure 14A. Using the actual concentration values with an x-axis logarithmic scale can yield information about the level of saturation achieved.

paragraph Order 482 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[462]

Figure 14B. A linear scale is used with the actual concentration values. The binding activity does not appear to change after ~30 nM concentration on the x-axis.

paragraph Order 484 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[464]

Figure 14C. Log-transformed concentration values on a linear scale are possible in software programs, however, this format is seldom used.

figure Order 486 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[466]

Figure. Figure 14. Saturation binding data plotted with three different x-axis scales.

paragraph Order 488 word/document.xml:/w:document[1]/w:body[1]/w:p[468]

The graph in Figure 14B is most commonly used when evaluating saturation binding experiments. However, the graph in Figure 14A may be a better example of demonstrating that the binding activity reaches a significant asymptote.

paragraph Order 490 word/document.xml:/w:document[1]/w:body[1]/w:p[470]

Therefore, the type of scale chosen for each graph should be carefully evaluated to ensure that information is conveyed as desired.

heading Order 493 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[473]

H3. Axis Scale Range

paragraph Order 494 word/document.xml:/w:document[1]/w:body[1]/w:p[474]

In most cases, the vertical (y) axis should begin at zero. Not having the origin begin at zero can distort the relative magnitudes between data values. This concept is shown in Figure 15A (scale begins at 1.6) and Figure 15B (scale begins at zero) for the same data values.

paragraph Order 496 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[476]

image20.tiff Picture 34
paragraph Order 497 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[477]

<?escape?>(A) Y-axis scale begins at a value of 1.6 and skews the relative difference between the three samples. (B) Y-axis scale begins at zero and the relative differences between the three samples are properly depicted.

figure Order 498 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[478]

Figure. Figure 15. Example bar graphs with non-zero and zero y-axis origin.

paragraph Order 500 word/document.xml:/w:document[1]/w:body[1]/w:p[480]

Starting the y-axis scale at zero may be preferred, but consider the example shown in Figure 16 where the same data is plotted with a y-axis scale that begins at zero (Figure 16A) and a y-axis scale that does not begin at zero (Figure 16B).

paragraph Order 502 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[482]

image21.tiff Picture 35
paragraph Order 503 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[483]

<?escape?>(A) Y-axis scale begins at a value of zero; the data appears to be uniform. (B) Y-axis scale begins at a value of 3800 and a repetitive pattern is observed. This is an example of systematic error.

figure Order 504 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[484]

Figure. Figure 16. Example line graphs with non-zero and zero y-axis origin.

paragraph Order 506 word/document.xml:/w:document[1]/w:body[1]/w:p[486]

While this data has been exaggerated to demonstrate a point, these types of patterns can exist with assays that involve automated liquid handlers, plate transfers, and detection equipment and usually suggest a causal relationship that can be investigated. If identified, these effects can be corrected and quality control chart monitored to reduce the overall noise and variation in the assay (1). Viewing data by rows or columns with a non-zero y-axis scale may be necessary to identify potential issues or patterns.

paragraph Order 508 word/document.xml:/w:document[1]/w:body[1]/w:p[488]

An example where a reduced scale may be required is demonstrated in Figure 17. Here a single obvious extreme outlier (data point in red) enlarges the scale, which masks the effect of the remaining data including the variability of n=3 technical replicates. Changing the scale so that the obvious outlier is not shown may yield the desired curve. An explanation for excluding the data point should be described in a figure legend, text, orally, etc.

paragraph Order 510 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[490]

image22.tiff Picture 36
paragraph Order 511 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[491]

<?escape?>(A) One extreme outlier (in red) creates a large y-axis scale that obscures the remaining data points. (B) The extreme outlier is omitted and a standard sigmoidal CRC curve is observed compared to the previously flattened CRC shown in A (n=3 technical replicates).

figure Order 512 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[492]

Figure. Figure 17. Effect of including all data when an extreme outlier exists.

paragraph Order 514 word/document.xml:/w:document[1]/w:body[1]/w:p[494]

Alternatively, the use of a broken axis (see Figure 19) may be an acceptable alternative to show both the expected CRC and the outlier.

paragraph Order 516 word/document.xml:/w:document[1]/w:body[1]/w:p[496]

Finally, two different samples that are tested in the same experiment should have the same scale range so that relative differences are obvious to the observer. Figure 18 demonstrates how conclusions could be misinterpreted when the y-axis scales for like-treated samples are not kept the same. In this example, Sample 1 and Sample 2 are two different preparations of the same protein tested in the same assay with the same conditions and reagents. Figure 18 panels A and C may give the appearance that Sample 1 and Sample 2 have similar activities, since y-axis scales, relative to each maximum, were used in the two graphs. Figure 18 panels B and D demonstrate the difference in relative activities for the two samples, when the same y-axis scale is used in each graph.

paragraph Order 518 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[498]

image23.tiff Picture 38
paragraph Order 519 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[499]

Sample 1 and Sample 2 are two different protein preparations that were treated in the same experiment with the same reagents and conditions. Nonspecific binding (solid squares) and total binding in the absence of competitor (open circles) were measured. In A and C, the y-axis scale range is determined by the maximum observed response for each sample. In B and D, the same y-axis scale range is used for both samples. Panels B and D give an accurate representation of their relative activities compared to each other (n=1).

figure Order 520 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[500]

Figure. Figure 18. The effect of using different y-axis scales on similarly treated samples.

paragraph Order 522 word/document.xml:/w:document[1]/w:body[1]/w:p[502]

Therefore, the scale range should be carefully evaluated to ensure that information is conveyed appropriately. In some cases, the activity of Sample 1 should be depicted as shown in Figure 18A to indicate that there is potentially a useable signal with that sample, depending on the situation.

paragraph Order 524 word/document.xml:/w:document[1]/w:body[1]/w:p[504]

Generally, it is acceptable to extend the scale range a few percent on either side of the axis to avoid data points on the plot frame (if using one) or on the axis extremes.

heading Order 527 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[507]

H3. Broken Axes

paragraph Order 528 word/document.xml:/w:document[1]/w:body[1]/w:p[508]

If a broken axis is used to emphasize a specific point regarding the data being plotted, a note to indicate the broken axis in the figure legend, text, discussion, etc. should be included. This alerts the observer to this non-standard technique and can prevent misinterpretation of the data. Figure 19 utilizes a broken y-axis to capture all data points of a CRC experiment (same data presented in Figure 17). It demonstrates the value of a broken axis so that key CRC information is retained while including all data points.

paragraph Order 530 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[510]

image24.tiff Picture 39
paragraph Order 531 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[511]

A broken y-axis is used to include an outlier while still maintaining an appropriate concentration response curve for analysis. See the graphs in Figure 17 to compare other possible ways to present this data (n=3 technical replicates).

figure Order 532 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[512]

Figure. Figure 19. Using a broken axis to include all data points in a CRC.

heading Order 535 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[515]

H3. Tick Marks

paragraph Order 536 word/document.xml:/w:document[1]/w:body[1]/w:p[516]

The most common placement of tick marks in a graph are on the outside of the axis; however, tick marks inside the axis are acceptable in many cases.

paragraph Order 538 word/document.xml:/w:document[1]/w:body[1]/w:p[518]

The number of tick marks being used on a graph axis should be chosen to represent the scale range adequately, without clutter, including extraneous tick marks that distract the reader. Placing tick marks at round integer numbers to correspond with the range is most frequently practiced. Minor tick marks and unlabeled tick marks should be avoided whenever possible.

paragraph Order 540 word/document.xml:/w:document[1]/w:body[1]/w:p[520]

The examples shown in Figure 20 demonstrate three different axis scales for the same data range. In Figure 20A, the tick marks are integers and evenly spaced across the scale range, which represents a desirable presentation of the axis. In Figure 20B, there are unnecessary minor tick marks included. In Figure 20C, there are too many numbered major tick marks and the labels are uneven integers.

paragraph Order 542 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[522]

image25.tiff Picture 40
paragraph Order 543 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[523]

<?escape?>(A) A standard, visually acceptable axis with even integers spaced across the entire length of the axis. (B) This axis has unmarked minor tick marks which do not add information necessary to understand a graph. (C) This axis has too many tick mark labels and the labels are non-standard integers.

figure Order 544 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[524]

Figure. Figure 20. The effect of the number of tick mark labels on a graph axis.

paragraph Order 546 word/document.xml:/w:document[1]/w:body[1]/w:p[526]

When the data values being plotted are large, resulting in axis labels with several zeros (Figure 21A), divide the numbers by a constant factor and indicate the manipulation in the axis label as shown in Figure 21B. The message and interpretation of the data is the same with the y-axis scale depicted in Figure 21B being clearer.

paragraph Order 548 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[528]

image26.tiff Picture 41
paragraph Order 549 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[529]

Plotted are raw data values using exponential notation on the y-axis scale (A) and a transformed scale on the y-axis using a multiplier expression indicated in the x-axis label (B). In this example, data points are connected with straight lines. The connected segments imply an appropriate continuous trend between the data points.

figure Order 550 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[530]

Figure. Figure 21. Y-axis scale with large numbers.

heading Order 553 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[533]

H3. Axes Labels

paragraph Order 554 word/document.xml:/w:document[1]/w:body[1]/w:p[534]

Accurate, unambiguous axes labels are important to avoid confusion regarding the data being plotted in a graph. As an example, all of the following labels were intended to represent log-transformed concentrations plotted on the x-axis. They were found within published journal articles for CRCs. Some comments are listed after the label:

paragraph Order 556 word/document.xml:/w:document[1]/w:body[1]/w:p[536]

Log [Compound]: Doesn’t specify the concentration units

paragraph Order 557 word/document.xml:/w:document[1]/w:body[1]/w:p[537]

Log [Compound] (M): Acceptable

paragraph Order 558 word/document.xml:/w:document[1]/w:body[1]/w:p[538]

Log [Compound], M: Acceptable

paragraph Order 559 word/document.xml:/w:document[1]/w:body[1]/w:p[539]

Log Compound Concentration (M): Acceptable (less standard)

paragraph Order 560 word/document.xml:/w:document[1]/w:body[1]/w:p[540]

Log [Compound]/M: Unclear, without further information on whether the x-axis values are a ratio

paragraph Order 561 word/document.xml:/w:document[1]/w:body[1]/w:p[541]

[Compound] in Log (M): Acceptable (less standard)

paragraph Order 562 word/document.xml:/w:document[1]/w:body[1]/w:p[542]

Log10 (Concentration (M)): Acceptable (less standard)

paragraph Order 563 word/document.xml:/w:document[1]/w:body[1]/w:p[543]

[Compound], M: Doesn’t specify that that the concentrations are log-transformed

paragraph Order 564 word/document.xml:/w:document[1]/w:body[1]/w:p[544]

Compound [Log M]: Non-standard

paragraph Order 565 word/document.xml:/w:document[1]/w:body[1]/w:p[545]

Log Compound, M: Suggests log of a compound name rather than a compound concentration

paragraph Order 566 word/document.xml:/w:document[1]/w:body[1]/w:p[546]

[Compound], Log M: Possibly suggests that the units are Log M

paragraph Order 567 word/document.xml:/w:document[1]/w:body[1]/w:p[547]

Compound, Log [M]: Acceptable (less standard)

paragraph Order 569 word/document.xml:/w:document[1]/w:body[1]/w:p[549]

This example illustrates a couple of important principles:

list Order 570 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[550]
  • Be consistent with the labels throughout a set of graphs
list Order 571 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[551]
  • Make sure that the label describes accurately what is being plotted
paragraph Order 573 word/document.xml:/w:document[1]/w:body[1]/w:p[553]

Y-axis labels should follow the same principles as those listed above. An important example is a scale from zero to 100 that could be a percent scale or actual data values. The label should always indicate percent if that is the intention of the data being plotted.

heading Order 576 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[556]

H2. Types of Graphs

paragraph Order 578 word/document.xml:/w:document[1]/w:body[1]/w:p[558]

Many graph and figure types exist so that your data can be tailored to a specific purpose or audience, regardless of whether it is in written or spoken format. A table for choosing when to use common data presentation techniques has been previously published (48). One online source even describes 44 types of graphs that can be chosen to present information (49). This section focuses on bar graphs, line graphs, scatterplots, frequency distributions, and heat maps.

heading Order 580 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[560]

H3. Bar Graph

paragraph Order 581 word/document.xml:/w:document[1]/w:body[1]/w:p[561]

Bar graphs contain horizontal or vertical bars with lengths proportional to the value of the data. A bar graph is usually used to show the relative differences between categories of data. See Figure 15 for an example.

paragraph Order 583 word/document.xml:/w:document[1]/w:body[1]/w:p[563]

If a bar graph has too many bars, it becomes cumbersome and can be difficult to interpret (Figure 22A). A scatterplot or heat map may be better choices for this level of data. Too few bars (Figure 22B) and the data in the plot could be displayed in a table or described within text and still be effective.

paragraph Order 585 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[565]

image27.tiff Picture 42
paragraph Order 586 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[566]

A representation of a bar graph with too many data points or bars (A) or too few bars (B).

figure Order 587 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[567]

Figure. Figure 22. Bar graphs with too many and too few values.

paragraph Order 589 word/document.xml:/w:document[1]/w:body[1]/w:p[569]

With respect to bar graphs, the following is also recommended:

list Order 590 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[570]
  • Use colorblind-accessible color combinations whenever possible. Red/green pairings are most problematic but refer to ColorBrewer, an online diagnostic tool for evaluating the robustness of individual color schemes, for more information.
list Order 591 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[571]
  • Ideally, keep text on the baseline axis in the horizontal direction, without overlap.
list Order 592 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[572]
  • The error (SD or SEM) should be addressed as discussed above (see Figure 1 and Figure 3).
heading Order 595 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[575]

H3. Line Graph

paragraph Order 596 word/document.xml:/w:document[1]/w:body[1]/w:p[576]

A line graph is a series of data points connected by a line or line segments. The lines may be connected data points (Figure 1B), a linear regression fit (Figure 6 or Figure 7) or a nonlinear regression fit (Figure 14). For the purposes of definition, linear regression uses a linear model to determine the relationship between a dependent variable and one or more independent variables. By contrast, non-linear regression is a form of analysis modeled by a non-linear function that utilizes model parameters and one or more independent variables.

paragraph Order 598 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[578]

Like a bar graph, if there are too many lines the reader is distracted from the message of the data in a line graph. This is especially true if the curves being shown have similar activities (Figure 23). Figures with too many curves often force the preparer to include a legend within the graph, which can be distracting and, in many cases, is considered chart junk.

paragraph Order 600 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[580]

image28.tiff Picture 10
paragraph Order 601 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[581]

In this graph, the CRC for several compounds are shown with nonlinear regression curve fits. Each line has a unique symbol and the symbol legend appears at the right. There are too many compounds being shown on this graph to decipher the individual information.

figure Order 602 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[582]

Figure. Figure 23. Multiple compound CRC graph.

paragraph Order 604 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[584]

Considerations include:

list Order 605 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[585]
  • Use colorblind-accessible color combinations whenever possible. (See above for more info.)
list Order 606 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[586]
  • Use colors only when necessary – avoid excessive use.
list Order 607 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[587]
  • The thickness of lines should be such that they are clearly visible but do not obscure individual data points.
list Order 608 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[588]
  • The method used to fit a set of data points to a line should always be conveyed with the graph (via legend, text, discussion, etc.).
paragraph Order 610 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[590]

In place of Figure 23, showing a representative curve for one or two compounds along with a table of calculated results (e.g. pIC50) for the other compounds tested may provide an improved approach for presenting this type of data. An example of this concept is shown in Figure 24 where the two compounds with the largest difference in activity are displayed in the graph (A) and the activity (pIC50) of all the compounds tested is displayed in the table (B).

paragraph Order 612 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[592]

Figure 24 represents an additional reason for using negative log-transformed activity values (pIC50) when comparing compounds. The larger the pIC50 value, the more potent the compound. Other advantages include the following: the nonlinear curve fit routine solves for the log IC50, the error associated with the pIC50 is symmetric and normally-distributed and expressed with the same level of significance (see significant digits section) as well as making the determination of the geometric means easier.

paragraph Order 613 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[593]

image29.tiff Picture 1
paragraph Order 614 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[594]

<?escape?>(A) Concentration response curves for two compounds in a study (n = 1). The curves represent the most potent compound (Compound 1, closed squares) and least potent compound (Compound 10, open circles) in an experiment that tested ten compounds. (B) Table showing the pIC50 values for all ten compounds tested in the experiment.

figure Order 615 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[595]

Figure. Figure 24. CRC and table for several test compounds.

heading Order 618 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[598]

H3. Scatter Plot

paragraph Order 619 word/document.xml:/w:document[1]/w:body[1]/w:p[599]

A scatter plot (also referred to as a scatter gram) is a graphic visualization of two-dimensional data using dots to represent data values. Scatter plots have x and y-axes and each data point is a coordinate on the plot. Scatterplots are often used to demonstrate the activity of the wells on a microplate, such as plate uniformity studies (28) and large data sets. In addition, they are the type of graphs used in the correlation plots discussed earlier (Figure 6 and 7).

paragraph Order 621 word/document.xml:/w:document[1]/w:body[1]/w:p[601]

Color coding data points can provide additional information within a scatter plot as shown in Figure 25 for plate data with the color-coded max and min plate controls. However, color-coding individual data points can be tedious and the data may be better represented with a heat map (Figure 28 and 29).

paragraph Order 622 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[602]

image30.tiff Picture 44
paragraph Order 623 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[603]

Individual well data from a 384-well plate that includes positive controls (red) and negative controls (blue). Wells are aligned by columns on the plate.

figure Order 624 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[604]

Figure. Figure 25. Scatterplot example.

paragraph Order 626 word/document.xml:/w:document[1]/w:body[1]/w:p[606]

With respect to scatter graphs, the following is also recommended:

list Order 627 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[607]
  • Use colorblind-accessible color combinations whenever possible. (See above for more info.)
list Order 628 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[608]
  • Data points that are too large obscure the detail of the pattern or trend being depicted in the scatterplot.
heading Order 631 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[611]

H3. Frequency Distribution

paragraph Order 632 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[612]

A frequency distribution typically uses bars or rectangles and can include a further analysis for normality, such as a Gaussian curve, embedded in the plot. The x-axis represents a range or group of ranges (“bins”) and the y-axis represents the frequency at each range or bin. An example of a frequency distribution is shown in Figure 26. In this example, the three panels demonstrate frequency distributions using an appropriate number of bins (A), too many bins (B) and too few bins (C).

paragraph Order 634 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[614]

image31.tiff Picture 45
paragraph Order 635 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[615]

Activity is divided into bins of percent specific inhibition (x-axis). The number of compounds in each bin (y-axis) are represented by the bars. To test for normality, a Gaussian distribution (red line) was fit to the frequency data. (A) Represents an appropriate number of separation bins (at 10% inhibition intervals), while (B) has too many separation bins (at 5 % inhibition intervals) and (C) has too few separation bins (at 20% inhibition intervals).

figure Order 636 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[616]

Figure. Figure 26. Frequency distribution.

paragraph Order 638 word/document.xml:/w:document[1]/w:body[1]/w:p[618]

With respect to frequency distributions, the following is also recommended:

list Order 639 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[619]
  • The number of bins ultimately depends on the number of data points, and determining the number of bins (as demonstrated above), may be a trial and error process to achieve a desired graphical result.
heading Order 642 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[622]

H3. Alternative Plots

paragraph Order 643 word/document.xml:/w:document[1]/w:body[1]/w:p[623]

While widely used, bar graphs have significant limitations. Since bar graphs only display summary statistics (generally mean and SD or SE) it is possible to generate identical bar graphs from different data sets due to outliers, bimodal distributions, differences in sample sizes, confounding variables or other reasons (50). Alternative graphical methods which display all of the data and the distribution information, such as Univariate Scatterplots, Box plots (which graphically overlay the summary statistics) or Violin plots are preferable. Since most preclinical studies use relatively small sample groups (n < 15) this is very feasible and Weissgerber et al (51) have provided an online tool which allows scientists to easily generate and download these plots using their own data. (Figure 27).

paragraph Order 645 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[625]

Picture 15
image32.PNG Picture 15
paragraph Order 646 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[626]

The full data may suggest different conclusions from the summary statistics. The means and SE values for the four example datasets shown in B-E are all within 0.5 units of the means and SE values shown in the bar graph A. ρ values were calculated in R statistical software (version 3.0.3) using an unpaired t-test, an unpaired t-test with Welch’s correction for unequal variances, or a Wilcoxon rank sum test. In B, the distribution in both groups appears symmetric. Although the data suggest a small difference between groups, there is substantial overlap between groups. In C, the apparent difference between groups is driven by an outlier. D suggests a possible bimodal distribution. Additional data are needed to confirm that the distribution is bimodal and to determine whether this effect is explained by a covariate. In E, the smaller range of values for group 2 may simply be due to the fact that there are only three observations. Additional data for group 2 would be needed to determine whether the groups are actually different. var, variance. (Adapted from Weissgerber et al. (50) under a creative commons license. Figure and figure legend used with permission.)

figure Order 647 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[627]

Figure. Figure 27. Many different distributions can lead to the same bar graph.

heading Order 650 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[630]

H3. Heat Maps

paragraph Order 651 word/document.xml:/w:document[1]/w:body[1]/w:p[631]

A heat map is a representation of data values using a color scale or grayscale. The resulting graph can be in a matrix format, which makes them popular for data generated using microplates. Figure 28 shows a heat map for the same data that was previously graphed in a scatter plot (Figure 25).

paragraph Order 653 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[633]

image33.tiff Picture 46
paragraph Order 654 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[634]

The scale at the right of the heat map shows the colors associated with the signal range. Note that in this example, the positive controls are in wells A1-H2 and I23-P24. The negative controls are in wells I1-P2 and A23-H24.

figure Order 655 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[635]

Figure. Figure 28. Heat map for the 384-well plate data shown in Figure 21.

paragraph Order 657 word/document.xml:/w:document[1]/w:body[1]/w:p[637]

Heat maps can be used to view multiple plates at a time and to assist with efficient identification of data patterns, position effects and trends (52).

paragraph Order 659 word/document.xml:/w:document[1]/w:body[1]/w:p[639]

With respect to heat maps, the following is also recommended:

list Order 660 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[640]
  • Individual data points can be framed with a thin border. Frames are typically avoided when there is a large amount of data points such that the frames would obscure the interpretation of the data.
list Order 661 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[641]
  • Heat maps can be displayed in color or grayscale. The choice of color versus grayscale often depends on the data being presented. For data with large dynamic ranges, two colors may be appropriate, whereas for data with small dynamic ranges, grayscale may be sufficient (Figure 29). As stated previously, the use of red and green coloring schemes should be avoided. (This is perhaps a relic of certain microarrays that utilized red and green fluorescent proteins to assay gene expression.)
list Order 662 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[642]
  • As with other plot formats, the scaling of the color can be adjusted to better illustrate key scientific points, such as plate positional effects where relatively minor systematic errors may be significant (Figure 29).
list Order 663 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[643]
  • Outliers can be displayed on heat maps as a separate color and then noted in the legend.
list Order 664 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[644]
  • Consider providing a supplemental file listing the data in numerical format. Data points can also be printed within each matrix point, though this can add considerable visual artifacts that detract from interpreting the figure (Figure 29E).
list Order 665 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[645]
  • The use of multiple colors (“rainbow schemes”) can make the perception of gradients difficult (Figure 29F) and are generally more appropriate for categorical or grouped data.
list Order 666 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[646]
  • Another approach to emphasizing positional effects is to express data as deviation from the mean or median values (Figure 29G). This is also amenable to analyzing data for up- or down-regulation, such as gene expression or protein abundance.
paragraph Order 668 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[648]

image34.tiff Picture 14
paragraph Order 669 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[649]

Data are normalized fluorescence intensity measurements from a uniformity plate (i.e., all wells should have equal, 100% max signal) to assess for microplate positional effects. Clearly, rows I, J, M, and N show decreased signal, which is due to a clogged liquid dispensing nozzle. Panels A and B demonstrate the effect of varying scales; a coloring scheme starting at the minimum value (panel B) better highlights the systematic errors in this particular data. The same data can be plotted in color (panels C and D), depending on the desired aesthetics. Individual values can be printed within each matrix point (panel E), but this generally adds noise and should be done only when necessary. The same data can be plotted in a rainbow coloring scheme (panel F), though the meaning of individual colors is not necessarily intuitive. Finally, the same data can be plotted as a function of deviation from the mean or median (panel G). (Unpublished data courtesy of JL Dahlin.)

figure Order 670 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[650]

Figure. Figure 29. Effect of color schemes on heat maps.

heading Order 673 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[653]

H3. Three-Dimensional Graphs

paragraph Order 674 word/document.xml:/w:document[1]/w:body[1]/w:p[654]

In most cases, 3-dimensional (3D) graphs are distracting to the reader and can be difficult to interpret due to the added complexity or distortion that can occur. They are often used in business publications and newspapers but should be avoided in scientific data presentations. A simple, classic example is shown in Figure 30 using a standard bar graph and a 3-D bar graph generated with Microsoft Excel. In the example, both samples 2 and 4 have median values above 100% of the control in the standard bar graph. However, in the 3-D bar graph, it appears that samples 2 and 4 do not reach the 100% of control level based on the axis scale lines for the graph. While this is a matter of perspective, it can lead to incorrect analysis and conclusions.

paragraph Order 675 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[655]

image35.tiff Picture 47
paragraph Order 676 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[656]

<?escape?>(A) (Typical bar graph) and (B) (3-D bar graph) are from the same data. Five different samples were tested for activity against a control. Blue bars are total activity and grey bars are nonspecific activity. Bars are the median of 8 technical replicates from the same run.

figure Order 677 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[657]

Figure. Figure 30. Standard and 3D bar graphs.

heading Order 680 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[660]

H2. Graphing/Statistical Software Programs

paragraph Order 681 word/document.xml:/w:document[1]/w:body[1]/w:p[661]

This chapter asserts no preference in software used for creating graphs or statistically analyzing data. Specific programs cited are at the discretion of the authors, based on experience. Some notable software programs for graphing or statistical analysis include, but are not limited to, the list below.

paragraph Order 683 word/document.xml:/w:document[1]/w:body[1]/w:p[663]

GraphPad Prism

paragraph Order 684 word/document.xml:/w:document[1]/w:body[1]/w:p[664]

JMP

paragraph Order 685 word/document.xml:/w:document[1]/w:body[1]/w:p[665]

KaleidaGraph

paragraph Order 686 word/document.xml:/w:document[1]/w:body[1]/w:p[666]

MATLAB

paragraph Order 687 word/document.xml:/w:document[1]/w:body[1]/w:p[667]

Microsoft Excel

paragraph Order 688 word/document.xml:/w:document[1]/w:body[1]/w:p[668]

Minitab

paragraph Order 689 word/document.xml:/w:document[1]/w:body[1]/w:p[669]

Origin

paragraph Order 690 word/document.xml:/w:document[1]/w:body[1]/w:p[670]

R Statistical Package

paragraph Order 691 word/document.xml:/w:document[1]/w:body[1]/w:p[671]

SAS

paragraph Order 692 word/document.xml:/w:document[1]/w:body[1]/w:p[672]

ScreenAble

paragraph Order 693 word/document.xml:/w:document[1]/w:body[1]/w:p[673]

SigmaPlot

paragraph Order 694 word/document.xml:/w:document[1]/w:body[1]/w:p[674]

SPSS

paragraph Order 695 word/document.xml:/w:document[1]/w:body[1]/w:p[675]

Stata

heading Order 697 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[677]

H2. Tables

paragraph Order 698 word/document.xml:/w:document[1]/w:body[1]/w:p[678]

It is important to note that graphs and tables may lead to completely different interpretations, even with the same data set. Each has a purpose, depending on how the data will be used and who the consumers of the data will be. Large tables do not work well in a presentation setting and graphs may not provide the level of detail required for a further calculation or show exact values and differences when analyzing data.

paragraph Order 700 word/document.xml:/w:document[1]/w:body[1]/w:p[680]

Tables should not contain so much data that they are hard to follow. Likewise, tables should not have print that is so small that it is difficult to read. Conversely, simple tables with only a few values may be less effective.

paragraph Order 702 word/document.xml:/w:document[1]/w:body[1]/w:p[682]

Considerations for tables include:

list Order 703 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[683]
  • Include a legend/caption where needed to describe or define any abbreviations, etc.
list Order 704 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[684]
  • Include units with the table (e.g. as part of the header title for the column)
list Order 705 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[685]
  • Use a minimum number of significant digits that are consistent throughout the table
list Order 706 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[686]
  • Shading rows or columns can help improve readability
list Order 707 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[687]
  • Use lines to separate sections; do not overuse lines within the table
list Order 708 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[688]
  • Include only essential data
list Order 709 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[689]
  • Be clear, concise and legible
list Order 710 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[690]
  • There should be adequate space within the table for clarity
list Order 711 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[691]
  • Use caution with word wrapping, especially when it only occurs in a couple of table cells
paragraph Order 713 word/document.xml:/w:document[1]/w:body[1]/w:p[693]

Some examples of tables appear earlier in this chapter.

heading Order 716 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[696]

H2. Interactive Data Visualization

paragraph Order 717 word/document.xml:/w:document[1]/w:body[1]/w:p[697]

Visual inspection of graphic displays can identify natural groupings of data points suggestive of correlations in the underlying data. Modern computer graphics interfaces often have the capability to use a computer’s mouse or other pointing device to select one or more data points in a graphic for further investigation. Some of the more commonly encountered types of these tools are discussed below.

heading Order 719 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[699]

H3. Data Hints

paragraph Order 720 word/document.xml:/w:document[1]/w:body[1]/w:p[700]

The data or chart “hint” is often used to describe the action that occurs when the mouse hovers over, or is clicked on, single points (e.g. scatterplot) or specific sub-plot regions of a multi-plot display. Typically, this is used to display an informational child window near the selected point/region. An example of a chart hint is shown in Figure 31. If a data grid is visible in the application, then a useful side effect would be to select (highlight) the row of interest.

paragraph Order 721 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[701]

Picture 11
image36.jpg Picture 11
paragraph Order 722 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[702]

In this example, hovering over the curve provides a value for each of the parameters listed as a quick reference to the underlying data row contents. The availability of hints for graphics is typically indicated by a hand with pointing finger cursor.

figure Order 723 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[703]

Figure. Figure 31. The result of a chart hint action on a multi-panel CRC plot.

heading Order 725 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[705]

H3. Brushing Data

paragraph Order 726 word/document.xml:/w:document[1]/w:body[1]/w:p[706]

Data brushing refers to using the pointer in a click-and-drag fashion to define a region that encompasses one or more data points of interest on a chosen graphic. The selected points/region are subsequently “highlighted” (e.g. using color or plot characters) in the selection plot device in order to focus subsequent User attention. Typically, graphic plots using other variables/responses are similarly updated and the relevant data grid rows are selected. For maximal utility, this should be a two-way process and the ability to highlight graph points from row selections of data displayed in grids following complex sorting operations should also be available.

heading Order 728 Level 3 Style Heading3 word/document.xml:/w:document[1]/w:body[1]/w:p[708]

H3. Context Menus

paragraph Order 729 word/document.xml:/w:document[1]/w:body[1]/w:p[709]

Many graphics displays allow the development of menu functionality for common operations such as copying, printing and file creation to be readily available to the user. The content of these menus can be programmatically linked to the type of plot and the data under consideration. Plot scaling can be keyed to mouse behavior (e.g. mouse wheel) and, for 3-dimensional graphics, useful behaviors such as rotational direction and speed can be intuitively linked to gestural pointer actions. Figure 32 demonstrates the concept of a context menu.

paragraph Order 730 Style Figuregraphic word/document.xml:/w:document[1]/w:body[1]/w:p[710]

Picture 12
image37.gif Picture 12
paragraph Order 731 Style Figurecaptioncontinued word/document.xml:/w:document[1]/w:body[1]/w:p[711]

Several options for rotational control, plotting of axes etc. can be accessed using either the pointer or keyboard shortcut combinations. Options with checkboxes are those that have an on/off toggle function.

figure Order 732 Style Figurenumberandcaption word/document.xml:/w:document[1]/w:body[1]/w:p[712]

Figure. Figure 32. Example of a Context Menu associated with a 3D graphics display.

heading Order 734 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[714]

H2. Databases

paragraph Order 735 word/document.xml:/w:document[1]/w:body[1]/w:p[715]

A well designed and properly implemented Relational Database Management System (RDBMS) (53) is a valuable tool not only for the storage of information but also for promoting both reproducibility of experiments and consistent, appropriate data reporting. A database offers a number of advantages when compared to flat files such as spreadsheets:

list Order 737 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[717]
  • Data Safety and Integrity. Built in features of the database automatically back up data and track any changes made to the data. In the event of errors, changes can be rolled back
list Order 739 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[719]
  • Data Consistency. Much of the data in spreadsheets is repeated and subject to errors in data entry such as typographic errors or cut and paste errors. A structured data model eliminates this redundancy and provides tools to validate data entries. Additionally, having all the data in one place simplifies the identification of systematic shifts or random errors in the data.
list Order 741 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[721]
  • Change Management. Results can be recalculated automatically from the original primary data to explore new models or allow for changes in methodology. Explanations of any changes along with a record of when and who made the change(s) are also systematically captured.
list Order 743 Style ListParagraph word/document.xml:/w:document[1]/w:body[1]/w:p[723]
  • Data Analysis and Reporting. Structured reports ensure that data is always presented in a consistent manner with proper mathematical and statistical treatment, regardless of the sophistication of the user, while still allowing for export and exploration of the data to other tools. Best practices, such as those described in this chapter, can be incorporated into these reports. Reporting of both primary and derived data has also become more common for both publication and funding and has been demonstrated to enhance both transparency and reproducibility.
paragraph Order 745 word/document.xml:/w:document[1]/w:body[1]/w:p[725]

There are numerous commercial and open source implementations of RDBMS technologies. All of them feature some implementation of standardized Structured Query Language (SQL) commands for obtaining results for data reporting purposes. For maximal data integrity and control of data flow, the chosen system should also feature a procedural language for sophisticated development and implementation of the automated processes discussed above. Most commercially available Laboratory Information Management Systems (LIMS) use RDBMS technology to provide much of their functionality and represent a path to obtain the benefits of a relational database without the need to directly manage one.

heading Order 748 Level 2 Style Heading2 word/document.xml:/w:document[1]/w:body[1]/w:p[728]

H2. References

reference Order 749 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[729]

1. Beck B, Chen YF, Dere W, Devanarayan V, Eastwood BJ, Farmen MW, et al. Assay Operations for SAR Support. In: Sittampalam GS, Gal-Edd N, Arkin M, Auld D, Austin C, Bejcek B, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

reference Order 751 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[731]

2. Devanarayan V, Sawyer BD, Montrose C, Johnson D, Greenen DP, Sittampalam GS, et al. Glossary of Quantitative Biology Terms. In: Sittampalam GS, Gal-Edd N, Arkin M, Auld D, Austin C, Bejcek B, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

reference Order 753 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[733]

3. Campbell RM, Dymshitz J, Eastwood BJ, Emkey R, Greenen DP, Heerding JM, et al. Data Standardization for Results Management. In: Sittampalam GS, Gal-Edd N, Arkin M, Auld D, Austin C, Bejcek B, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

reference Order 755 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[735]

4. Tufte ER. The Visual Display of Quantitative Information. 1st ed. Chelshire, Conneticut: Graphics Press; 1983.

reference Order 757 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[737]

5. King L. Preparing better graphs. Journal of Public Health and Emergency. 2018;2(1).

reference Order 759 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[739]

6. Boers M. Designing effective graphs to get your message across. Annals of the rheumatic diseases. 2018;77(6):833-9. doi: 10.1136/annrheumdis-2018-213396. PubMed PMID: 29748338.

reference Order 761 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[741]

7. Kelleher C, Wagener T. Ten guidelines for effective data visualization in scientific publications. Environmental Modelling & Software. 2011;26(6):822-7. doi: https://doi.org/10.1016/j.envsoft.2010.12.006.

reference Order 763 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[743]

8. Puhan MA, ter Riet G, Eichler K, Steurer J, Bachmann LM. More medical journals should inform their contributors about three key principles of graph construction. J Clin Epidemiol. 2006;59(10):1017-22. doi: 10.1016/j.jclinepi.2005.12.016. PubMed PMID: 16980140.

reference Order 765 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[745]

9. Rougier NP, Droettboom M, Bourne PE. Ten Simple Rules for Better Figures. PLOS Computational Biology. 2014;10(9):e1003833. doi: 10.1371/journal.pcbi.1003833.

reference Order 767 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[747]

10. Cleveland WS. The Elements of Graphing Data. Summit, NJ: Hobart Press; 1985.

reference Order 769 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[749]

11. Cabanski C, Gilbert H, Mosesova S. Can Graphics Tell Lies? A Tutorial on How To Visualize Your Data. Clin Transl Sci. 2018;11(4):371-7. doi: 10.1111/cts.12554. PubMed PMID: 29603646; PubMed Central PMCID: PMCPMC6039197.

reference Order 771 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[751]

12. Haas JV, Eastwood BJ, Iversen PW, Weidner JR. Minimum Significant Ratio - A Statistic to Asses Assay Variability. In: Sittampalam GS, Gal-Edd N, Arkin M, Auld D, Austin C, Bejcek B, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

reference Order 773 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[753]

13. Blackstone EH. Rounding numbers. J Thorac Cardiovasc Surg. 2016;152(6):1481-3. doi: 10.1016/j.jtcvs.2016.09.003. PubMed PMID: 27726878.

reference Order 775 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[755]

14. Nagele P. Misuse of standard error of the mean (SEM) when reporting variability of a sample. A critical evaluation of four anaesthesia journals. Br J Anaesth. 2003;90(4):514-6. PubMed PMID: 12644429.

reference Order 777 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[757]

15. Sedgwick P. Standard deviation or the standard error of the mean. Br Med J. 2015;350:h831. doi: 10.1136/bmj.h831.

reference Order 779 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[759]

16. Motulsky HJ. Common misconceptions about data analysis and statistics. Naunyn-Schmiedeberg's archives of pharmacology. 2014;387(11):1017-23. doi: 10.1007/s00210-014-1037-6. PubMed PMID: 25213136; PubMed Central PMCID: PMCPMC4203998.

reference Order 781 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[761]

17. Motulsky HJ. Common misconceptions about data analysis and statistics. The Journal of pharmacology and experimental therapeutics. 2014;351(1):200-5. doi: 10.1124/jpet.114.219170. PubMed PMID: 25204545.

reference Order 783 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[763]

18. Motulsky HJ. Common misconceptions about data analysis and statistics. Pharmacology research & perspectives. 2015;3(1):e00093. doi: 10.1002/prp2.93. PubMed PMID: 25692012; PubMed Central PMCID: PMCPMC4317225.

reference Order 785 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[765]

19. Motulsky HJ. Common misconceptions about data analysis and statistics. Br J Pharmacol. 2015;172(8):2126-32. doi: 10.1111/bph.12884. PubMed PMID: 25134425; PubMed Central PMCID: PMCPMC4386986.

reference Order 787 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[767]

20. Zhang JH, Chung TD, Oldenburg KR. A Simple Statistical Parameter for Use in Evaluation and Validation of High Throughput Screening Assays. J Biomol Screen. 1999;4(2):67-73. PubMed PMID: 10838414.

reference Order 789 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[769]

21. Baker M. Is there a reproducibility crisis? Nature. 2016;533(7604):452-4. doi: 10.1038/533452a. PubMed PMID: 27225100.

reference Order 791 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[771]

22. Fanelli D. Opinion: Is science really facing a reproducibility crisis, and do we need it to? Proc Natl Acad Sci U S A. 2018;115(11):2628-31. doi: 10.1073/pnas.1708272114. PubMed PMID: 29531051; PubMed Central PMCID: PMCPMC5856498.

reference Order 793 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[773]

23. Goodman SN, Fanelli D, Ioannidis JP. What does research reproducibility mean? Sci Transl Med. 2016;8(341):341ps12. doi: 10.1126/scitranslmed.aaf5027. PubMed PMID: 27252173.

reference Order 795 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[775]

24. Vaux DL, Fidler F, Cumming G. Replicates and repeats--what is the difference and is it significant? A brief discussion of statistics and experimental design. EMBO Rep. 2012;13(4):291-6. doi: 10.1038/embor.2012.36. PubMed PMID: 22421999; PubMed Central PMCID: PMCPMC3321166.

reference Order 797 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[777]

25. Bell G. Replicates and repeats. BMC Biol. 2016;14(1):28. doi: 10.1186/s12915-016-0254-5.

reference Order 799 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[779]

26. Blainey P, Krzywinski M, Altman N. Replication. Nat Methods. 2014;11:879. doi: 10.1038/nmeth.3091.

reference Order 801 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[781]

27. Scott JE, Williams KP. Validating Identity, Mass Purity and Enzymatic Purity of Enzyme Preparations. In: Sittampalam GS, Gal-Edd N, Arkin M, Auld D, Austin C, Bejcek B, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

reference Order 803 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[783]

28. Iversen PW, Beck B, Chen YF, Dere W, Devanarayan V, Eastwood BJ, et al. HTS Assay Validation. In: Sittampalam GS, Gal-Edd N, Arkin M, Auld D, Austin C, Bejcek B, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

reference Order 805 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[785]

29. Auld DS, Farmen MW, Kahl SD, Kriauciunas A, McKnight KL, Montrose C, et al. Receptor Binding Assays for HTS and Drug Discovery. In: Sittampalam GS, Coussens NP, Brimacombe K, Grossman A, Arkin M, Auld D, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

reference Order 807 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[787]

30. Kahl SD, Hubbard FR, Sittampalam GS, Zock JM. Validation of a High Throughput Scintillation Proximity Assay for 5-Hydroxytryptamine1E Receptor Binding Activity. Journal of Biomolecular Screening. 1997;2(1):33-40.

reference Order 809 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[789]

31. Brideau C, Gunter B, Pikounis B, Liaw A. Improved statistical methods for hit selection in high-throughput screening. J Biomol Screen. 2003;8(6):634-47. doi: 10.1177/1087057103258285. PubMed PMID: 14711389.

reference Order 811 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[791]

32. Dragiev P, Nadon R, Makarenkov V. Systematic error detection in experimental high-throughput screening. BMC bioinformatics. 2011;12:25-. doi: 10.1186/1471-2105-12-25. PubMed PMID: PMC3034671.

reference Order 813 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[793]

33. Kevorkov D, Makarenkov V. Statistical Analysis of Systematic Errors in High-Throughput Screening. Journal of Biomolecular Screening. 2005;10(6):557-67. doi: 10.1177/1087057105276989. PubMed PMID: 16103415.

reference Order 815 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[795]

34. Makarenkov V, Kevorkov D, Zentilli P, Gagarin A, Malo N, Nadon R. HTS-Corrector: software for the statistical analysis and correction of experimental high-throughput screening data. Bioinformatics. 2006;22(11):1408-9. doi: 10.1093/bioinformatics/btl126. PubMed PMID: 16595559.

reference Order 817 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[797]

35. Makarenkov V, Zentilli P, Kevorkov D, Gagarin A, Malo N, Nadon R. An efficient method for the detection and elimination of systematic error in high-throughput screening. Bioinformatics. 2007;23(13):1648-57. doi: 10.1093/bioinformatics/btm145.

reference Order 819 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[799]

36. Mazoure B, Nadon R, Makarenkov V. Identification and correction of spatial bias are essential for obtaining quality data in high-throughput screening technologies. Scientific reports. 2017;7(1):11921. doi: 10.1038/s41598-017-11940-4.

reference Order 821 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[801]

37. Mukaka MM. A guide to appropriate use of Correlation coefficient in medical research. Malawi Med J. 2012;24(3):69-71. PubMed PMID: PMC3576830.

reference Order 823 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[803]

38. Chambers JM. Graphical Methods for Data Analysis: Wadsworth International Group ; Boston : Duxbury Press, Belmont, CA; 1983.

reference Order 825 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[805]

39. Lin LI. A concordance correlation coefficient to evaluate reproducibility. Biometrics. 1989;45(1):255-68. Epub 1989/03/01. PubMed PMID: 2720055.

reference Order 827 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[807]

40. Bland JM, Altman DG. Measuring agreement in method comparison studies. Stat Methods Med Res. 1999;8(2):135-60. doi: 10.1177/096228029900800204. PubMed PMID: 10501650.

reference Order 829 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[809]

41. Wasserstein RL, Lazar NA. The ASA's Statement on p-Values: Context, Process, and Purpose. The American Statistician. 2016;70(2):129-33. doi: 10.1080/00031305.2016.1154108.

reference Order 831 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[811]

42. Bretz F, Hothorn T, Westfall P. Multiple Comparisons Using R. New York, NY: Chapman and Hall/CRC; 2010.

reference Order 833 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[813]

43. Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B (Methodological). 1995;57(1):289-300.

reference Order 835 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[815]

44. Cox KL, Devanarayan V, Kriauciunas A, Manetta J, Montrose C, Sittampalam S. Immunoassay Methods. In: Sittampalam GS, Coussens NP, Nelson H, Arkin M, Auld D, Austin C, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

reference Order 837 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[817]

45. Lee JW, Weiner RS, Sailstad JM, Bowsher RR, Knuth DW, O'Brien PJ, et al. Method validation and measurement of biomarkers in nonclinical and clinical samples in drug development: a conference report. Pharmaceutical research. 2005;22(4):499-511. doi: 10.1007/s11095-005-2495-9. PubMed PMID: 15846456.

reference Order 839 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[819]

46. Fallahi-Sichani M, Honarnejad S, Heiser LM, Gray JW, Sorger PK. Metrics other than potency reveal systematic variation in responses to cancer drugs. Nature chemical biology. 2013;9(11):708.

reference Order 841 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[821]

47. Weiss JN. The Hill equation revisited: uses and misuses. FASEB journal : official publication of the Federation of American Societies for Experimental Biology. 1997;11(11):835-41. Epub 1997/09/01. PubMed PMID: 9285481.

reference Order 843 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[823]

48. Franzblau LE, Chung KC. Graphs, Tables, and Figures in Scientific Publications: The Good, the Bad, and How Not to Be the Latter. The Journal of Hand Surgery. 2012;37(3):591-6. doi: https://doi.org/10.1016/j.jhsa.2011.12.041.

reference Order 845 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[825]

49. Lile S. 44 Types of Graphs: Perfect for Every Top Industry [Accessed January 23, 2019]. Available from: https://visme.co/blog/types-of-graphs/.

reference Order 847 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[827]

50. Weissgerber TL, Milic NM, Winham SJ, Garovic VD. Beyond bar and line graphs: time for a new data presentation paradigm. PLoS Biol. 2015;13(4):e1002128. doi: 10.1371/journal.pbio.1002128. PubMed PMID: 25901488; PubMed Central PMCID: PMCPMC4406565.

reference Order 849 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[829]

51. Weissgerber TL, Savic M, Winham SJ, Stanisavljevic D, Garovic VD, Milic NM. Data visualization, bar naked: A free tool for creating interactive graphics. J Biol Chem. 2017;292(50):20592-8. doi: 10.1074/jbc.RA117.000147. PubMed PMID: 28974579; PubMed Central PMCID: PMCPMC5733595.

reference Order 851 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[831]

52. Dahlin JL, Sinville R, Solberg J, Zhou H, Han J, Francis S, et al. A cell-free fluorometric high-throughput screen for inhibitors of Rtt109-catalyzed histone acetylation. PLoS One. 2013;8(11):e78877. doi: 10.1371/journal.pone.0078877. PubMed PMID: 24260132; PubMed Central PMCID: PMCPMC3832525.

reference Order 853 Style Reference word/document.xml:/w:document[1]/w:body[1]/w:p[833]

53. Relational Model. In Wikipedia, The Free Encyclopedia: Wikepedia Contributors; 2019.

heading Order 855 Level 2 Style FiguresTablesBoxesSectionHeading word/document.xml:/w:document[1]/w:body[1]/w:p[835]

H2. [figs-and-tables] Figures, Tables and Boxes Appendix (do not delete)

paragraph Order 856 Style Comment word/document.xml:/w:document[1]/w:body[1]/w:p[836]

Place numbered figures, tables and boxes (referred to from the main text) below.

paragraph Order 857 Style Comment word/document.xml:/w:document[1]/w:body[1]/w:p[837]

“In-line” figures (e.g. equations) and tables should be placed within the main text in their desired final location.

paragraph Order 858 Style Comment word/document.xml:/w:document[1]/w:body[1]/w:p[838]

Boxes can have a single level of sections; the titles for these sections should be marked up in “Box subhead” style.

References Reference list and validation

Review the reference list in document order, confirm whether each entry is cited in text, and inspect local citation matching plus DOI or PubMed-backed validation details from one place.

Normalized reference fields are shown for authors, year, title, DOI, PMID, and external validation status.

Local citation coverage

Counts below summarize citations resolved to these reference entries.

Validated 36 Needs review 29 Not found 0 Not applicable 0

External outcomes

PubMed and DOI validation outcomes are listed on each reference row below.

Validated 36 Probable 1 Ambiguous 0 Not found 28 Not applicable 0 Pending 0
reference Order 1 Matched in text not found

Beck B, Chen YF, Dere W, Devanarayan V, Eastwood BJ, Farmen MW, et al. Assay Operations for SAR Support. In: Sittampalam GS, Gal-Edd N, Arkin M, Auld D, Austin C, Bejcek B, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (2)

Citation XIII Needs review

1 | Style: numeric | Normalized: 1

Detected in block order 141: Note: many of the parameters discussed in the AGM chapter on Assay Operations for SAR S...

Citation LVII Needs review

1 | Style: numeric | Normalized: 1

Detected in block order 506: While this data has been exaggerated to demonstrate a point, these types of patterns ca...

Authors: Beck B, Chen YF, Dere W, Devanarayan V, Eastwood BJ, Farmen MW, et al | Year: 2004 | Title: Assay Operations for SAR Support

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 2 No in-text citation match not found

Devanarayan V, Sawyer BD, Montrose C, Johnson D, Greenen DP, Sittampalam GS, et al. Glossary of Quantitative Biology Terms. In: Sittampalam GS, Gal-Edd N, Arkin M, Auld D, Austin C, Bejcek B, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

This reference was extracted successfully, but no in-text citation currently resolves to it.

Authors: Devanarayan V, Sawyer BD, Montrose C, Johnson D, Greenen DP, Sittampalam GS, et al | Year: 2004 | Title: Glossary of Quantitative Biology Terms

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 3 Matched in text not found

Campbell RM, Dymshitz J, Eastwood BJ, Emkey R, Greenen DP, Heerding JM, et al. Data Standardization for Results Management. In: Sittampalam GS, Gal-Edd N, Arkin M, Auld D, Austin C, Bejcek B, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XII Needs review

3 | Style: numeric | Normalized: 3

Detected in block order 100: where 10 is the base and a is the exponent or power to which the base is raised. For ex...

Authors: Campbell RM, Dymshitz J, Eastwood BJ, Emkey R, Greenen DP, Heerding JM, et al | Year: 2004 | Title: Data Standardization for Results Management

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 4 Matched in text not found

Tufte ER. The Visual Display of Quantitative Information. 1st ed. Chelshire, Conneticut: Graphics Press; 1983.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (2)

Citation I Needs review

4 | Style: numeric | Normalized: 4

Citation VII Needs review

4,10 | Style: numeric | Normalized: 4

Authors: Tufte ER | Year: 1983 | Title: The Visual Display of Quantitative Information

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 5 Matched in text not found

King L. Preparing better graphs. Journal of Public Health and Emergency. 2018;2(1).

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation II Needs review

5-9 | Style: numeric | Normalized: 5

Authors: King L | Year: 2018 | Title: Preparing better graphs

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 6 Matched in text validated

Boers M. Designing effective graphs to get your message across. Annals of the rheumatic diseases. 2018;77(6):833-9. doi: 10.1136/annrheumdis-2018-213396. PubMed PMID: 29748338.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation III Validated

5-9 | Style: numeric | Normalized: 6

Authors: Boers M | Year: 2018 | Title: Designing effective graphs to get your message across

DOI: 10.1136/annrheumdis-2018-213396 | PMID: 29748338

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Designing effective graphs to get your message across. | Boers M | 2018 | PMID 29748338

reference Order 7 Matched in text not found

Kelleher C, Wagener T. Ten guidelines for effective data visualization in scientific publications. Environmental Modelling & Software. 2011;26(6):822-7. doi: https://doi.org/10.1016/j.envsoft.2010.12.006.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation IV Needs review

5-9 | Style: numeric | Normalized: 7

Authors: Kelleher C, Wagener T | Year: 2010 | Title: Ten guidelines for effective data visualization in scientific publications

DOI: 10.1016/j.envsoft.2010.12.006 | PMID: n/a

External validation: not found via doi_exact

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed record was returned for DOI 10.1016/j.envsoft.2010.12.006.

reference Order 8 Matched in text probable

Puhan MA, ter Riet G, Eichler K, Steurer J, Bachmann LM. More medical journals should inform their contributors about three key principles of graph construction. J Clin Epidemiol. 2006;59(10):1017-22. doi: 10.1016/j.jclinepi.2005.12.016. PubMed PMID: 16980140.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation V Needs review

5-9 | Style: numeric | Normalized: 8

Authors: Puhan MA, ter Riet G, Eichler K, Steurer J, Bachmann LM | Year: 2005 | Title: More medical journals should inform their contributors about three key principles of graph construction

DOI: 10.1016/j.jclinepi.2005.12.016 | PMID: 16980140

External validation: probable via pmid_exact

A likely external source match was found, but some parsed fields still disagree or need confirmation.

More medical journals should inform their contributors about three key principles of graph construction. | Puhan MA, ter Riet G, Eichler K, et al | 2006 | PMID 16980140

Parsed publication year differs from the returned PubMed year. Parsed author summary differs from the returned PubMed authors.

reference Order 9 Matched in text validated

Rougier NP, Droettboom M, Bourne PE. Ten Simple Rules for Better Figures. PLOS Computational Biology. 2014;10(9):e1003833. doi: 10.1371/journal.pcbi.1003833.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation VI Validated

5-9 | Style: numeric | Normalized: 9

Authors: Rougier NP, Droettboom M, Bourne PE | Year: 2014 | Title: Ten Simple Rules for Better Figures

DOI: 10.1371/journal.pcbi.1003833 | PMID: 25210732

External validation: validated via doi_exact

A PubMed-backed source record was validated for this matched reference.

Ten simple rules for better figures. | Rougier NP, Droettboom M, Bourne PE | 2014 | PMID 25210732

reference Order 10 Matched in text not found

Cleveland WS. The Elements of Graphing Data. Summit, NJ: Hobart Press; 1985.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation VIII Needs review

4,10 | Style: numeric | Normalized: 10

Authors: Cleveland WS | Year: 1985 | Title: The Elements of Graphing Data

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 11 Matched in text validated

Cabanski C, Gilbert H, Mosesova S. Can Graphics Tell Lies? A Tutorial on How To Visualize Your Data. Clin Transl Sci. 2018;11(4):371-7. doi: 10.1111/cts.12554. PubMed PMID: 29603646; PubMed Central PMCID: PMCPMC6039197.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (2)

Citation IX Validated

11 | Style: numeric | Normalized: 11

Citation XLI Validated

11,38 | Style: numeric | Normalized: 11

Detected in block order 310: The correlation coefficient number alone is not adequate for demonstration of statistic...

Authors: Cabanski C, Gilbert H, Mosesova S | Year: 2018 | Title: Can Graphics Tell Lies? A Tutorial on How To Visualize Your Data

DOI: 10.1111/cts.12554 | PMID: 29603646

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Can Graphics Tell Lies? A Tutorial on How To Visualize Your Data. | Cabanski C, Gilbert H, Mosesova S | 2018 | PMID 29603646

reference Order 12 Matched in text not found

Haas JV, Eastwood BJ, Iversen PW, Weidner JR. Minimum Significant Ratio - A Statistic to Asses Assay Variability. In: Sittampalam GS, Gal-Edd N, Arkin M, Auld D, Austin C, Bejcek B, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation X Needs review

12 | Style: numeric | Normalized: 12

Detected in block order 43: Proper statistical treatment of the data is essential and consulting with a statisticia...

Authors: Haas JV, Eastwood BJ, Iversen PW, Weidner JR | Year: 2004 | Title: Minimum Significant Ratio - A Statistic to Asses Assay Variability

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 13 Matched in text validated

Blackstone EH. Rounding numbers. J Thorac Cardiovasc Surg. 2016;152(6):1481-3. doi: 10.1016/j.jtcvs.2016.09.003. PubMed PMID: 27726878.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XI Validated

13 | Style: numeric | Normalized: 13

Detected in block order 53: Although the authors subscribe to the common rounding rule which “rounds down” for...

Authors: Blackstone EH | Year: 2016 | Title: Rounding numbers

DOI: 10.1016/j.jtcvs.2016.09.003 | PMID: 27726878

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Rounding numbers. | Blackstone EH | 2016 | PMID 27726878

reference Order 14 Matched in text validated

Nagele P. Misuse of standard error of the mean (SEM) when reporting variability of a sample. A critical evaluation of four anaesthesia journals. Br J Anaesth. 2003;90(4):514-6. PubMed PMID: 12644429.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XIV Validated

14,15 | Style: numeric | Normalized: 14

Detected in block order 152: Two statistical quantities that are often (incorrectly) used interchangeably are SEM an...

Authors: Nagele P | Year: 2003 | Title: Misuse of standard error of the mean (SEM) when reporting variability of a sample

DOI: n/a | PMID: 12644429

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Misuse of standard error of the mean (SEM) when reporting variability of a sample. A critical evaluation of four anaesthesia journals. | Nagele P | 2003 | PMID 12644429

Parsed title differs from the returned PubMed title.

reference Order 15 Matched in text validated

Sedgwick P. Standard deviation or the standard error of the mean. Br Med J. 2015;350:h831. doi: 10.1136/bmj.h831.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XV Validated

14,15 | Style: numeric | Normalized: 15

Detected in block order 152: Two statistical quantities that are often (incorrectly) used interchangeably are SEM an...

Authors: Sedgwick P | Year: 2015 | Title: Standard deviation or the standard error of the mean

DOI: 10.1136/bmj.h831 | PMID: 25691433

External validation: validated via doi_exact

A PubMed-backed source record was validated for this matched reference.

Standard deviation or the standard error of the mean. | Sedgwick P | 2015 | PMID 25691433

reference Order 16 Matched in text validated

Motulsky HJ. Common misconceptions about data analysis and statistics. Naunyn-Schmiedeberg's archives of pharmacology. 2014;387(11):1017-23. doi: 10.1007/s00210-014-1037-6. PubMed PMID: 25213136; PubMed Central PMCID: PMCPMC4203998.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XVI Validated

16-19 | Style: numeric | Normalized: 16

Detected in block order 188: An example presented in an article that was published in four separate journals shows a...

Authors: Motulsky HJ | Year: 2014 | Title: Common misconceptions about data analysis and statistics

DOI: 10.1007/s00210-014-1037-6 | PMID: 25213136

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Common misconceptions about data analysis and statistics. | Motulsky HJ | 2014 | PMID 25213136

reference Order 17 Matched in text validated

Motulsky HJ. Common misconceptions about data analysis and statistics. The Journal of pharmacology and experimental therapeutics. 2014;351(1):200-5. doi: 10.1124/jpet.114.219170. PubMed PMID: 25204545.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (2)

Citation XVII Validated

16-19 | Style: numeric | Normalized: 17

Detected in block order 188: An example presented in an article that was published in four separate journals shows a...

Citation XX Validated

17 | Style: numeric | Normalized: 17

Detected in block order 191: The activity for an assay control compound from several different runs is plotted on th...

Authors: Motulsky HJ | Year: 2014 | Title: Common misconceptions about data analysis and statistics

DOI: 10.1124/jpet.114.219170 | PMID: 25204545

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Common misconceptions about data analysis and statistics. | Motulsky HJ | 2014 | PMID 25204545

reference Order 18 Matched in text validated

Motulsky HJ. Common misconceptions about data analysis and statistics. Pharmacology research & perspectives. 2015;3(1):e00093. doi: 10.1002/prp2.93. PubMed PMID: 25692012; PubMed Central PMCID: PMCPMC4317225.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XVIII Validated

16-19 | Style: numeric | Normalized: 18

Detected in block order 188: An example presented in an article that was published in four separate journals shows a...

Authors: Motulsky HJ | Year: 2015 | Title: Common misconceptions about data analysis and statistics

DOI: 10.1002/prp2.93 | PMID: 25692012

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Common misconceptions about data analysis and statistics. | Motulsky HJ | 2015 | PMID 25692012

reference Order 19 Matched in text validated

Motulsky HJ. Common misconceptions about data analysis and statistics. Br J Pharmacol. 2015;172(8):2126-32. doi: 10.1111/bph.12884. PubMed PMID: 25134425; PubMed Central PMCID: PMCPMC4386986.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XIX Validated

16-19 | Style: numeric | Normalized: 19

Detected in block order 188: An example presented in an article that was published in four separate journals shows a...

Authors: Motulsky HJ | Year: 2015 | Title: Common misconceptions about data analysis and statistics

DOI: 10.1111/bph.12884 | PMID: 25134425

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Common misconceptions about data analysis and statistics. | Motulsky HJ | 2015 | PMID 25134425

reference Order 20 Matched in text validated

Zhang JH, Chung TD, Oldenburg KR. A Simple Statistical Parameter for Use in Evaluation and Validation of High Throughput Screening Assays. J Biomol Screen. 1999;4(2):67-73. PubMed PMID: 10838414.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXI Validated

20 | Style: numeric | Normalized: 20

Detected in block order 214: The Z’-Factor (20) is considered the best metric to describe assay quality because it...

Authors: Zhang JH, Chung TD, Oldenburg KR | Year: 1999 | Title: A Simple Statistical Parameter for Use in Evaluation and Validation of High Throughput Screening Assays

DOI: n/a | PMID: 10838414

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

A Simple Statistical Parameter for Use in Evaluation and Validation of High Throughput Screening Assays. | Zhang JH, Chung TD, Oldenburg KR | 1999 | PMID 10838414

reference Order 21 Matched in text validated

Baker M. Is there a reproducibility crisis? Nature. 2016;533(7604):452-4. doi: 10.1038/533452a. PubMed PMID: 27225100.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXII Validated

21,22 | Style: numeric | Normalized: 21

Detected in block order 227: There has been considerable focus on irreproducible research in science, identifying th...

Authors: Baker M | Year: 2016 | Title: Is there a reproducibility crisis? Nature

DOI: 10.1038/533452a | PMID: 27225100

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

1,500 scientists lift the lid on reproducibility. | Baker M | 2016 | PMID 27225100

Parsed title differs from the returned PubMed title.

reference Order 22 Matched in text validated

Fanelli D. Opinion: Is science really facing a reproducibility crisis, and do we need it to? Proc Natl Acad Sci U S A. 2018;115(11):2628-31. doi: 10.1073/pnas.1708272114. PubMed PMID: 29531051; PubMed Central PMCID: PMCPMC5856498.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXIII Validated

21,22 | Style: numeric | Normalized: 22

Detected in block order 227: There has been considerable focus on irreproducible research in science, identifying th...

Authors: Fanelli D | Year: 2018 | Title: Opinion: Is science really facing a reproducibility crisis, and do we need it to? Proc Natl Acad Sci U S A

DOI: 10.1073/pnas.1708272114 | PMID: 29531051

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Opinion: Is science really facing a reproducibility crisis, and do we need it to? | Fanelli D | 2018 | PMID 29531051

Parsed title differs from the returned PubMed title.

reference Order 23 Matched in text validated

Goodman SN, Fanelli D, Ioannidis JP. What does research reproducibility mean? Sci Transl Med. 2016;8(341):341ps12. doi: 10.1126/scitranslmed.aaf5027. PubMed PMID: 27252173.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXIV Validated

23 | Style: numeric | Normalized: 23

Detected in block order 227: There has been considerable focus on irreproducible research in science, identifying th...

Authors: Goodman SN, Fanelli D, Ioannidis JP | Year: 2016 | Title: What does research reproducibility mean? Sci Transl Med

DOI: 10.1126/scitranslmed.aaf5027 | PMID: 27252173

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

What does research reproducibility mean? | Goodman SN, Fanelli D, Ioannidis JP | 2016 | PMID 27252173

Parsed title differs from the returned PubMed title.

reference Order 24 Matched in text validated

Vaux DL, Fidler F, Cumming G. Replicates and repeats--what is the difference and is it significant? A brief discussion of statistics and experimental design. EMBO Rep. 2012;13(4):291-6. doi: 10.1038/embor.2012.36. PubMed PMID: 22421999; PubMed Central PMCID: PMCPMC3321166.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (2)

Citation XXV Validated

24 | Style: numeric | Normalized: 24

Detected in block order 227: There has been considerable focus on irreproducible research in science, identifying th...

Citation XXVI Validated

24-26 | Style: numeric | Normalized: 24

Detected in block order 239: When presenting data, it is important to provide the number and type of replicates (24-...

Authors: Vaux DL, Fidler F, Cumming G | Year: 2012 | Title: Replicates and repeats--what is the difference and is it significant? A brief discussion of statistics and experimental design

DOI: 10.1038/embor.2012.36 | PMID: 22421999

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Replicates and repeats--what is the difference and is it significant? A brief discussion of statistics and experimental design. | Vaux DL, Fidler F, Cumming G | 2012 | PMID 22421999

reference Order 25 Matched in text validated

Bell G. Replicates and repeats. BMC Biol. 2016;14(1):28. doi: 10.1186/s12915-016-0254-5.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXVII Validated

24-26 | Style: numeric | Normalized: 25

Detected in block order 239: When presenting data, it is important to provide the number and type of replicates (24-...

Authors: Bell G | Year: 2016 | Title: Replicates and repeats

DOI: 10.1186/s12915-016-0254-5 | PMID: 27055650

External validation: validated via doi_exact

A PubMed-backed source record was validated for this matched reference.

Replicates and repeats. | Bell G | 2016 | PMID 27055650

reference Order 26 Matched in text validated

Blainey P, Krzywinski M, Altman N. Replication. Nat Methods. 2014;11:879. doi: 10.1038/nmeth.3091.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXVIII Validated

24-26 | Style: numeric | Normalized: 26

Detected in block order 239: When presenting data, it is important to provide the number and type of replicates (24-...

Authors: Blainey P, Krzywinski M, Altman N | Year: 2014 | Title: Replication

DOI: 10.1038/nmeth.3091 | PMID: 25317452

External validation: validated via doi_exact

A PubMed-backed source record was validated for this matched reference.

Points of significance: replication. | Blainey P, Krzywinski M, Altman N | 2014 | PMID 25317452

Parsed title differs from the returned PubMed title.

reference Order 27 Matched in text not found

Scott JE, Williams KP. Validating Identity, Mass Purity and Enzymatic Purity of Enzyme Preparations. In: Sittampalam GS, Gal-Edd N, Arkin M, Auld D, Austin C, Bejcek B, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXIX Needs review

27 | Style: numeric | Normalized: 27

Detected in block order 241: In practice, the distinction between technical, independent and inter-run replicates is...

Authors: Scott JE, Williams KP | Year: 2004 | Title: Validating Identity, Mass Purity and Enzymatic Purity of Enzyme Preparations

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 28 Matched in text not found

Iversen PW, Beck B, Chen YF, Dere W, Devanarayan V, Eastwood BJ, et al. HTS Assay Validation. In: Sittampalam GS, Gal-Edd N, Arkin M, Auld D, Austin C, Bejcek B, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (4)

Citation XXX Needs review

28 | Style: numeric | Normalized: 28

Detected in block order 265: A related concept is the number of independent experiments or inter-run replicates requ...

Citation XXXIX Needs review

28 | Style: numeric | Normalized: 28

Detected in block order 292: Some examples of plate drift and spatial effects, which are systematic errors, are show...

Citation XLVI Needs review

28 | Style: numeric | Normalized: 28

Detected in block order 357: If the interest is more in assessing the agreement between two variables, then the Blan...

Citation LX Needs review

28 | Style: numeric | Normalized: 28

Detected in block order 619: A scatter plot (also referred to as a scatter gram) is a graphic visualization of two-d...

Authors: Iversen PW, Beck B, Chen YF, Dere W, Devanarayan V, Eastwood BJ, et al | Year: 2004 | Title: HTS Assay Validation

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 29 Matched in text not found

Auld DS, Farmen MW, Kahl SD, Kriauciunas A, McKnight KL, Montrose C, et al. Receptor Binding Assays for HTS and Drug Discovery. In: Sittampalam GS, Coussens NP, Brimacombe K, Grossman A, Arkin M, Auld D, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (2)

Citation XXXI Needs review

29 | Style: numeric | Normalized: 29

Detected in block order 288: Random errors are unpredictable and have no defined pattern. Random errors are fluctuat...

Citation LVI Needs review

29 | Style: numeric | Normalized: 29

Detected in block order 469: In a different example, consider the data in Table 13 from a standard radioligand bindi...

Authors: Auld DS, Farmen MW, Kahl SD, Kriauciunas A, McKnight KL, Montrose C, et al | Year: 2004 | Title: Receptor Binding Assays for HTS and Drug Discovery

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 30 Matched in text not found

Kahl SD, Hubbard FR, Sittampalam GS, Zock JM. Validation of a High Throughput Scintillation Proximity Assay for 5-Hydroxytryptamine1E Receptor Binding Activity. Journal of Biomolecular Screening. 1997;2(1):33-40.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXXII Needs review

30 | Style: numeric | Normalized: 30

Detected in block order 288: Random errors are unpredictable and have no defined pattern. Random errors are fluctuat...

Authors: Kahl SD, Hubbard FR, Sittampalam GS, Zock JM | Year: 1997 | Title: Validation of a High Throughput Scintillation Proximity Assay for 5-Hydroxytryptamine1E Receptor Binding Activity

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 31 Matched in text validated

Brideau C, Gunter B, Pikounis B, Liaw A. Improved statistical methods for hit selection in high-throughput screening. J Biomol Screen. 2003;8(6):634-47. doi: 10.1177/1087057103258285. PubMed PMID: 14711389.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXXIII Validated

31 | Style: numeric | Normalized: 31

Detected in block order 290: Systematic errors are persistent (unless addressed) and can be associated with instrume...

Authors: Brideau C, Gunter B, Pikounis B, Liaw A | Year: 2003 | Title: Improved statistical methods for hit selection in high-throughput screening

DOI: 10.1177/1087057103258285 | PMID: 14711389

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Improved statistical methods for hit selection in high-throughput screening. | Brideau C, Gunter B, Pikounis B, Liaw A | 2003 | PMID 14711389

reference Order 32 Matched in text validated

Dragiev P, Nadon R, Makarenkov V. Systematic error detection in experimental high-throughput screening. BMC bioinformatics. 2011;12:25-. doi: 10.1186/1471-2105-12-25. PubMed PMID: PMC3034671.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXXIV Validated

32-35 | Style: numeric | Normalized: 32

Detected in block order 290: Systematic errors are persistent (unless addressed) and can be associated with instrume...

Authors: Dragiev P, Nadon R, Makarenkov V | Year: 2011 | Title: Systematic error detection in experimental high-throughput screening

DOI: 10.1186/1471-2105-12-25 | PMID: 21247425

External validation: validated via doi_exact

A PubMed-backed source record was validated for this matched reference.

Systematic error detection in experimental high-throughput screening. | Dragiev P, Nadon R, Makarenkov V | 2011 | PMID 21247425

reference Order 33 Matched in text validated

Kevorkov D, Makarenkov V. Statistical Analysis of Systematic Errors in High-Throughput Screening. Journal of Biomolecular Screening. 2005;10(6):557-67. doi: 10.1177/1087057105276989. PubMed PMID: 16103415.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXXV Validated

32-35 | Style: numeric | Normalized: 33

Detected in block order 290: Systematic errors are persistent (unless addressed) and can be associated with instrume...

Authors: Kevorkov D, Makarenkov V | Year: 2005 | Title: Statistical Analysis of Systematic Errors in High-Throughput Screening

DOI: 10.1177/1087057105276989 | PMID: 16103415

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Statistical analysis of systematic errors in high-throughput screening. | Kevorkov D, Makarenkov V | 2005 | PMID 16103415

reference Order 34 Matched in text validated

Makarenkov V, Kevorkov D, Zentilli P, Gagarin A, Malo N, Nadon R. HTS-Corrector: software for the statistical analysis and correction of experimental high-throughput screening data. Bioinformatics. 2006;22(11):1408-9. doi: 10.1093/bioinformatics/btl126. PubMed PMID: 16595559.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXXVI Validated

32-35 | Style: numeric | Normalized: 34

Detected in block order 290: Systematic errors are persistent (unless addressed) and can be associated with instrume...

Authors: Makarenkov V, Kevorkov D, Zentilli P, Gagarin A, Malo N, Nadon R | Year: 2006 | Title: HTS-Corrector: software for the statistical analysis and correction of experimental high-throughput screening data

DOI: 10.1093/bioinformatics/btl126 | PMID: 16595559

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

HTS-Corrector: software for the statistical analysis and correction of experimental high-throughput screening data. | Makarenkov V, Kevorkov D, Zentilli P, et al | 2006 | PMID 16595559

Parsed author summary differs from the returned PubMed authors.

reference Order 35 Matched in text validated

Makarenkov V, Zentilli P, Kevorkov D, Gagarin A, Malo N, Nadon R. An efficient method for the detection and elimination of systematic error in high-throughput screening. Bioinformatics. 2007;23(13):1648-57. doi: 10.1093/bioinformatics/btm145.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXXVII Validated

32-35 | Style: numeric | Normalized: 35

Detected in block order 290: Systematic errors are persistent (unless addressed) and can be associated with instrume...

Authors: Makarenkov V, Zentilli P, Kevorkov D, Gagarin A, Malo N, Nadon R | Year: 2007 | Title: An efficient method for the detection and elimination of systematic error in high-throughput screening

DOI: 10.1093/bioinformatics/btm145 | PMID: 17463024

External validation: validated via doi_exact

A PubMed-backed source record was validated for this matched reference.

An efficient method for the detection and elimination of systematic error in high-throughput screening. | Makarenkov V, Zentilli P, Kevorkov D, et al | 2007 | PMID 17463024

Parsed author summary differs from the returned PubMed authors.

reference Order 36 Matched in text validated

Mazoure B, Nadon R, Makarenkov V. Identification and correction of spatial bias are essential for obtaining quality data in high-throughput screening technologies. Scientific reports. 2017;7(1):11921. doi: 10.1038/s41598-017-11940-4.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XXXVIII Validated

36 | Style: numeric | Normalized: 36

Detected in block order 290: Systematic errors are persistent (unless addressed) and can be associated with instrume...

Authors: Mazoure B, Nadon R, Makarenkov V | Year: 2017 | Title: Identification and correction of spatial bias are essential for obtaining quality data in high-throughput screening technologies

DOI: 10.1038/s41598-017-11940-4 | PMID: 28931934

External validation: validated via doi_exact

A PubMed-backed source record was validated for this matched reference.

Identification and correction of spatial bias are essential for obtaining quality data in high-throughput screening technologies. | Mazoure B, Nadon R, Makarenkov V | 2017 | PMID 28931934

reference Order 37 Matched in text not found

Mukaka MM. A guide to appropriate use of Correlation coefficient in medical research. Malawi Med J. 2012;24(3):69-71. PubMed PMID: PMC3576830.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XL Needs review

37 | Style: numeric | Normalized: 37

Detected in block order 304: The larger the absolute value of the correlation coefficient, the stronger the linear r...

Authors: Mukaka MM | Year: 2012 | Title: A guide to appropriate use of Correlation coefficient in medical research

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 38 Matched in text not found

Chambers JM. Graphical Methods for Data Analysis: Wadsworth International Group; Boston: Duxbury Press, Belmont, CA; 1983.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (2)

Citation XLII Needs review

11,38 | Style: numeric | Normalized: 38

Detected in block order 310: The correlation coefficient number alone is not adequate for demonstration of statistic...

Citation XLIII Needs review

38 | Style: numeric | Normalized: 38

Detected in block order 313: All the graphs shown have Pearson correlation coefficients equal to 0.7. The example in...

Authors: Chambers JM | Year: 1983 | Title: Graphical Methods for Data Analysis: Wadsworth International Group; Boston: Duxbury Press, Belmont, CA; 1983

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 39 Matched in text validated

Lin LI. A concordance correlation coefficient to evaluate reproducibility. Biometrics. 1989;45(1):255-68. Epub 1989/03/01. PubMed PMID: 2720055.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XLIV Validated

39 | Style: numeric | Normalized: 39

Detected in block order 353: Another parameter that will be useful to evaluate and report for comparing variables th...

Authors: Lin LI | Year: 1989 | Title: A concordance correlation coefficient to evaluate reproducibility

DOI: n/a | PMID: 2720055

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

A concordance correlation coefficient to evaluate reproducibility. | Lin LI | 1989 | PMID 2720055

reference Order 40 Matched in text validated

Bland JM, Altman DG. Measuring agreement in method comparison studies. Stat Methods Med Res. 1999;8(2):135-60. doi: 10.1177/096228029900800204. PubMed PMID: 10501650.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation XLV Validated

40 | Style: numeric | Normalized: 40

Detected in block order 357: If the interest is more in assessing the agreement between two variables, then the Blan...

Authors: Bland JM, Altman DG | Year: 1999 | Title: Measuring agreement in method comparison studies

DOI: 10.1177/096228029900800204 | PMID: 10501650

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Measuring agreement in method comparison studies. | Bland JM, Altman DG | 1999 | PMID 10501650

reference Order 41 Matched in text not found

Wasserstein RL, Lazar NA. The ASA's Statement on p-Values: Context, Process, and Purpose. The American Statistician. 2016;70(2):129-33. doi: 10.1080/00031305.2016.1154108.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (2)

Citation XLVII Needs review

41 | Style: numeric | Normalized: 41

Detected in block order 371: In 2016, the American Statistical Association published an official statement on p-valu...

Citation XLVIII Needs review

41,42 | Style: numeric | Normalized: 41

Detected in block order 380: We refer the reader to the main article and a discussion article for further informatio...

Authors: Wasserstein RL, Lazar NA | Year: 2016 | Title: The ASA's Statement on p-Values: Context, Process, and Purpose

DOI: 10.1080/00031305.2016.1154108 | PMID: n/a

External validation: not found via doi_exact

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed record was returned for DOI 10.1080/00031305.2016.1154108.

reference Order 42 Matched in text not found

Bretz F, Hothorn T, Westfall P. Multiple Comparisons Using R. New York, NY: Chapman and Hall/CRC; 2010.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (2)

Citation XLIX Needs review

41,42 | Style: numeric | Normalized: 42

Detected in block order 380: We refer the reader to the main article and a discussion article for further informatio...

Citation L Needs review

42 | Style: numeric | Normalized: 42

Detected in block order 382: A key topic related to p-values is multiple comparisons or multiple testing. When multi...

Authors: Bretz F, Hothorn T, Westfall P | Year: 2010 | Title: Multiple Comparisons Using R

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 43 Matched in text not found

Benjamini Y, Hochberg Y. Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing. Journal of the Royal Statistical Society Series B (Methodological). 1995;57(1):289-300.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation LI Needs review

43 | Style: numeric | Normalized: 43

Detected in block order 382: A key topic related to p-values is multiple comparisons or multiple testing. When multi...

Authors: Benjamini Y, Hochberg Y | Year: 1995 | Title: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 44 Matched in text validated

Cox KL, Devanarayan V, Kriauciunas A, Manetta J, Montrose C, Sittampalam S. Immunoassay Methods. In: Sittampalam GS, Coussens NP, Nelson H, Arkin M, Auld D, Austin C, et al., editors. Assay Guidance Manual. Bethesda (MD)2004.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation LII Validated

44 | Style: numeric | Normalized: 44

Detected in block order 385: Assays where the analyte concentration levels are calibrated from a reference standard...

Authors: Cox KL, Devanarayan V, Kriauciunas A, Manetta J, Montrose C, Sittampalam S | Year: 2004 | Title: Immunoassay Methods

DOI: n/a | PMID: 22553884

External validation: validated via title_author_year_search

A PubMed-backed source record was validated for this matched reference.

Assay Guidance Manual | Cox KL, Devanarayan V, Kriauciunas A, Manetta J, Montrose C, Sittampalam S | 2004 | PMID 22553884

Parsed title differs from the returned PubMed title.

reference Order 45 Matched in text validated

Lee JW, Weiner RS, Sailstad JM, Bowsher RR, Knuth DW, O'Brien PJ, et al. Method validation and measurement of biomarkers in nonclinical and clinical samples in drug development: a conference report. Pharmaceutical research. 2005;22(4):499-511. doi: 10.1007/s11095-005-2495-9. PubMed PMID: 15846456.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation LIII Validated

45 | Style: numeric | Normalized: 45

Detected in block order 385: Assays where the analyte concentration levels are calibrated from a reference standard...

Authors: Lee JW, Weiner RS, Sailstad JM, Bowsher RR, Knuth DW, O'Brien PJ, et al | Year: 2005 | Title: Method validation and measurement of biomarkers in nonclinical and clinical samples in drug development: a conference report

DOI: 10.1007/s11095-005-2495-9 | PMID: 15846456

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Method validation and measurement of biomarkers in nonclinical and clinical samples in drug development: a conference report. | Lee JW, Weiner RS, Sailstad JM, et al | 2005 | PMID 15846456

Parsed author summary differs from the returned PubMed authors.

reference Order 46 Matched in text not found

Fallahi-Sichani M, Honarnejad S, Heiser LM, Gray JW, Sorger PK. Metrics other than potency reveal systematic variation in responses to cancer drugs. Nature chemical biology. 2013;9(11):708.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation LIV Needs review

46 | Style: numeric | Normalized: 46

Detected in block order 394: The area under a concentration response curve or kinetic time course can be quantified...

Authors: Fallahi-Sichani M, Honarnejad S, Heiser LM, Gray JW, Sorger PK | Year: 2013 | Title: Metrics other than potency reveal systematic variation in responses to cancer drugs

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 47 Matched in text validated

Weiss JN. The Hill equation revisited: uses and misuses. FASEB journal: official publication of the Federation of American Societies for Experimental Biology. 1997;11(11):835-41. Epub 1997/09/01. PubMed PMID: 9285481.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation LV Validated

47 | Style: numeric | Normalized: 47

Detected in block order 457: Data from Table 12 was plotted using a log x-axis scale (A), linear x-axis scale (B) or...

Authors: Weiss JN | Year: 1997 | Title: The Hill equation revisited: uses and misuses

DOI: n/a | PMID: 9285481

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

The Hill equation revisited: uses and misuses. | Weiss JN | 1997 | PMID 9285481

reference Order 48 Matched in text validated

Franzblau LE, Chung KC. Graphs, Tables, and Figures in Scientific Publications: The Good, the Bad, and How Not to Be the Latter. The Journal of Hand Surgery. 2012;37(3):591-6. doi: https://doi.org/10.1016/j.jhsa.2011.12.041.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation LVIII Validated

48 | Style: numeric | Normalized: 48

Detected in block order 578: Many graph and figure types exist so that your data can be tailored to a specific purpo...

Authors: Franzblau LE, Chung KC | Year: 2011 | Title: Graphs, Tables, and Figures in Scientific Publications: The Good, the Bad, and How Not to Be the Latter

DOI: 10.1016/j.jhsa.2011.12.041 | PMID: 22305731

External validation: validated via doi_exact

A PubMed-backed source record was validated for this matched reference.

Graphs, tables, and figures in scientific publications: the good, the bad, and how not to be the latter. | Franzblau LE, Chung KC | 2012 | PMID 22305731

Parsed publication year differs from the returned PubMed year.

reference Order 49 Matched in text not found

Lile S. 44 Types of Graphs: Perfect for Every Top Industry [Accessed January 23, 2019]. Available from: https://visme.co/blog/types-of-graphs/.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation LIX Needs review

49 | Style: numeric | Normalized: 49

Detected in block order 578: Many graph and figure types exist so that your data can be tailored to a specific purpo...

Authors: Lile S | Year: 2019 | Title: 44 Types of Graphs: Perfect for Every Top Industry [Accessed January 23, 2019]

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

reference Order 50 Matched in text validated

Weissgerber TL, Milic NM, Winham SJ, Garovic VD. Beyond bar and line graphs: time for a new data presentation paradigm. PLoS Biol. 2015;13(4):e1002128. doi: 10.1371/journal.pbio.1002128. PubMed PMID: 25901488; PubMed Central PMCID: PMCPMC4406565.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (2)

Citation LXI Validated

50 | Style: numeric | Normalized: 50

Detected in block order 643: While widely used, bar graphs have significant limitations. Since bar graphs only displ...

Citation LXIII Validated

50 | Style: numeric | Normalized: 50

Detected in block order 646: The full data may suggest different conclusions from the summary statistics. The means...

Authors: Weissgerber TL, Milic NM, Winham SJ, Garovic VD | Year: 2015 | Title: Beyond bar and line graphs: time for a new data presentation paradigm

DOI: 10.1371/journal.pbio.1002128 | PMID: 25901488

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Beyond bar and line graphs: time for a new data presentation paradigm. | Weissgerber TL, Milic NM, Winham SJ, Garovic VD | 2015 | PMID 25901488

reference Order 51 Matched in text validated

Weissgerber TL, Savic M, Winham SJ, Stanisavljevic D, Garovic VD, Milic NM. Data visualization, bar naked: A free tool for creating interactive graphics. J Biol Chem. 2017;292(50):20592-8. doi: 10.1074/jbc.RA117.000147. PubMed PMID: 28974579; PubMed Central PMCID: PMCPMC5733595.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation LXII Validated

51 | Style: numeric | Normalized: 51

Detected in block order 643: While widely used, bar graphs have significant limitations. Since bar graphs only displ...

Authors: Weissgerber TL, Savic M, Winham SJ, Stanisavljevic D, Garovic VD, Milic NM | Year: 2017 | Title: Data visualization, bar naked: A free tool for creating interactive graphics

DOI: 10.1074/jbc.RA117.000147 | PMID: 28974579

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

Data visualization, bar naked: A free tool for creating interactive graphics. | Weissgerber TL, Savic M, Winham SJ, et al | 2017 | PMID 28974579

Parsed author summary differs from the returned PubMed authors.

reference Order 52 Matched in text validated

Dahlin JL, Sinville R, Solberg J, Zhou H, Han J, Francis S, et al. A cell-free fluorometric high-throughput screen for inhibitors of Rtt109-catalyzed histone acetylation. PLoS One. 2013;8(11):e78877. doi: 10.1371/journal.pone.0078877. PubMed PMID: 24260132; PubMed Central PMCID: PMCPMC3832525.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation LXIV Validated

52 | Style: numeric | Normalized: 52

Detected in block order 657: Heat maps can be used to view multiple plates at a time and to assist with efficient id...

Authors: Dahlin JL, Sinville R, Solberg J, Zhou H, Han J, Francis S, et al | Year: 2013 | Title: A cell-free fluorometric high-throughput screen for inhibitors of Rtt109-catalyzed histone acetylation

DOI: 10.1371/journal.pone.0078877 | PMID: 24260132

External validation: validated via pmid_exact

A PubMed-backed source record was validated for this matched reference.

A cell-free fluorometric high-throughput screen for inhibitors of Rtt109-catalyzed histone acetylation. | Dahlin JL, Sinville R, Solberg J, et al | 2013 | PMID 24260132

Parsed author summary differs from the returned PubMed authors.

reference Order 53 Matched in text not found

Relational Model. In Wikipedia, The Free Encyclopedia: Wikepedia Contributors; 2019.

At least one in-text citation resolves to this reference entry.

Matched in-text citations (1)

Citation LXV Needs review

53 | Style: numeric | Normalized: 53

Detected in block order 735: A well designed and properly implemented Relational Database Management System (RDBMS)...

Authors: Relational Model | Year: 2019 | Title: In Wikipedia, The Free Encyclopedia: Wikepedia Contributors; 2019

DOI: n/a | PMID: n/a

External validation: not found via title_author_year_search

No PubMed-backed source match was found for this matched reference.

External validation was recorded without a normalized source summary

No PubMed search candidates were returned for the normalized reference metadata.

Citation issues Open citation issues

No open citation issues were recorded for this document.