Group Assessment and Diagnostic Evaluation

Group Math Assessment and Diagnostic Evaluation (G-MADE)

Rating Summary

Classification Accuracyhalf bubble
GeneralizabilityModerate Low
Reliabilityfull bubble
Validityhalf bubble
Disaggregated Reliability and Validity DataN A
AdministrationIndividual Group
Administration & Scoring Time46-95 Minutes
Scoring KeyComputer Scored
Benchmarks / NormsYes
Cost Technology, Human Resources, and Accommodations for Special Needs Service and Support Purpose and Other Implementation Information Usage and Reporting

G-MADE costs $116.50 - $309.95 (depending on the grade level) for the complete kit, which includes: 30 test booklets, Forms A and B, and the Administration, Scoring and Interpretation Manual.

Additional test forms can be purchased for $32.50 for a package of 10. Technical manuals are $35.50 and protocols are $3.25 per student.

There are three software options: (1) hand entry scoring and reporting software is $411.95; (2) scanning, scoring, and reporting software is $2,341.95; and (3) scanning with barcode scoring and reporting software is $2,695.95

Computer and Internet access are required for full use of product services.

Testers will require less than 1 hour of training to learn to administer the test, or 1 to 4 hours to learn to administer and interpret the test.

Paraprofessionals can administer the test.

The test questions can be read to students with reading disabilities and can be translated for ELL students. Test booklets can be enlarged for students with visual impairments and students can respond with alternative response devices if they struggle or are unable to bubble in an answer sheet.

19500 Bulverde Road
San Antonio, TX 78259

Phone: 877-324-2401

Field-tested training manuals are included and should provide all implementation information.

Ongoing technical support is available.

The G-MADE is a norm-referenced, standards-based assessment of math skills. It is intended for use with students in kindergarten through high school. It includes nine test levels, each of which has two parallel forms (Form A and Form B). Eight of the nine test levels contain three sections or subtests: Concepts and Communication, Operations and Computation, and Process and Computation. 

The G-MADE can be used to observe progress and track growth of an individual student or group of students (an entire class, building, or district) from form to form or from year to year.

G-MADE can be administered individually or to a group. Administration usually takes 45-90 minutes, although the test is untimed. Scoring requires 1-5 minutes depending on whether scoring software is used or not.  

Available scores include: raw, standard, percentile, grade-equivalent, IRT-based, stanines, normal curve equivalents, subscale/subtest, composite, and error analysis.


Classification Accuracy

  Classification Accuracy in Predicting Proficiency on:
False Positive Rate 0.39 0.30 0.27
False Negative Rate 0.30 0.14 0.20
Sensitivity 0.70 0.86 0.80
Specificity 0.61 0.70 0.73
Positive Predictive Power 0.33 0.44 0.49
Negative Predictive Power 0.88 0.95 0.92
Overall Classification Rate 0.63 0.74 0.75
AUC (ROC) 0.74 0.86 0.83
Base Rate 0.22 0.21 0.24
Cut Points: 96 86 95
At 90% Sensitivity, Specificity equals 0.46 0.67 0.59
At 80% Sensitivity, Specificity equals 0.52 0.73 0.73
At 70% Sensitivity, Specificity equals 0.61 0.84 0.78



Description of study sample:

  • Number of States: 44
  • Size: total = 737
    • ITBS = 185
    • TerraNova = 238
    • ITED = 314
  • Gender:
    • 51% Male
    • 49% Female 
  • Race/Ethnicity:
    • 83% White, Non-Hispanic
    • 15% Black, Non-Hispanic
    • 1% Hispanic
    • 1% Other
  • Disability classification: Special education students if mainstreamed for part or all of the regular education day were included in the sample
  • First Language: English
  • Language proficiency status: 100% English Proficient


Type of Reliability Age or Grade n
Coefficient SEM Information (including normative data) /Subjects
range median
Internal Reliability- Total Test Alpha and Split Half Reliabilities 4-18 305-634 0.91-0.99 0.96 3.4-3.7  
Alternate Form Reliability 4-18 651
0.81-0.94 0.89 2.8-4.5  
Test-Retest Reliabilities 4-18 816
0.77-0.96 0.90 2.8-4.5  



Type of Validity Age or Grade Test or Criterion n (range) Coefficient (if applicable)
Range Median
Criterion Related Grades 1,2,3,4,6, Middle School, High School ITBS, TerraNova, ITED, TASKS 977
0.78-0.87 0.83
Predictive Grades 1-6, Middle and High School ITBS, Terra Nova, ITED 737
0.63-0.90 0.81