ERIC - Search Results

Publication Date

In 2025	0
Since 2024	2
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	8
Since 2006 (last 20 years)	36

Descriptor

Educational Testing	226
Test Reliability	226
Test Validity	123
Test Construction	88
Test Interpretation	53
Achievement Tests	48
Testing Problems	45
Elementary Secondary Education	44
Student Evaluation	41
Standardized Tests	40
Evaluation Methods	33
Academic Achievement	31
Higher Education	30
Statistical Analysis	27
Educational Assessment	25
Test Selection	25
Multiple Choice Tests	24
Test Results	22
Testing Programs	22
Criterion Referenced Tests	21
Scores	21
Scoring	21
Testing	21
Test Bias	20
Norm Referenced Tests	19
More ▼

Education Level

Elementary Secondary Education	14
Elementary Education	10
Higher Education	5
Early Childhood Education	3
Grade 3	3
Grade 4	3
Middle Schools	3
Adult Education	2
Grade 5	2
High Schools	2
Postsecondary Education	2
Preschool Education	2
Grade 1	1
Grade 2	1
Grade 6	1
Grade 8	1
Intermediate Grades	1
Junior High Schools	1
Primary Education	1
Secondary Education	1
More ▼

Audience

Practitioners	16
Teachers	10
Administrators	6
Researchers	4
Counselors	3
Students	2
Community	1
Parents	1
Policymakers	1
Support Staff	1

Location

California	7
United Kingdom	3
Arizona (Phoenix)	1
Australia	1
California (Stanford)	1
Canada	1
Colorado (Denver)	1
Connecticut	1
France	1
Ghana	1
Illinois	1
Indiana	1
Israel	1
Japan	1
Maryland	1
Michigan	1
New Jersey	1
North America	1
United Kingdom (Wales)	1
More ▼

Laws, Policies, & Programs

Individuals with Disabilities…	2
Elementary and Secondary…	1
No Child Left Behind Act 2001	1

What Works Clearinghouse Rating

Showing 1 to 15 of 226 results Save | Export

Somers' D as an Alternative for the Item-Test and Item-Rest Correlation Coefficients in the Educational Measurement Settings

Peer reviewed
PDF on ERIC

Download full text

Metsämuuronen, Jari – International Journal of Educational Methodology, 2020

Pearson product-moment correlation coefficient between item g and test score X, known as item-test or item-total correlation ("Rit"), and item-rest correlation ("Rir") are two of the most used classical estimators for item discrimination power (IDP). Both "Rit" and "Rir" underestimate IDP caused by the…

Descriptors: Correlation, Test Items, Scores, Difficulty Level

Classroom Assessment: What Teachers Need to Know, 10th Edition

Direct link

W. James Popham – Pearson, 2024

"Classroom Assessment" shows pre- and in-service teachers how to use classroom testing accurately and formatively to dramatically increase their teaching effectiveness and promote student learning. In addition to clear and concise guidelines on how to develop and use quality classroom assessments, the author also focuses on the teaching…

Descriptors: Student Evaluation, Testing, Teacher Effectiveness, Test Construction

The Effect of the Ratio of Common Items and the Separation of Grade Distributions on the Precision of Vertical Scaling

Peer reviewed

Direct link

Guangming Li; Zhengyan Liang – SAGE Open, 2024

In order to investigate the influence of separation of grade distributions and ratio of common items on the precision of vertical scaling, this simulation study chooses common item design and first grade as base grade. There are four grades with 1,000 students each to take part in a test which has 100 items. Monte Carlo simulation method is used…

Descriptors: Elementary School Students, Grade 1, Grade 2, Grade 3

Assessment Literacy for Educators in a Hurry

Direct link

Popham, W. James – ASCD, 2018

What is assessment literacy? It is a handful of fundamental understandings about the testing concepts and procedures that influence educational decisions. And it just might be the most cost-effective means of real school improvement. With characteristic humor and aplomb, assessment expert W. James Popham strips away the psychometrician-speak and…

Descriptors: Student Evaluation, Educational Testing, Test Validity, Test Reliability

Digital Module 09: Sociocognitive Assessment for Diverse Populations

Peer reviewed

Direct link

Mislevy, Robert J.; Oliveri, Maria Elena – Educational Measurement: Issues and Practice, 2019

In this digital ITEMS module, Dr. Robert [Bob] Mislevy and Dr. Maria Elena Oliveri introduce and illustrate a sociocognitive perspective on educational measurement, which focuses on a variety of design and implementation considerations for creating fair and valid assessments for learners from diverse populations with diverse sociocultural…

Descriptors: Educational Testing, Reliability, Test Validity, Test Reliability

An Evaluative Framework for Reviewing Fairness Standards and Practices in Educational Tests

Peer reviewed

Direct link

Jonson, Jessica L.; Trantham, Pamela; Usher-Tate, Betty Jean – Educational Measurement: Issues and Practice, 2019

One of the substantive changes in the 2014 Standards for Educational and Psychological Testing was the elevation of fairness in testing as a foundational element of practice in addition to validity and reliability. Previous research indicates that testing practices often do not align with professional standards and guidelines. Therefore, to raise…

Descriptors: Culture Fair Tests, Test Validity, Test Reliability, Intelligence Tests

Using Reliability and Item Analysis to Evaluate a Teacher-Developed Test in Educational Measurement and Evaluation

Peer reviewed

Direct link

Quaigrain, Kennedy; Arhin, Ato Kwamina – Cogent Education, 2017

Item analysis is essential in improving items which will be used again in later tests; it can also be used to eliminate misleading items in a test. The study focused on item and test quality and explored the relationship between difficulty index (p-value) and discrimination index (DI) with distractor efficiency (DE). The study was conducted among…

Descriptors: Item Analysis, Teacher Developed Materials, Test Reliability, Educational Assessment

The Right Test for the Wrong Reason

Direct link

Popham, W. James – Phi Delta Kappan, 2014

The tests we use to evaluate student achievement may well be sound measures of what students know, but they are faulty indicators at best of how well they have been taught. A remedy to this this situation of judging teachers by the performance of their students on high-stakes tests may be in hand already. We should look to the methods successfully…

Descriptors: High Stakes Tests, Academic Achievement, Teacher Evaluation, Evaluation Methods

Effect of Violating Unidimensional Item Response Theory Vertical Scaling Assumptions on Developmental Score Scales

Direct link

Topczewski, Anna Marie – ProQuest LLC, 2013

Developmental score scales represent the performance of students along a continuum, where as students learn more they move higher along that continuum. Unidimensional item response theory (UIRT) vertical scaling has become a commonly used method to create developmental score scales. Research has shown that UIRT vertical scaling methods can be…

Descriptors: Item Response Theory, Scaling, Scores, Student Development

Using the Major Field Test for a Bachelor's Degree in Business as a Learning Outcomes Assessment: Evidence from a Review of 20 Years of Institution-Based Research

Peer reviewed

Direct link

Ling, Guangming; Bochenek, Jennifer; Burkander, Kri – Journal of Education for Business, 2015

By applying multilevel models with random effects, the authors reviewed and synthesized findings from 30 studies that were published in the last 20 years exploring the relationship between the Educational Testing Service Major Field Test for a Bachelor's Degree in Business (MFTB) and related factors. The results suggest that MFTB scores correlated…

Descriptors: Bachelors Degrees, Institutional Research, Educational Testing, Scores

Social Epistemology and the Pragmatics of Assessment

Peer reviewed

Direct link

Gergen, Kenneth J.; Dixon-Román, Ezekiel J. – Teachers College Record, 2014

In the present offering we challenge the presumption that the educational testing of students provides objective information about such students. This presumption largely rests on an empiricist account of science. In light of mounting criticism, however, empiricist foundationalism has given way to a social epistemology. From this standpoint,…

Descriptors: Epistemology, Educational Testing, Test Validity, Evaluation Utilization

Does It Matter Whether One Takes a Test on an iPad or a Desktop Computer?

Peer reviewed

Direct link

Ling, Guangming – International Journal of Testing, 2016

To investigate possible iPad related mode effect, we tested 403 8th graders in Indiana, Maryland, and New Jersey under three mode conditions through random assignment: a desktop computer, an iPad alone, and an iPad with an external keyboard. All students had used an iPad or computer for six months or longer. The 2-hour test included reading, math,…

Descriptors: Educational Testing, Computer Assisted Testing, Handheld Devices, Computers

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 3. Technical Report #1202

Download full text

Lai, Cheng-Fei; Irvin, P. Shawn; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the third-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 3, Curriculum Based Assessment, Educational Testing, Testing Programs

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 5. Technical Report #1204

Download full text

Park, Bitnara Jasmine; Irvin, P. Shawn; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the fifth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 5, Curriculum Based Assessment, Educational Testing, Testing Programs

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 4. Technical Report #1203

Download full text

Park, Bitnara Jasmine; Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the fourth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 4, Curriculum Based Assessment, Educational Testing, Testing Programs

Previous Page | Next Page »

Pages: 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | ... | 16

Educational Measurement:…	5
Behavioral Research and…	4
Educational Research	4
Journal of Experimental…	4
Educational and Psychological…	3
Journal of Economic Education	3
Journal of Educational…	3
National Elementary Principal	3
ProQuest LLC	3
American School Board Journal	2
Clearing House	2
Educational Evaluation and…	2
International Journal of…	2
Measurement and Evaluation in…	2
NHSA Dialog	2
Phi Delta Kappan	2
Regional Educational…	2
Times Educational Supplement…	2
ADE Bulletin	1
ASCD	1
Alberta Journal of…	1
American Educational Research…	1
Assessment in Education:…	1
B. C. Journal of Special…	1
Canadian Journal of School…	1
More ▼

White, Edward M.	6
Ebel, Robert L.	5
Alonzo, Julie	4
Irvin, P. Shawn	4
Lai, Cheng-Fei	4
Park, Bitnara Jasmine	4
Popham, W. James	4
Tindal, Gerald	4
Bagnato, Stephen J.	2
Booker, Kevin	2
Brady, Raymond G.	2
Bruch, Julie	2
Garvin, Alfred D.	2
Gill, Brian	2
Hansen, Duncan N.	2
Hopkins, Kenneth D.	2
Ling, Guangming	2
Macy, Marisa	2
Oller, John W., Jr.	2
Reckase, Mark D.	2
Saunders, Phillip	2
Smith, Douglas K.	2
Yelvington, James Yowell	2
More ▼

Reports - Research	59
Journal Articles	54
Reports - Descriptive	24
Reports - Evaluative	23
Speeches/Meeting Papers	20
Opinion Papers	18
Books	12
Guides - Non-Classroom	12
Guides - Classroom - Teacher	6
Numerical/Quantitative Data	6
Collected Works - Proceedings	4
Guides - General	4
Information Analyses	4
Tests/Questionnaires	4
Collected Works - General	3
Dissertations/Theses -…	3
Collected Works - Serials	2
Dissertations/Theses -…	2
Dissertations/Theses -…	1
Legal/Legislative/Regulatory…	1
Reference Materials -…	1
Reference Materials -…	1
More ▼

ACT Assessment	3
Iowa Tests of Basic Skills	3
Stanford Achievement Tests	3
Dynamic Indicators of Basic…	2
Kaufman Assessment Battery…	2
National Assessment of…	2
Preliminary Scholastic…	2
Stanford Binet Intelligence…	2
California Achievement Tests	1
Comprehensive Tests of Basic…	1
Continuous Performance Test	1
Cornell Critical Thinking Test	1
Graduate Record Examinations	1
Group Embedded Figures Test	1
Learning Style Inventory	1
Lorge Thorndike Intelligence…	1
Nelson Denny Reading Tests	1
New Jersey College Basic…	1
Pediatric Evaluation of…	1
Preschool Inventory	1
Strong Campbell Interest…	1
System of Multicultural…	1
Test of Understanding in…	1
Watson Glaser Critical…	1
Wide Range Achievement Test	1
More ▼