ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	0
Since 2016 (last 10 years)	0
Since 2006 (last 20 years)	9

Descriptor

Statistical Analysis	26
Test Construction	26
Testing Programs	26
Test Reliability	9
Educational Testing	8
Multiple Choice Tests	8
Educational Research	6
Elementary Secondary Education	6
Evaluation Research	6
State Programs	6
Test Results	6
Test Validity	6
Item Response Theory	5
Secondary School Students	5
Achievement Tests	4
Curriculum Based Assessment	4
Psychometrics	4
Reading Comprehension	4
Reading Tests	4
Scoring	4
Screening Tests	4
Test Items	4
Bulletins	3
Data Analysis	3
Educational Assessment	3
More ▼

Source

Behavioral Research and…	4
ETS Research Report Series	2
Educational Testing Service	2
Educational Measurement:…	1

Publication Type

Reports - Research	7
Reports - Evaluative	6
Numerical/Quantitative Data	4
Reports - Descriptive	4
Speeches/Meeting Papers	4
Journal Articles	3
Guides - Non-Classroom	1
Tests/Questionnaires	1

Education Level

Elementary Secondary Education	6
Elementary Education	4
Higher Education	3
Grade 3	1
Grade 4	1
Grade 5	1
Grade 6	1
Postsecondary Education	1

Audience

Researchers	2
Parents	1
Policymakers	1

Location

United Kingdom (England)	2
California	1

Laws, Policies, & Programs

Assessments and Surveys

National Assessment of…	2
SAT (College Admission Test)	2
Massachusetts Comprehensive…	1
National Longitudinal Study…	1
National Teacher Examinations	1
Praxis Series	1

What Works Clearinghouse Rating

Showing 1 to 15 of 26 results Save | Export

Exploring Alternative Test Form Linking Designs with Modified Equating Sample Size and Anchor Test Length. Research Report. ETS RR-13-02

Peer reviewed
PDF on ERIC

Download full text

Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013

The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…

Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 3. Technical Report #1202

Download full text

Lai, Cheng-Fei; Irvin, P. Shawn; Park, Bitnara Jasmine; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the third-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 3, Curriculum Based Assessment, Educational Testing, Testing Programs

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 5. Technical Report #1204

Download full text

Park, Bitnara Jasmine; Irvin, P. Shawn; Lai, Cheng-Fei; Alonzo, Julie; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the fifth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 5, Curriculum Based Assessment, Educational Testing, Testing Programs

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 4. Technical Report #1203

Download full text

Park, Bitnara Jasmine; Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the fourth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 4, Curriculum Based Assessment, Educational Testing, Testing Programs

Constructed-Response DIF Evaluations for Mixed-Format Tests. Research Report. ETS RR-13-33

Peer reviewed
PDF on ERIC

Download full text

Moses, Tim; Liu, Jinghua; Tan, Adele; Deng, Weiling; Dorans, Neil J. – ETS Research Report Series, 2013

In this study, differential item functioning (DIF) methods utilizing 14 different matching variables were applied to assess DIF in the constructed-response (CR) items from 6 forms of 3 mixed-format tests. Results suggested that the methods might produce distinct patterns of DIF results for different tests and testing programs, in that the DIF…

Descriptors: Test Construction, Multiple Choice Tests, Test Items, Item Analysis

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 6. Technical Report #1205

Download full text

Irvin, P. Shawn; Alonzo, Julie; Park, Bitnara Jasmine; Lai, Cheng-Fei; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the sixth-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Grade 6, Grade 3, Curriculum Based Assessment, Educational Testing

Universal Design and Multimethod Approaches to Item Review

Peer reviewed

Direct link

Johnstone, Christopher J.; Thompson, Sandra J.; Bottsford-Miller, Nicole A.; Thurlow, Martha L. – Educational Measurement: Issues and Practice, 2008

Test items undergo multiple iterations of review before states and vendors deem them acceptable to be placed in a live statewide assessment. This article reviews three approaches that can add validity evidence to states' item review processes. The first process is a structured sensitivity review process that focuses on universal design…

Descriptors: Test Items, Disabilities, Test Construction, Testing Programs

2010-11 Research Portfolio: Research & Development Division

Download full text

Educational Testing Service, 2010

This document describes the breadth of the research that the ETS (Educational Testing Service) Research & Development division is conducting in 2010. This portfolio will be updated in early 2011 to reflect changes to existing projects and new projects that were added after this document was completed. The research described in this portfolio falls…

Descriptors: Portfolios (Background Materials), Testing Programs, Educational Testing, Private Agencies

2008 Research Portfolio: Research & Development Division

Download full text

Educational Testing Service, 2008

This document describes the breadth of the research being conducted in 2008 by the Research and Development Division at Educational Testing Service (ETS). The research described falls into three large categories: (1) Research supported by the ETS research allocation; (2) Research funded by testing programs at ETS; and (3) Research funded by…

Descriptors: Research and Development, Testing Programs, Educational Testing, Educational Research

NLSMA Reports, No. 7, The Development of Tests.

Romberg, Thomas A.; Wilson, James W. – 1969

This is one of a series of reports on the National Longitudinal Study of Mathematical Abilities (NLSMA). This report describes the processes used for deciding what should be measured, when, and how. Work of the SMSG Panel on Tests for collecting tests items, conceptualizing scales, pilot testing, and analyzing pilot test data is reviewed. The…

Descriptors: Educational Research, Longitudinal Studies, Mathematics Education, Psychological Testing

Setting Cut Scores on Large-Scale Assessments. The State Board Connection Issues in Brief.

National Association of State Boards of Education, Alexandria, VA. – 1999

This Brief builds on the work of the National Association of State Boards of Education 1997 Study Group on State Assessment Systems by examining one of the state board actions that is most likely to capture the publics attention: setting cut scores on state assessments. This process involves a large measure of human judgment and politics and a…

Descriptors: Cutting Scores, Educational Testing, Standard Setting, State Programs

The Effects of Finite Sampling on State Assessment Sample Requirements. NAEP Validity Studies. Working Paper Series.

Download full text

Chromy, James R. – 2003

This study addressed statistical techniques that might ameliorate some of the sampling problems currently facing states with small populations participating in State National Assessment of Educational Progress (NAEP) assessments. The study explored how the application of finite population correction factors to the between-school component of…

Descriptors: Elementary Secondary Education, National Surveys, Sample Size, Sampling

A Testing Program for Introductory Accounting.

Download full text

Hines, Everett B. – 1973

The accounting department at the University of Arizona, faced with numerous sections of introductory accounting, full classrooms, testing periods spread over two days, and a shortage of clerical help, evolved this testing program for the course in introductory accounting. Two objective multiple choice tests are constructed which sample different…

Descriptors: Accounting, Computer Oriented Programs, Multiple Choice Tests, Program Descriptions

The Certificate of Secondary Education: Experimental Examinations--Science. Examinations Bulletin No. 8.

Schools Council, London (England). – 1965

This bulletin describes two phases of an experiment in examining science, one in 1963, and the second in 1964. The first phase of the experiment explored two forms of assessment: a Scientific Thinking paper; and a Practical paper. In the second phase two further factors were introduced: a Facts and Principles paper; and an Assessment of course…

Descriptors: Bulletins, Educational Methods, Evaluation Methods, Sciences

Development and Analysis of a Taped and Written Test for Guidance Counselors: A Pilot Study.

Download full text

Humphry, Betty – 1973

The two phases in the development and tryout of a Guidance Counselor Test to be added to the National Teacher Examinations Program are discussed. In Phase One, a 150-item written test and a 50-item written test based on taped stimulus material were produced. Each test consisted of five-choice multiple-choice questions. In Phase Two, the tests were…

Descriptors: Counselor Evaluation, Graduate Students, Guidance Personnel, Higher Education

Previous Page | Next Page »

Pages: 1 | 2

Alonzo, Julie	4
Irvin, P. Shawn	4
Lai, Cheng-Fei	4
Park, Bitnara Jasmine	4
Tindal, Gerald	4
Angoff, William H., Ed.	1
Bottsford-Miller, Nicole A.	1
Chromy, James R.	1
Cope, Ronald T.	1
Crovo, Mary L.	1
Deng, Weiling	1
Dorans, Neil J.	1
Hines, Everett B.	1
Humphry, Betty	1
Johnstone, Christopher J.	1
Lee, Yi-Hsuan	1
Liu, Jinghua	1
McDole, Thomas L.	1
McGuire, Dennis P.	1
Moses, Tim	1
Phillips, Gary W.	1
Pollock, William T.	1
Qian, Jiahe	1
Romberg, Thomas A.	1
Tan, Adele	1
More ▼