ERIC - Search Results

Showing all 13 results

Analyzing the Reliability of the easyCBM Reading Comprehension Measures: Grade 7. Technical Report #1206

Irvin, P. Shawn; Alonzo, Julie; Lai, Cheng-Fei; Park, Bitnara Jasmine; Tindal, Gerald – Behavioral Research and Teaching, 2012

In this technical report, we present the results of a reliability study of the seventh-grade multiple choice reading comprehension measures available on the easyCBM learning system conducted in the spring of 2011. Analyses include split-half reliability, alternate form reliability, person and item reliability as derived from Rasch analysis,…

Descriptors: Reading Comprehension, Testing Programs, Statistical Analysis, Grade 7

Do Questions Written in the Target Language Make Foreign Language Listening Comprehension Tests More Difficult?

Peer reviewed

Filipi, Anna – Language Testing, 2012

The Assessment of Language Competence (ALC) certificates is an annual, international testing program developed by the Australian Council for Educational Research to test the listening and reading comprehension skills of lower to middle year levels of secondary school. The tests are developed for three levels in French, German, Italian and…

Descriptors: Listening Comprehension Tests, Item Response Theory, Statistical Analysis, Foreign Countries

Evaluating the Effects of Differences in Group Abilities on the Tucker and the Levine Observed-Score Methods for Common-Item Nonequivalent Groups Equating. ACT Research Report Series 2010-1

Chen, Hanwei; Cui, Zhongmin; Zhu, Rongchun; Gao, Xiaohong – ACT, Inc., 2010

The most critical feature of a common-item nonequivalent groups equating design is that the average score difference between the new and old groups can be accurately decomposed into a group ability difference and a form difficulty difference. Two widely used observed-score linear equating methods, the Tucker and the Levine observed-score methods,…

Descriptors: Equated Scores, Groups, Ability Grouping, Difficulty Level

Item Position and Item Difficulty Change in an IRT-Based Common Item Equating Design

Peer reviewed

Meyers, Jason L.; Miller, G. Edward; Way, Walter D. – Applied Measurement in Education, 2009

In operational testing programs using item response theory (IRT), item parameter invariance is threatened when an item appears in a different location on the live test than it did when it was field tested. This study utilizes data from a large state's assessments to model change in Rasch item difficulty (RID) as a function of item position change,…

Descriptors: Test Items, Test Content, Testing Programs, Simulation

Taking the Easy Way Out: How the GED Testing Program Induces Students to Drop Out. NBER Working Paper No. 14044

Heckman, James J.; LaFontaine, Paul A.; Rodriguez, Pedro L. – National Bureau of Economic Research, 2008

We exploit an exogenous increase in General Educational Development (GED) testing requirements to determine whether raising the difficulty of the test causes students to finish high school rather than drop out and GED certify. We find that a six point decrease in GED pass rates induces a 1.3 point decline in overall dropout rates. The effect size…

Descriptors: Testing Programs, Dropout Rate, Dropouts, High School Equivalency Programs

Using Traditional Psychometric Methodologies and the Rasch Model in Designing a Test.

Crislip, Marian A.; Chin-Chance, Selvin – 2001

This paper discusses the use of two theories of item analysis and test construction, their strengths and weaknesses, and applications to the design of the Hawaii State Test of Essential Competencies (HSTEC). Traditional analyses of the data collected from the HSTEC field test were viewed from the perspectives of item difficulty levels and item…

Descriptors: Difficulty Level, Item Response Theory, Psychometrics, Reliability

Overview of the Most Difficult Technical Issues on the VNT.

Skaggs, Gary; Bourque, Mary Lyn – 1998

Political and legislative pressures have posed a number of measurement issues and challenges to the development of sound, valid voluntary national tests (VNTs). This paper focuses on what appear to be the most difficult technical issues related to the VNT proposed by President Clinton in 1997. Technical issues refer to psychometric issues, as…

Descriptors: Academic Achievement, Achievement Tests, Classification, Difficulty Level

OCOD-CTTP Test Evaluation Report.

Shorey, Leonard – 1991

Tests in social studies and integrated science given in Saint Vincent, Saint Lucia, Grenada, and Dominica were analyzed by the Organization for Co-operation in Overseas Development (OCOD) Comprehensive Teacher Training Program (CTTP) for discrimination, difficulty, and reliability, as well as other characteristics. There were 767 examinees for the…

Descriptors: Difficulty Level, Elementary Secondary Education, Evaluation Methods, Foreign Countries

Cautionary Observations on Reliability and Equating of Forms in High Stakes Performance Assessment: The Problem of Granularity.

Cope, Ronald T. – 1995

This paper deals with the problems that arise in performance assessment from the granularity that results from having a small number of tasks or prompts and raters of responses to these tasks or prompts. Two problems are discussed in detail: (1) achieving a satisfactory degree of reliability; and (2) equating or adjusting for differences of…

Descriptors: Difficulty Level, Educational Assessment, Equated Scores, High Stakes Tests

Setting Higher Sights: A Need for More Demanding Assessments for U.S. Eighth Graders.

American Federation of Teachers, Washington, DC. – 1998

An examination of the content and level of mastery required of students taking statewide mathematics achievement tests was conducted to provide clues about the kind and level of mathematics valued in the United States. Proposals for voluntary national tests also contributed to the rationale for the study. In particular, this study: (1) examined…

Descriptors: Difficulty Level, Educational Assessment, Foreign Countries, Grade 8

Competency Testing: Setting Educational Performance Standards for the Group.

Bayless, David L.; Nix, Charles W. – 1979

The merits and hazards of minimum competency testing for the individual student or for student groups are discussed. Types of groups which lend themselves to group application and some important factors in determining the parameters of a group are discussed. Ten critical issues related to minimum competency testing are identified: (1) scope of…

Descriptors: Academic Standards, Cutting Scores, Difficulty Level, Elementary Secondary Education

Practical Questions about Item Response Models in Large-Scale Assessment Programs.

Legg, Sue M.; Algina, James – 1986

This paper focuses on the questions which arise as test practitioners monitor score scales derived from latent trait theory. Large scale assessment programs are dynamic and constantly challenge the assumptions and limits of latent trait models. Even though testing programs evolve, test scores must remain reliable indicators of progress.…

Descriptors: Difficulty Level, Educational Assessment, Elementary Secondary Education, Equated Scores

A Descriptive Comparison of Test Item Statistics from Items Utilized in an Item Pilot, a Form Pilot, and Live Administrations of the Alabama High School Graduation Examination: The 1991 Update.

Steele, D. Joyce – 1991

This paper compares descriptive information based on analyses of the pilot and live administrations of the Alabama High School Graduation Examination (AHSGE). The AHSGE, a product of decisions made in 1977 and 1984 by the Alabama State Board of Education, is composed of subject tests in reading, mathematics, and language. The pass score for each…

Descriptors: Comparative Testing, Difficulty Level, Grade 11, Graduation Requirements
