ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	1
Since 2016 (last 10 years)	5
Since 2006 (last 20 years)	9

Descriptor

Comparative Analysis	12
Foreign Countries	12
International Assessment	5
Achievement Tests	4
Item Analysis	3
Secondary School Students	3
Test Bias	3
Test Items	3
Computation	2
Computer Assisted Testing	2
Difficulty Level	2
English	2
Error of Measurement	2
French	2
High School Students	2
High Schools	2
Mathematics Tests	2
Reading Tests	2
Scaling	2
Ability	1
Academic Achievement	1
Accuracy	1
Adaptive Testing	1
Benchmarking	1
Classification	1
More ▼

Source

Applied Measurement in…

Publication Type

Journal Articles	12
Reports - Research	10
Reports - Evaluative	2
Information Analyses	1

Education Level

Secondary Education	5
Elementary Secondary Education	3
Elementary Education	1
Grade 7	1
Grade 8	1
Junior High Schools	1
Middle Schools	1

Audience

Location

Canada	2
Australia	1
South Korea	1
Spain	1

Laws, Policies, & Programs

Assessments and Surveys

Program for International…	7
Graduate Record Examinations	1
Test of English as a Foreign…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 12 results Save | Export

Comparing the Robustness of Three Nonparametric DIF Procedures to Differential Rapid Guessing

Peer reviewed

Direct link

Abulela, Mohammed A. A.; Rios, Joseph A. – Applied Measurement in Education, 2022

When there are no personal consequences associated with test performance for examinees, rapid guessing (RG) is a concern and can differ between subgroups. To date, the impact of differential RG on item-level measurement invariance has received minimal attention. To that end, a simulation study was conducted to examine the robustness of the…

Descriptors: Comparative Analysis, Robustness (Statistics), Nonparametric Statistics, Item Analysis

Standard Errors for National Trends in International Large-Scale Assessments in the Case of Cross-National Differential Item Functioning

Peer reviewed

Direct link

Sachse, Karoline A.; Haag, Nicole – Applied Measurement in Education, 2017

Standard errors computed according to the operational practices of international large-scale assessment studies such as the Programme for International Student Assessment's (PISA) or the Trends in International Mathematics and Science Study (TIMSS) may be biased when cross-national differential item functioning (DIF) and item parameter drift are…

Descriptors: Error of Measurement, Test Bias, International Assessment, Computation

Analyzing Fairness among Linguistic Minority Populations Using a Latent Class Differential Item Functioning Approach

Peer reviewed

Direct link

Oliveri, Maria Elena; Ercikan, Kadriye; Lyons-Thomas, Juliette; Holtzman, Steven – Applied Measurement in Education, 2016

Differential item functioning (DIF) analyses have been used as the primary method in large-scale assessments to examine fairness for subgroups. Currently, DIF analyses are conducted utilizing manifest methods using observed characteristics (gender and race/ethnicity) for grouping examinees. Homogeneity of item responses is assumed denoting that…

Descriptors: Test Bias, Language Minorities, Effect Size, Foreign Countries

On the Cross-Country Comparability of Indicators of Socioeconomic Resources in PISA

Peer reviewed

Direct link

Pokropek, Artur; Borgonovi, Francesca; McCormick, Carina – Applied Measurement in Education, 2017

Large-scale international assessments rely on indicators of the resources that students report having in their homes to capture the financial capital of their families. The scaling methodology currently used to develop the Programme for International Student Assessment (PISA) background indices is designed to maximize within-country comparability…

Descriptors: Foreign Countries, Achievement Tests, Secondary School Students, International Assessment

Requiring a Consistent Unit of Scale between the Responses of Students and Judges in Standard Setting

Peer reviewed

Direct link

Humphry, Stephen; Heldsinger, Sandra; Andrich, David – Applied Measurement in Education, 2014

One of the best-known methods for setting a benchmark standard on a test is that of Angoff and its modifications. When scored dichotomously, judges estimate the probability that a benchmark student has of answering each item correctly. As in most methods of standard setting, it is assumed implicitly that the unit of the latent scale of the…

Descriptors: Foreign Countries, Standard Setting (Scoring), Judges, Item Response Theory

Assessment of Complex Problem Solving: What We Know and What We Don't Know

Peer reviewed

Direct link

Herde, Christoph Nils; Wüstenberg, Sascha; Greiff, Samuel – Applied Measurement in Education, 2016

Complex Problem Solving (CPS) is seen as a cross-curricular 21st century skill that has attracted interest in large-scale-assessments. In the Programme for International Student Assessment (PISA) 2012, CPS was assessed all over the world to gain information on students' skills to acquire and apply knowledge while dealing with nontransparent…

Descriptors: Problem Solving, Achievement Tests, Foreign Countries, International Assessment

Comparison of Human and Machine Scoring of Essays: Differences by Gender, Ethnicity, and Country

Peer reviewed

Direct link

Bridgeman, Brent; Trapani, Catherine; Attali, Yigal – Applied Measurement in Education, 2012

Essay scores generated by machine and by human raters are generally comparable; that is, they can produce scores with similar means and standard deviations, and machine scores generally correlate as highly with human scores as scores from one human correlate with scores from another human. Although human and machine essay scores are highly related…

Descriptors: Scoring, Essay Tests, College Entrance Examinations, High Stakes Tests

Do Different Approaches to Examining Construct Comparability in Multilanguage Assessments Lead to Similar Conclusions?

Peer reviewed

Direct link

Oliveri, Maria E.; Ercikan, Kadriye – Applied Measurement in Education, 2011

In this study, we examine the degree of construct comparability and possible sources of incomparability of the English and French versions of the Programme for International Student Assessment (PISA) 2003 problem-solving measure administered in Canada. Several approaches were used to examine construct comparability at the test- (examination of…

Descriptors: Foreign Countries, English, French, Tests

How Do Other Countries Measure Up to the Mathematics Achievement Levels on the National Assessment of Educational Progress?

Peer reviewed

Direct link

Hambleton, Ronald K.; Sireci, Stephen G.; Smith, Zachary R. – Applied Measurement in Education, 2009

In this study, we mapped achievement levels from the National Assessment of Educational Progress (NAEP) onto the score scales for selected assessments from the Trends in International Mathematics and Science Study (TIMSS) and the Program for International Student Achievement (PISA). The mapping was conducted on NAEP, TIMSS, and PISA Mathematics…

Descriptors: National Competency Tests, Mathematics Achievement, Mathematics Tests, Comparative Analysis

Measuring Self-Efficacy: Multitrait-Multimethod Comparison of Scaling Procedures.

Peer reviewed

Bong, Mimi; Hocevar, Dennis – Applied Measurement in Education, 2002

Examined convergent and discriminant validity of various self-efficacy measures across two studies, one involving 358 U.S. high school students and another involving 235 Korean female high school students. Across the studies the first-order confirmatory factor analyses provide support for both convergent validity of different self-efficacy…

Descriptors: Comparative Analysis, Foreign Countries, High School Students, High Schools

The Effects of Test Difficulty Manipulation in Computerized Adaptive Testing and Self-Adapted Testing.

Peer reviewed

Ponsoda, Vicente; Olea, Julio; Rodriguez, Maria Soledad; Revuelta, Javier – Applied Measurement in Education, 1999

Compared easy and difficult versions of self-adapted tests (SAT) and computerized adapted tests. No significant differences were found among the tests for estimated ability or posttest state anxiety in studies with 187 Spanish high school students, although other significant differences were found. Discusses implications for interpreting test…

Descriptors: Ability, Adaptive Testing, Comparative Analysis, Computer Assisted Testing

Comparability of Bilingual Versions of Assessments: Sources of Incomparability of English and French Versions of Canada's National Achievement Tests

Peer reviewed

Direct link

Ercikan, Kadriye; Gierl, Mark J.; McCreith, Tanya; Puhan, Gautam; Koh, Kim – Applied Measurement in Education, 2004

This research examined the degree of comparability and sources of incomparability of English and French versions of reading, mathematics, and science tests that were administered as part of a survey of achievement in Canada. The results point to substantial psychometric differences between the 2 language versions. Approximately 18% to 36% of the…

Descriptors: Foreign Countries, Psychometrics, Science Tests, French

Ercikan, Kadriye	3
Abulela, Mohammed A. A.	1
Andrich, David	1
Attali, Yigal	1
Bong, Mimi	1
Borgonovi, Francesca	1
Bridgeman, Brent	1
Gierl, Mark J.	1
Greiff, Samuel	1
Haag, Nicole	1
Hambleton, Ronald K.	1
Heldsinger, Sandra	1
Herde, Christoph Nils	1
Hocevar, Dennis	1
Holtzman, Steven	1
Humphry, Stephen	1
Koh, Kim	1
Lyons-Thomas, Juliette	1
McCormick, Carina	1
McCreith, Tanya	1
Olea, Julio	1
Oliveri, Maria E.	1
Oliveri, Maria Elena	1
Pokropek, Artur	1
Ponsoda, Vicente	1
More ▼