ERIC - Search Results

Showing all 11 results

Another Look at Yen's Q3: Is 0.2 an Appropriate Cut-Off?

Peer reviewed

Kelsey Nason; Christine DeMars – Journal of Educational Measurement, 2025

This study examined the widely used threshold of 0.2 for Yen's Q3, an index for violations of local independence. Specifically, a simulation was conducted to investigate whether Q3 values were related to the magnitude of bias in estimates of reliability, item parameters, and examinee ability. Results showed that Q3 values below the typical cut-off…

Descriptors: Item Response Theory, Statistical Bias, Test Reliability, Test Items

Reliability and Validity of the School Function Assessment for Children with Disabilities in Korea: Applying Rasch Analysis

Peer reviewed

Kim, Hun Ju; Lee, Sung Ja; Kam, Kyung-Yoon – International Journal of Disability, Development and Education, 2023

This study verified validity and reliability of the School Function Assessment (SFA) using Rasch analysis in South Korean school-based occupational therapy sites serving children with intellectual disabilities and others. Participants were 103 elementary school children (grades 1 through 6) with disabilities. Rasch analysis revealed several…

Descriptors: Foreign Countries, Test Validity, Test Reliability, Occupational Therapy

Effects of Differential Item Functioning on Examinees' Test Performance and Reliability of Test

Peer reviewed

Lee, Yi-Hsuan; Zhang, Jinming – International Journal of Testing, 2017

Simulations were conducted to examine the effect of differential item functioning (DIF) on measurement consequences such as total scores, item response theory (IRT) ability estimates, and test reliability in terms of the ratio of true-score variance to observed-score variance and the standard error of estimation for the IRT ability parameter. The…

Descriptors: Test Bias, Test Reliability, Performance, Scores

Psychometric Report for the Early Fractions Test (Version 2.2) Administered with Third- and Fourth-Grade Students in Spring 2017. Research Report No. 2017-11

Schoen, Robert C.; Yang, Xiaotong; Liu, Sicong; Paek, Insu – Grantee Submission, 2017

The Early Fractions Test v2.2 is a paper-pencil test designed to measure mathematics achievement of third- and fourth-grade students in the domain of fractions. The purpose, or intended use, of the Early Fractions Test v2.2 is to serve as a measure of student outcomes in a randomized trial designed to estimate the effect of an educational…

Descriptors: Psychometrics, Mathematics Tests, Mathematics Achievement, Fractions

The Effects of Rater Severity and Rater Distribution on Examinees' Ability Estimation for Constructed-Response Items. Research Report. ETS RR-13-23

Peer reviewed

Wang, Zhen; Yao, Lihua – ETS Research Report Series, 2013

The current study used simulated data to investigate the properties of a newly proposed method (Yao's rater model) for modeling rater severity and its distribution under different conditions. Our study examined the effects of rater severity, distributions of rater severity, the difference between item response theory (IRT) models with rater effect…

Descriptors: Test Format, Test Items, Responses, Computation

Reliability of Comparably Written Two-Option Multiple-Choice and True-False Test Items.

Peer reviewed

Hancock, Gregory R.; And Others – Educational and Psychological Measurement, 1993

Two-option multiple-choice vocabulary test items are compared with comparably written true-false test items. Results from a study with 111 high school students suggest that multiple-choice items provide a significantly more reliable measure than the true-false format. (SLD)

Descriptors: Ability, High School Students, High Schools, Objective Tests

The Relationship between the Distribution of Item Difficulties and Test Reliability.

Peer reviewed

Feldt, Leonard S. – Applied Measurement in Education, 1993

The recommendation that the reliability of multiple-choice tests will be enhanced if the distribution of item difficulties is concentrated at approximately 0.50 is reinforced and extended in this article by viewing the 0/1 item scoring as a dichotomization of an underlying normally distributed ability score. (SLD)

Descriptors: Ability, Difficulty Level, Guessing (Tests), Mathematical Models

The Second Century of Ability Testing: Some Predictions and Speculations

Peer reviewed

Embretson, Susan E. – Measurement: Interdisciplinary Research and Perspectives, 2004

The last century was marked by dazzling changes in many areas, such as technology and communications. Predictions into the second century of testing are seemingly difficult in such a context. Yet, looking back to the turn of the last century, Kirkpatrick (1900), in his American Psychological Association presidential address, presented fundamental…

Descriptors: Ability, Testing, Futures (of Society), Psychometrics

The Effects of the Number of Options per Item and Student Ability on Test Validity and Reliability.

Peer reviewed

Trevisan, Michael S.; And Others – Educational and Psychological Measurement, 1991

The reliability and validity of multiple-choice tests were computed as a function of the number of options per item and student ability for 435 parochial high school juniors, who were administered the Washington Pre-College Test Battery. Results suggest the efficacy of the three-option item. (SLD)

Descriptors: Ability, Comparative Testing, Distractors (Tests), Grade Point Average

Psychological Testing: Theory and Applications.

Janda, Louis H. – 1998

This text prepares students to quantify observations through psychological testing. Measurement is critical in all the subareas of psychology, and the text begins by discussing the applications of testing in the subdisciplines of psychology. The book also discusses the extent to which tests are actually used. Early chapters discuss general…

Descriptors: Ability, Clinical Psychology, Counseling Psychology, Diagnostic Tests

The Differential Aptitude Test: A Review and Critique.

Wang, Lin – 1993

The Differential Aptitude Test (DAT) is a multiple aptitude battery designed to measure junior and senior high school students' and adults' ability to learn or succeed in certain areas. The test is suitable for group administration and is primarily for use in educational and vocational counseling, although it may be used in employee selection. The…

Descriptors: Ability, Adults, Aptitude Tests, Career Counseling
