ERIC - Search Results

Publication Date

In 2025	0
Since 2024	0
Since 2021 (last 5 years)	2
Since 2016 (last 10 years)	2
Since 2006 (last 20 years)	4

Descriptor

Computer Software	8
Item Analysis	8
Test Validity	8
Test Items	6
Test Reliability	3
Achievement Tests	2
Difficulty Level	2
Evaluation Methods	2
Foreign Countries	2
High Stakes Tests	2
Item Response Theory	2
Psychometrics	2
Science Tests	2
Scoring	2
Statistical Analysis	2
Test Construction	2
Accuracy	1
Adults	1
African Americans	1
Age Differences	1
Artificial Intelligence	1
Certification	1
Cheating	1
Classification	1
Comparative Analysis	1
More ▼

Source

Educational and Psychological…	2
Applied Psychological…	1
International Journal of…	1
Journal of Education and…	1
Language Assessment Quarterly	1
Online Submission	1

Publication Type

Journal Articles	6
Reports - Research	6
Speeches/Meeting Papers	2
Information Analyses	1
Numerical/Quantitative Data	1
Reports - Evaluative	1

Education Level

Adult Education	1
Elementary Secondary Education	1
Secondary Education	1

Audience

Location

Nigeria

Laws, Policies, & Programs

Assessments and Surveys

International English…	1
Peabody Picture Vocabulary…	1
Test of English as a Foreign…	1
Trends in International…	1

What Works Clearinghouse Rating

Showing all 8 results Save | Export

A Cognitive Diagnostic Assessment Study of the Reading Comprehension Section of the Preliminary English Test (PET)

Peer reviewed
PDF on ERIC

Download full text

Mohammed, Aisha; Dawood, Abdul Kareem Shareef; Alghazali, Tawfeeq; Kadhim, Qasim Khlaif; Sabti, Ahmed Abdulateef; Sabit, Shaker Holh – International Journal of Language Testing, 2023

Cognitive diagnostic models (CDMs) have received much interest within the field of language testing over the last decade due to their great potential to provide diagnostic feedback to all stakeholders and ultimately improve language teaching and learning. A large number of studies have demonstrated the application of CDMs on advanced large-scale…

Descriptors: Reading Comprehension, Reading Tests, Language Tests, English (Second Language)

Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks

Peer reviewed

Direct link

von Davier, Matthias; Tyack, Lillian; Khorramdel, Lale – Educational and Psychological Measurement, 2023

Automated scoring of free drawings or images as responses has yet to be used in large-scale assessments of student achievement. In this study, we propose artificial neural networks to classify these types of graphical responses from a TIMSS 2019 item. We are comparing classification accuracy of convolutional and feed-forward approaches. Our…

Descriptors: Scoring, Networks, Artificial Intelligence, Elementary Secondary Education

Development and Validation of Scientific Literacy Achievement Test to Assess Senior Secondary School Students' Literacy Acquisition in Physics

Peer reviewed
PDF on ERIC

Download full text

Adeleke, A. A.; Joshua, E. O. – Journal of Education and Practice, 2015

Physics literacy plays a crucial part in global technological development as several aspects of science and technology apply concepts and principles of physics in their operations. However, the acquisition of scientific literacy in physics in our society today is not encouraging enough to the desirable standard. Therefore, this study focuses on…

Descriptors: Physics, Secondary School Students, Scientific Literacy, Foreign Countries

Construct Validity and Measurement Invariance of the Peabody Picture Vocabulary Test-III Form A

Peer reviewed

Direct link

Pae, Hye K.; Greenberg, Daphne; Morris, Robin D. – Language Assessment Quarterly, 2012

The aim of this study was to apply the Rasch model to an analysis of the psychometric properties of the Peabody Picture Vocabulary Test--III Form A (PPVT--IIIA) items with struggling adult readers. The PPVT--IIIA was administered to 229 African American adults whose isolated word reading skills were between third and fifth grades. Conformity of…

Descriptors: African Americans, Test Items, Construct Validity, Test Validity

Three Coefficients for Analyzing the Reliability and Validity of Ratings.

Peer reviewed

Aiken, Lewis R. – Educational and Psychological Measurement, 1985

Three numerical coefficients for analyzing the validity and reliability of ratings are described. Each coefficient is computed as the ratio of an obtained to a maximum sum of differences in ratings. The coefficients are also applicable to the item analysis, agreement analysis, and cluster or factor analysis of rating-scale data. (Author/BW)

Descriptors: Computer Software, Data Analysis, Factor Analysis, Item Analysis

An Evaluation of "Polyweighting" in Domain-Referenced Testing.

Sympson, J. Bradford; Haladyna, Thomas M. – 1988

A new approach to polychotomous scoring of test items, similar to "max-alpha" scaling (MAS) and known as polyweighting, has been developed. Unlike MAS, this new method of polychotomous scoring provides scoring weights for a given item that are independent of the difficulty of other items in the analysis. Moreover, the scoring weights are…

Descriptors: Computer Software, Difficulty Level, Item Analysis, Latent Trait Theory

DIFAS: Differential Item Functioning Analysis System. Computer Program Exchange

Peer reviewed

Direct link

Penfield, Randall D. – Applied Psychological Measurement, 2005

Differential item functioning (DIF) is an important consideration in assessing the validity of test scores (Camilli & Shepard, 1994). A variety of statistical procedures have been developed to assess DIF in tests of dichotomous (Hills, 1989; Millsap & Everson, 1993) and polytomous (Penfield & Lam, 2000; Potenza & Dorans, 1995) items. Some of these…

Descriptors: Test Bias, Item Analysis, Psychological Studies, Evaluation Methods

An Analysis of Item Exposure and Item Parameter Drift on a Take-Home Recertification Exam

Download full text

Giordano, Carolyn; Subhiyah, Raja; Hess, Brian – Online Submission, 2005

There are few certifying or recertifying examinations in the medical field that are given in a take-home format. This stems from a concern that examinees may discuss items with peers, or save copies of items on the exam and then pass them on to others. This study examined if item exposure on take-home examinations influences the difficulty of the…

Descriptors: Computer Software, Test Items, Certification, Licensing Examinations (Professions)

Adeleke, A. A.	1
Aiken, Lewis R.	1
Alghazali, Tawfeeq	1
Dawood, Abdul Kareem Shareef	1
Giordano, Carolyn	1
Greenberg, Daphne	1
Haladyna, Thomas M.	1
Hess, Brian	1
Joshua, E. O.	1
Kadhim, Qasim Khlaif	1
Khorramdel, Lale	1
Mohammed, Aisha	1
Morris, Robin D.	1
Pae, Hye K.	1
Penfield, Randall D.	1
Sabit, Shaker Holh	1
Sabti, Ahmed Abdulateef	1
Subhiyah, Raja	1
Sympson, J. Bradford	1
Tyack, Lillian	1
von Davier, Matthias	1
More ▼