Publication Date
In 2025 | 1 |
Since 2024 | 2 |
Since 2021 (last 5 years) | 10 |
Since 2016 (last 10 years) | 27 |
Since 2006 (last 20 years) | 58 |
Descriptor
Comparative Analysis | 75 |
Psychometrics | 75 |
Test Items | 75 |
Foreign Countries | 29 |
Item Response Theory | 25 |
Difficulty Level | 23 |
Scores | 23 |
Test Construction | 18 |
Item Analysis | 16 |
Statistical Analysis | 14 |
Scoring | 13 |
More ▼ |
Source
Author
Publication Type
Education Level
Audience
Researchers | 2 |
Practitioners | 1 |
Students | 1 |
Location
Germany | 4 |
South Korea | 3 |
United States | 3 |
France | 2 |
Iran | 2 |
Nigeria | 2 |
South Africa | 2 |
Spain | 2 |
Africa | 1 |
Australia | 1 |
Canada | 1 |
More ▼ |
Laws, Policies, & Programs
No Child Left Behind Act 2001 | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Kaja Haugen; Cecilie Hamnes Carlsen; Christine Möller-Omrani – Language Awareness, 2025
This article presents the process of constructing and validating a test of metalinguistic awareness (MLA) for young school children (age 8-10). The test was developed between 2021 and 2023 as part of the MetaLearn research project, financed by The Research Council of Norway. The research team defines MLA as using metalinguistic knowledge at a…
Descriptors: Language Tests, Test Construction, Elementary School Students, Metalinguistics
Kim, Sooyeon; Walker, Michael – ETS Research Report Series, 2021
In this investigation, we used real data to assess potential differential effects associated with taking a test in a test center (TC) versus testing at home using remote proctoring (RP). We used a pseudo-equivalent groups (PEG) approach to examine group equivalence at the item level and the total score level. If our assumption holds that the PEG…
Descriptors: Testing, Distance Education, Comparative Analysis, Test Items
Yoo Jeong Jang – ProQuest LLC, 2022
Despite the increasing demand for diagnostic information, observed subscores have been often reported to lack adequate psychometric qualities such as reliability, distinctiveness, and validity. Therefore, several statistical techniques based on CTT and IRT frameworks have been proposed to improve the quality of subscores. More recently, DCM has…
Descriptors: Classification, Accuracy, Item Response Theory, Correlation
Bacon, Terrence E. – ProQuest LLC, 2023
The purpose of this study was to investigate developmental music aptitude with a broader sample in order to propose national norms. Research questions were: 1) To what extent are published Primary Measures of Music Aptitude (PMMA) norms different from those established using a current sample? 2) Are there comparative differences in PMMA item…
Descriptors: Psychometrics, Music, Aptitude Tests, Test Items
Rosemary Erlam; Lan Wei – Language Teaching Research, 2024
This study is a conceptual replication of Ellis' 'Measuring implicit and explicit knowledge of a second language: A psychometric study', published in "Studies in Second Language Acquisition" (2005), aiming to establish the importance of including belief statements (hypothesized to increase processing demands) in the design of Elicited…
Descriptors: Language Processing, Language Tests, Second Language Learning, Psychometrics
Musa Adekunle Ayanwale – Discover Education, 2023
Examination scores obtained by students from the West African Examinations Council (WAEC), and National Business and Technical Examinations Board (NABTEB) may not be directly comparable due to differences in examination administration, item characteristics of the subject in question, and student abilities. For more accurate comparisons, scores…
Descriptors: Equated Scores, Mathematics Tests, Test Items, Test Format
Deribo, Tobias; Goldhammer, Frank; Kroehne, Ulf – Educational and Psychological Measurement, 2023
As researchers in the social sciences, we are often interested in studying not directly observable constructs through assessments and questionnaires. But even in a well-designed and well-implemented study, rapid-guessing behavior may occur. Under rapid-guessing behavior, a task is skimmed shortly but not read and engaged with in-depth. Hence, a…
Descriptors: Reaction Time, Guessing (Tests), Behavior Patterns, Bias
Aborisade, Olatunbosun James; Fajobi, Olutoyin Olufunke – Educational Research and Reviews, 2020
West Africa Examination Council (WAEC) and National Examination Council (NECO) are the two major examination bodies saddled with the responsibility of awarding Senior Secondary School Certificate in Nigeria. This study examined the comparability of the psychometric properties of the items constructed by the two examination bodies using Item…
Descriptors: Foreign Countries, Mathematics Tests, Psychometrics, Test Items
Lenhard, Wolfgang; Lenhard, Alexandra – Educational and Psychological Measurement, 2021
The interpretation of psychometric test results is usually based on norm scores. We compared semiparametric continuous norming (SPCN) with conventional norming methods by simulating results for test scales with different item numbers and difficulties via an item response theory approach. Subsequently, we modeled the norm scores based on random…
Descriptors: Test Norms, Scores, Regression (Statistics), Test Items
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
New Meridian Corporation, 2020
New Meridian Corporation has developed the "Quality Testing Standards and Criteria for Comparability Claims" (QTS). The goal of the QTS is to provide guidance to states that are interested in including content from the New Meridian item bank and intend to make comparability claims with "other assessments" that include New…
Descriptors: Testing, Standards, Comparative Analysis, Guidelines
Afsharrad, Mohammad; Pishghadam, Reza; Baghaei, Purya – International Journal of Language Testing, 2023
Testing organizations are faced with increasing demand to provide subscores in addition to the total test score. However, psychometricians argue that most subscores do not have added value to be worth reporting. To have added value, subscores need to meet a number of criteria: they should be reliable, distinctive, and distinct from each other and…
Descriptors: Comparative Analysis, Scores, Value Added Models, Psychometrics
Galeoto, Giovanni; D'Elpidio, Giuliana; Alvaro, Rosaria; Zicari, Anna Maria; Valente, Donatella; Riccio, Marianna – International Association for Development of the Information Society, 2021
The Italian Disciplinary section of Test of Competences (TECO-D) project is an important longitudinal study used to analyze learning outcomes of ungraded students and to measure quality of the educational process. The aim of the present study was to evaluate the psychometric properties of the TECO-D in students enrolled in the Bachelor's Degree in…
Descriptors: Case Studies, Nursing Education, Psychometrics, Longitudinal Studies
Storme, Martin; Myszkowski, Nils; Baron, Simon; Bernard, David – Journal of Intelligence, 2019
Assessing job applicants' general mental ability online poses psychometric challenges due to the necessity of having brief but accurate tests. Recent research (Myszkowski & Storme, 2018) suggests that recovering distractor information through Nested Logit Models (NLM; Suh & Bolt, 2010) increases the reliability of ability estimates in…
Descriptors: Intelligence Tests, Item Response Theory, Comparative Analysis, Test Reliability
Türkoguz, Suat – Anatolian Journal of Education, 2020
This study aimed to investigate the item "Response Time Fidelity scores" ("RTFs"), "KuderRichardson Reliability" ("KR[subscript 20]") and "Cronbach's Alpha Reliability" ("alpha") coefficients, calculate "KR[subscript 20]" coefficients with "RTFs" for 30 threshold…
Descriptors: Comparative Analysis, Reaction Time, Multiple Choice Tests, Scores