Publication Date
In 2025 | 2 |
Since 2024 | 3 |
Since 2021 (last 5 years) | 9 |
Since 2016 (last 10 years) | 16 |
Since 2006 (last 20 years) | 22 |
Descriptor
Source
Author
AlGhamdi, Hannan M. | 1 |
Ayan, Cansu | 1 |
Ayán-Pérez, Cárlos | 1 |
Bolsinova, Maria | 1 |
Bouzas-Rico, Sara | 1 |
Bowden, Stephen C. | 1 |
Braadbaart, Lieke | 1 |
Casey, Jackie M. | 1 |
Chengyu Cui | 1 |
Chun Wang | 1 |
Cikrikci, Nukhet | 1 |
More ▼ |
Publication Type
Journal Articles | 20 |
Reports - Research | 18 |
Reports - Evaluative | 3 |
Reports - Descriptive | 1 |
Speeches/Meeting Papers | 1 |
Tests/Questionnaires | 1 |
Education Level
Secondary Education | 5 |
Elementary Education | 4 |
Higher Education | 4 |
Postsecondary Education | 4 |
Elementary Secondary Education | 3 |
Audience
Location
Australia | 3 |
Japan | 2 |
Spain | 2 |
Turkey | 2 |
Chile | 1 |
China | 1 |
European Union | 1 |
Germany | 1 |
Ireland | 1 |
Netherlands | 1 |
Netherlands (Amsterdam) | 1 |
More ▼ |
Laws, Policies, & Programs
Assessments and Surveys
Program for International… | 3 |
Trends in International… | 2 |
Big Five Inventory | 1 |
Cognitive Abilities Test | 1 |
Wechsler Memory Scale | 1 |
What Works Clearinghouse Rating
Esra Sözer Boz – Education and Information Technologies, 2025
International large-scale assessments provide cross-national data on students' cognitive and non-cognitive characteristics. A critical methodological issue that often arises in comparing data from cross-national studies is ensuring measurement invariance, indicating that the construct under investigation is the same across the compared groups.…
Descriptors: Achievement Tests, International Assessment, Foreign Countries, Secondary School Students
Nwosu, Kingsley Chinaza; Wahl, W. P.; Hickman, Gregory P.; Ede, Moses Onyemaechi; Nwikpo, Mary Nneka – International Journal of Educational Methodology, 2023
Researchers have recognized the need for updates of test anxiety scales for more measurement accuracy. However, studies that investigated the measurement invariance of the Test Anxiety Inventory (TAI), and identified the latent profiles remain scare not withstanding its wide usage in Nigeria. This might have an impact on how generalizability and…
Descriptors: Test Anxiety, Error of Measurement, Profiles, Measures (Individuals)
How Smart Is My Child? The Judgment Accuracy of Parents Regarding Their Children's Cognitive Ability
Elena Mack; Vsevolod Scherrer; Franzis Preckel – Child Development, 2025
Parents' judgment of their children's cognitive ability is important for providing adequate learning environments. This study examined parents' judgment accuracy with 2346 children (M = 8.94 years; 48.3% girls) and their parents (1283 mothers, 426 fathers, and 637 parental pairs). The data were collected between September 2012 and February 2014 in…
Descriptors: Foreign Countries, Cognitive Ability, Elementary School Students, Parent Attitudes
Martí, Mónica; Ródenas, Carmen – International Journal of Social Research Methodology, 2021
This paper analyses the reliability and accuracy of the relationships between migration and employment status when estimated using a linked data set. The analysis will be carried out using a new source, the "Labour and Geographical Mobility Statistics," which is provided by the Spanish Statistical Office. This statistic is constructed by…
Descriptors: Foreign Countries, Error of Measurement, Occupational Mobility, Migration
Chengyu Cui; Chun Wang; Gongjun Xu – Grantee Submission, 2024
Multidimensional item response theory (MIRT) models have generated increasing interest in the psychometrics literature. Efficient approaches for estimating MIRT models with dichotomous responses have been developed, but constructing an equally efficient and robust algorithm for polytomous models has received limited attention. To address this gap,…
Descriptors: Item Response Theory, Accuracy, Simulation, Psychometrics
Lions, Séverin; Dartnell, Pablo; Toledo, Gabriela; Godoy, María Inés; Córdova, Nora; Jiménez, Daniela; Lemarié, Julie – Educational and Psychological Measurement, 2023
Even though the impact of the position of response options on answers to multiple-choice items has been investigated for decades, it remains debated. Research on this topic is inconclusive, perhaps because too few studies have obtained experimental data from large-sized samples in a real-world context and have manipulated the position of both…
Descriptors: Multiple Choice Tests, Test Items, Item Analysis, Responses
Tsaousis, Ioannis; Sideridis, Georgios D.; AlGhamdi, Hannan M. – Journal of Psychoeducational Assessment, 2021
This study evaluated the psychometric quality of a computerized adaptive testing (CAT) version of the general cognitive ability test (GCAT), using a simulation study protocol put forth by Han, K. T. (2018a). For the needs of the analysis, three different sets of items were generated, providing an item pool of 165 items. Before evaluating the…
Descriptors: Computer Assisted Testing, Adaptive Testing, Cognitive Tests, Cognitive Ability
Kilic, Abdullah Faruk; Dogan, Nuri – International Journal of Assessment Tools in Education, 2021
Weighted least squares (WLS), weighted least squares mean-and-variance-adjusted (WLSMV), unweighted least squares mean-and-variance-adjusted (ULSMV), maximum likelihood (ML), robust maximum likelihood (MLR) and Bayesian estimation methods were compared in mixed item response type data via Monte Carlo simulation. The percentage of polytomous items,…
Descriptors: Factor Analysis, Computation, Least Squares Statistics, Maximum Likelihood Statistics
Tijmstra, Jesper; Bolsinova, Maria; Liaw, Yuan-Ling; Rutkowski, Leslie; Rutkowski, David – Journal of Educational Measurement, 2020
Although the root-mean squared deviation (RMSD) is a popular statistical measure for evaluating country-specific item-level misfit (i.e., differential item functioning [DIF]) in international large-scale assessment, this paper shows that its sensitivity to detect misfit may depend strongly on the proficiency distribution of the considered…
Descriptors: Test Items, Goodness of Fit, Probability, Accuracy
Cikrikci, Nukhet; Yalcin, Seher; Kalender, Ilker; Gul, Emrah; Ayan, Cansu; Uyumaz, Gizem; Sahin-Kursad, Merve; Kamis, Omer – International Journal of Assessment Tools in Education, 2020
This study tested the applicability of the theoretical Examination for Candidates of Driving License (ECODL) in Turkey as a computerized adaptive test (CAT). Firstly, various simulation conditions were tested for the live CAT through an item response theory-based calibrated item bank. The application of the simulated CAT was based on data from…
Descriptors: Motor Vehicles, Traffic Safety, Computer Assisted Testing, Item Response Theory
Sachse, Karoline A.; Haag, Nicole – Applied Measurement in Education, 2017
Standard errors computed according to the operational practices of international large-scale assessment studies such as the Programme for International Student Assessment's (PISA) or the Trends in International Mathematics and Science Study (TIMSS) may be biased when cross-national differential item functioning (DIF) and item parameter drift are…
Descriptors: Error of Measurement, Test Bias, International Assessment, Computation
Yasuda, Jun-ichiro; Mae, Naohiro; Hull, Michael M.; Taniguchi, Masa-aki – Physical Review Physics Education Research, 2021
As a method to shorten the test time of the Force Concept Inventory (FCI), we suggest the use of computerized adaptive testing (CAT). CAT is the process of administering a test on a computer, with items (i.e., questions) selected based upon the responses of the examinee to prior items. In so doing, the test length can be significantly shortened.…
Descriptors: Foreign Countries, College Students, Student Evaluation, Computer Assisted Testing
Marti´nez-Lemos, R. I.; Ayán-Pérez, Cárlos; Bouzas-Rico, Sara – International Journal of Developmental Disabilities, 2019
Objectives: The main objective was to identify the test-retest reliability of the Wii Balance Board (WBB) for assessing standing balance when administered to a population of people with intellectual disability (ID). A secondary objective was to provide information regarding the reliability of the WBB, taking into account the severity of cognitive…
Descriptors: Test Reliability, Human Posture, Psychomotor Skills, Mild Intellectual Disability
van Kernebeek, Willem G.; de Schipper, Antoine W.; Savelsbergh, Geert J. P.; Toussaint, Huub M. – Measurement in Physical Education and Exercise Science, 2018
In The Netherlands, the 4-Skills Scan is an instrument for physical education teachers to assess gross motor skills of elementary school children. Little is known about its reliability. Therefore, in this study the test-retest and inter-rater reliability was determined. Respectively, 624 and 557 Dutch 6- to 12-year-old children were analyzed for…
Descriptors: Foreign Countries, Interrater Reliability, Pretests Posttests, Psychomotor Skills
Klausch, Thomas; Schouten, Barry; Hox, Joop J. – Sociological Methods & Research, 2017
This study evaluated three types of bias--total, measurement, and selection bias (SB)--in three sequential mixed-mode designs of the Dutch Crime Victimization Survey: telephone, mail, and web, where nonrespondents were followed up face-to-face (F2F). In the absence of true scores, all biases were estimated as mode effects against two different…
Descriptors: Evaluation Methods, Statistical Bias, Sequential Approach, Benchmarking
Previous Page | Next Page »
Pages: 1 | 2