NotesFAQContact Us
Collection
Advanced
Search Tips
Audience
Laws, Policies, & Programs
What Works Clearinghouse Rating
Showing all 10 results Save | Export
Peer reviewed Peer reviewed
PDF on ERIC Download full text
Mahmut Sami Yigiter – Journal of Theoretical Educational Science, 2024
One of the main objectives of international large-scale assessments is to make comparisons between different countries, education policies, education systems, or subgroups. One of the main criteria for making comparisons between different groups is to ensure measurement invariance. The purpose of this study was to test the measurement invariance…
Descriptors: Mathematics, Mathematics Skills, Grade 4, Grade 8
Peer reviewed Peer reviewed
Direct linkDirect link
Yasuda, Jun-ichiro; Mae, Naohiro; Hull, Michael M.; Taniguchi, Masa-aki – Physical Review Physics Education Research, 2021
As a method to shorten the test time of the Force Concept Inventory (FCI), we suggest the use of computerized adaptive testing (CAT). CAT is the process of administering a test on a computer, with items (i.e., questions) selected based upon the responses of the examinee to prior items. In so doing, the test length can be significantly shortened.…
Descriptors: Foreign Countries, College Students, Student Evaluation, Computer Assisted Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Takeda, Kazuya; Tanabe, Shigeo; Koyama, Soichiro; Nagai, Tomoko; Sakurai, Hiroaki; Kanada, Yoshikiyo; Shomoto, Koji – Measurement in Physical Education and Exercise Science, 2018
The aim of this study was to clarify the intra- and inter-rater reliability of the rate of force development in hip abductor muscle force measurements using a hand-held dynamometer. Thirty healthy adults were separately assessed by two independent raters on two separate days. Rate of force development was calculated from the slope of the…
Descriptors: Interrater Reliability, Human Body, Measurement Equipment, Handheld Devices
Peer reviewed Peer reviewed
Direct linkDirect link
Someki, Fumio; Ohnishi, Masafumi; Vejdemo-Johansson, Mikael; Nakamura, Kazuhiko – Journal of Psychoeducational Assessment, 2020
To examine reliability, validity, factor structure, and measurement invariance (i.e., configural, metric, and scalar invariance) of the Japanese Conners' Adult attention deficit hyperactivity disorder (ADHD) Rating Scales (CAARS), Japanese nonclinical adults (N = 786) completed the CAARS Self-Report (CAARS-S). Each participant was also rated by…
Descriptors: Attention Deficit Hyperactivity Disorder, Rating Scales, Foreign Countries, Test Reliability
Peer reviewed Peer reviewed
Direct linkDirect link
Sciffer, Michael G.; Perry, Laura B.; McConney, Andrew – British Journal of Sociology of Education, 2020
School socio-economic compositional (SEC) effects have been influential in educational research predicting a range of outcomes and influencing public policy. However, some recent studies have challenged the veracity of SEC effects when applying residualised-change and fixed effects models and simulating potential measurement errors in hierarchical…
Descriptors: School Demography, Socioeconomic Status, Socioeconomic Influences, Context Effect
Peer reviewed Peer reviewed
Direct linkDirect link
Hampf, Franziska; Wiederhold, Simon; Woessmann, Ludger – Large-scale Assessments in Education, 2017
Ample evidence indicates that a person's human capital is important for success on the labor market in terms of both wages and employment prospects. However, unlike the efforts to identify the impact of school attainment on labor-market outcomes, the literature on returns to cognitive skills has not yet provided convincing evidence that the…
Descriptors: Outcomes of Education, Human Capital, Labor Market, Income
Peer reviewed Peer reviewed
Direct linkDirect link
Holster, Trevor A.; Lake, J. – Language Assessment Quarterly, 2016
Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…
Descriptors: Guessing (Tests), Item Response Theory, Vocabulary, Language Tests
Peer reviewed Peer reviewed
Direct linkDirect link
Suzuki, Yuichi – Language Testing, 2015
Self-assessment has been used to assess second language proficiency; however, as sources of measurement errors vary, they may threaten the validity and reliability of the tools. The present paper investigated the role of experiences in using Japanese as a second language in the naturalistic acquisition context on the accuracy of the…
Descriptors: Self Evaluation (Individuals), Error of Measurement, Japanese, Second Language Learning
Peer reviewed Peer reviewed
Direct linkDirect link
McLean, Stuart; Kramer, Brandon; Beglar, David – Language Teaching Research, 2015
An important gap in the field of second language vocabulary assessment concerns the lack of validated tests measuring aural vocabulary knowledge. The primary purpose of this study is to introduce and provide preliminary validity evidence for the Listening Vocabulary Levels Test (LVLT), which has been designed as a diagnostic tool to measure…
Descriptors: Test Construction, Test Validity, English (Second Language), Second Language Learning
Peer reviewed Peer reviewed
Direct linkDirect link
Kwon, Hyungil Harry; Pyun, Do Young; Han, Siwan; Ogasawara, Etsuko – Asia Pacific Journal of Education, 2011
The objective of this study was to provide empirical evidence to support psychometric properties of a modified four-dimensional model of the Leadership Scale for Sports (LSS). The study tested invariance of all parameters (i.e., factor loadings, error variances, and factor variances-covariances) in the four-dimensional measurement model between…
Descriptors: Feedback (Response), Testing, Athletes, Factor Structure