Showing 1 to 15 of 17 results
Peer reviewed
Liou, Gloria; Bonner, Cavan V.; Tay, Louis – International Journal of Testing, 2022
With the advent of big data and advances in technology, psychological assessments have become increasingly sophisticated and complex. Nevertheless, traditional psychometric issues concerning the validity, reliability, and measurement bias of such assessments remain fundamental in determining whether score inferences of human attributes are…
Descriptors: Psychometrics, Computer Assisted Testing, Adaptive Testing, Data
Peer reviewed
D. Steger; S. Weiss; O. Wilhelm – Creativity Research Journal, 2023
Creativity can be measured with a variety of methods including self-reports, others' reports, and ability tests. While typical self-reports are best understood as weak proxies of creativity, biographical reports that assess previous creative activities seem more promising. Drawbacks of such measures -- including skewed item distributions, a lack of…
Descriptors: Creativity, Creativity Tests, Test Construction, Algorithms
Peer reviewed
Wim J. van der Linden; Luping Niu; Seung W. Choi – Journal of Educational and Behavioral Statistics, 2024
A test battery with two different levels of adaptation is presented: a within-subtest level for the selection of the items in the subtests and a between-subtest level to move from one subtest to the next. The battery runs on a two-level model consisting of a regular response model for each of the subtests extended with a second level for the joint…
Descriptors: Adaptive Testing, Test Construction, Test Format, Test Reliability
Peer reviewed
PDF on ERIC Download full text
Kiyici, Gülbin; Kahraman, Nurcan – Science Insights Education Frontiers, 2022
This study aims to analyze the reliability generalization of the computational thinking scale. There are five dimensions of computational thinking: creativity, algorithmic thinking, cooperativity, critical thinking, and problem-solving. A Bonett transformation was used to standardize the reliability coefficient of Cronbach's alpha. A…
Descriptors: Meta Analysis, Generalization, Computation, Thinking Skills
Peer reviewed
PDF on ERIC Download full text
Emre Zengin; Yasemin Karal – International Journal of Assessment Tools in Education, 2024
This study was carried out to develop a test to assess algorithmic thinking skills. To this end, the twelve steps suggested by Downing (2006) were adopted. Throughout the test development, 24 middle school sixth-grade students and eight experts in different areas took part as needed in the tasks on the project. The test was given to 252 students…
Descriptors: Grade 6, Algorithms, Thinking Skills, Evaluation Methods
Peer reviewed
Wang, Yu; Chiu, Chia-Yi; Köhn, Hans Friedrich – Journal of Educational and Behavioral Statistics, 2023
The multiple-choice (MC) item format has been widely used in educational assessments across diverse content domains. MC items purportedly allow for collecting richer diagnostic information. The effectiveness and economy of administering MC items may have further contributed to their popularity not just in educational assessment. The MC item format…
Descriptors: Multiple Choice Tests, Nonparametric Statistics, Test Format, Educational Assessment
Peer reviewed
Silvia Wen-Yu Lee; Jyh-Chong Liang; Chung-Yuan Hsu; Meng-Jung Tsai – Interactive Learning Environments, 2024
While research has shown that students' epistemic beliefs can be a strong predictor of their academic performance, cognitive abilities, or self-efficacy, studies of this topic in computer education are rare. The purpose of this study was twofold. First, it aimed to validate a newly developed questionnaire for measuring students' epistemic beliefs…
Descriptors: Student Attitudes, Beliefs, Computer Science Education, Programming
Peer reviewed
PDF on ERIC Download full text
Heng Lu – PASAA: Journal of Language Teaching and Learning in Thailand, 2023
This test review examines the Duolingo English Test (DET), an alternative online English proficiency test that is machine-driven. The review covers essential information about the DET such as test purpose, usage, score mapping to the CEFR scale, price, and publisher. The test's usefulness is then discussed, with a focus on reliability,…
Descriptors: Computer Software, Computer Assisted Instruction, Second Language Learning, Second Language Instruction
Peer reviewed
ten Berge, Jos M. F.; And Others – Psychometrika, 1981
Several algorithms for computing the greatest lower bound to reliability or the constrained minimum-trace communality solution in factor analysis have been developed. The convergence properties of these methods are examined. A uniqueness proof for the desired solution is offered. (Author/JKS)
Descriptors: Algorithms, Factor Analysis, Test Reliability
Peer reviewed
van der Linden, Wim J.; Boekkooi-Timminga, Ellen – Applied Psychological Measurement, 1988
Gulliksen's matched random subtests method is a graphical method to split a test into parallel test halves, allowing maximization of coefficient alpha as a lower bound to the classical test reliability coefficient. This problem is formulated as a zero-one programing problem solvable by algorithms that already exist. (TJH)
Descriptors: Algorithms, Equations (Mathematics), Programing, Test Reliability
Peer reviewed
Shapiro, Alexander – Psychometrika, 1982
Minimum trace factor analysis has been used to find the greatest lower bound to reliability. This technique, however, fails to be scale free. A solution to the scale problem is proposed through the maximization of the greatest lower bound as the function of weights. (Author/JKS)
Descriptors: Algorithms, Estimation (Mathematics), Factor Analysis, Psychometrics
Peer reviewed
Zimmerman, Donald W.; Williams, Richard H. – Psychometrika, 1982
Formulas for the standard error of measurement of three measures of change (simple differences; residualized difference scores; and a measure introduced by Tucker, Damarin, and Messick) are derived. A practical guide for determining the relative error of the three measures is developed. (Author/JKS)
Descriptors: Achievement Gains, Algorithms, Differences, Error of Measurement
Peer reviewed
Wackerly, D. D.; Robinson, D. H. – Psychometrika, 1983
A statistical method for testing the agreement between a judge's assessment of an object or subject and a known standard is developed and shown to be superior to two other methods which appear in the literature. (Author/JKS)
Descriptors: Algorithms, Computer Programs, Judges, Measurement Techniques
Peer reviewed
Tatsuoka, Kikumi K.; Tatsuoka, Maurice M. – Journal of Educational Measurement, 1983
This study introduces the individual consistency index (ICI), which measures the extent to which patterns of responses to parallel sets of items remain consistent over time. ICI is used as an error diagnostic tool to detect aberrant response patterns resulting from the consistent application of erroneous rules of operation. (Author/PN)
Descriptors: Achievement Tests, Algorithms, Error Patterns, Measurement Techniques
Peer reviewed
Armstrong, Ronald D.; And Others – Journal of Educational Statistics, 1994
A network-flow model is formulated for constructing parallel tests based on classical test theory while using test reliability as the criterion. Practitioners can specify a test-difficulty distribution for values of item difficulties as well as test-composition requirements. An empirical study illustrates the reliability of generated tests. (SLD)
Descriptors: Algorithms, Computer Assisted Testing, Difficulty Level, Item Banks