Publication Date
| In 2026 | 0 |
| Since 2025 | 220 |
| Since 2022 (last 5 years) | 1089 |
| Since 2017 (last 10 years) | 2599 |
| Since 2007 (last 20 years) | 4960 |
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
Location
| Turkey | 226 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 66 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Adedokun, Omolola A.; Burgess, Wilella D. – Journal of MultiDisciplinary Evaluation, 2012
Background: Although McNemar Test is the most appropriate tool for analyzing pre-post differences in dichotomous items (e.g., "yes" or "no", "correct" or "incorrect", etc.), many scholars have noted the inappropriate use of Pearson's Chi-square Test by researchers, including social scientists and evaluators,…
Descriptors: Statistical Analysis, Test Items, Pretests Posttests, Hypothesis Testing
Doebler, Anna – Applied Psychological Measurement, 2012
It is shown that deviations of estimated from true values of item difficulty parameters, caused for example by item calibration errors, the neglect of randomness of item difficulty parameters, testlet effects, or rule-based item generation, can lead to systematic bias in point estimation of person parameters in the context of adaptive testing.…
Descriptors: Adaptive Testing, Computer Assisted Testing, Computation, Item Response Theory
Kim, Eun Sook; Yoon, Myeongsun; Lee, Taehun – Educational and Psychological Measurement, 2012
Multiple-indicators multiple-causes (MIMIC) modeling is often used to test a latent group mean difference while assuming the equivalence of factor loadings and intercepts over groups. However, this study demonstrated that MIMIC was insensitive to the presence of factor loading noninvariance, which implies that factor loading invariance should be…
Descriptors: Test Items, Simulation, Testing, Statistical Analysis
Andrich, David; Marais, Ida; Humphry, Stephen – Journal of Educational and Behavioral Statistics, 2012
Andersen (1995, 2002) proves a theorem relating variances of parameter estimates from samples and subsamples and shows its use as an adjunct to standard statistical analyses. The authors show an application where the theorem is central to the hypothesis tested, namely, whether random guessing to multiple choice items affects their estimates in the…
Descriptors: Test Items, Item Response Theory, Multiple Choice Tests, Guessing (Tests)
Lee, HwaYoung; Dodd, Barbara G. – Educational and Psychological Measurement, 2012
This study investigated item exposure control procedures under various combinations of item pool characteristics and ability distributions in computerized adaptive testing based on the partial credit model. Three variables were manipulated: item pool characteristics (120 items for each of easy, medium, and hard item pools), two ability…
Descriptors: Adaptive Testing, Computer Assisted Testing, Item Banks, Ability
Polak, Marike; De Rooij, Mark; Heiser, Willem J. – Multivariate Behavioral Research, 2012
In this article we propose a model-free diagnostic for single-peakedness (unimodality) of item responses. Presuming a unidimensional unfolding scale and a given item ordering, we approximate item response functions of all items based on ordered conditional means (OCM). The proposed OCM methodology is based on Thurstone & Chave's (1929) "criterion…
Descriptors: Item Response Theory, Measures (Individuals), Test Items, Item Analysis
Birnholz, Justin L.; Young, Michael A. – Assessment, 2012
This study assessed whether the Center for Epidemiological Studies Depression Scale (CES-D) functions equivalently in assessing depressive symptom severity in lesbian, bisexual, and heterosexual women. Using differential item functioning methods, the authors examined (a) whether there is a bias in CES-D total scores and in individual item scores…
Descriptors: Test Bias, Measures (Individuals), Depression (Psychology), Severity (of Disability)
Marszalek, Jacob M.; Hamilton, Jessica L. – Measurement and Evaluation in Counseling and Development, 2012
Four maltreatment items were examined from Wave III (N = 13,516) of the National Longitudinal Study of Adolescent Health. Item analysis, confirmatory factor analysis, cross-validation, reliability estimates, and convergent validity coefficients strongly supported the validity of using the four items as a unidimensional composite. Implications for…
Descriptors: Measures (Individuals), Test Items, Validity, Item Analysis
Bonner, Sarah M.; D'Agostino, Jerome V. – Applied Measurement in Education, 2012
We investigated examinees' cognitive processes while they solved selected items from the Multistate Bar Exam (MBE), a high-stakes professional certification examination. We focused on ascertaining those mental processes most frequently used by examinees, and the most common types of errors in their thinking. We compared the relationships between…
Descriptors: Cognitive Processes, Test Items, Problem Solving, Thinking Skills
Novick, Laura R.; Catley, Kefyn M. – International Journal of Science Education, 2012
In a recent article, Nadelson and Southerland (2010. Development and preliminary evaluation of the Measure of Understanding of Macroevolution: Introducing the MUM. "The Journal of Experimental Education", 78, 151-190) reported on their development of a multiple-choice concept inventory intended to assess college students' understanding…
Descriptors: Evidence, Validity, Science Education, College Students
Hong, Huang-Yao; Chiu, Chieh-Hsin – Australasian Journal of Educational Technology, 2016
This study explored how students viewed the role of ideas for knowledge work and how such a view was related to their inquiry activities. Data mainly came from students' online interaction logs, group discussion and inquiry, and a survey concerning the role of ideas for knowledge work. The findings suggest that knowledge building was conducive to…
Descriptors: Technology Uses in Education, Electronic Learning, Educational Technology, Interaction
Zhang, Xijuan; Savalei, Victoria – Educational and Psychological Measurement, 2016
Many psychological scales written in the Likert format include reverse worded (RW) items in order to control acquiescence bias. However, studies have shown that RW items often contaminate the factor structure of the scale by creating one or more method factors. The present study examines an alternative scale format, called the Expanded format,…
Descriptors: Factor Structure, Psychological Testing, Alternative Assessment, Test Items
Ebadi, Saman; Saeedian, Abdulbaset – Teaching English with Technology, 2016
Dynamic Assessment (DA) is a postmodern notion in testing which sees instruction and assessment as inextricably mingled contending that learners will progress if provided with dynamic interactions. The main purpose of the study is to see if the scores generated by the computerized dynamic assessment (C-DA) which is grounded in Vygotsky's…
Descriptors: Instructional Design, Second Language Learning, Second Language Instruction, Postmodernism
Bichi, Ado Abdu; Hafiz, Hadiza; Bello, Samira Abdullahi – International Journal of Evaluation and Research in Education, 2016
High-stakes testing is used for the purposes of providing results that have important consequences. Validity is the cornerstone upon which all measurement systems are built. This study applied the Item Response Theory principles to analyse Northwest University Kano Post-UTME Economics test items. The developed fifty (50) economics test items was…
Descriptors: Item Response Theory, Test Items, Difficulty Level, Statistical Analysis
Boyte, Kenneth J. – TESOL International Journal, 2016
As part of an international effort to develop theory and best practices for teaching languages, the U.S. military has, since the American Revolution, been a leading supporter of literacy education to improve the job performance of soldiers. One important aspect of literacy education today--which continues to be a priority for government agencies,…
Descriptors: Second Language Learning, Second Language Instruction, Recall (Psychology), Teaching Methods

