Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
Briggs, Derek C. – Measurement: Interdisciplinary Research and Perspectives, 2010
The use of large-scale assessments for making high stakes inferences about students and the schools in which they are situated is premised on the assumption that tests are sensitive to good instruction. An increase in the quality of classroom instruction should cause, on the average, an increase in test scores. In work with a number of colleagues…
Descriptors: Measurement, High Stakes Tests, Inferences, Scores
Raker, Jeffrey R.; Towns, Marcy H. – Chemistry Education Research and Practice, 2010
Investigations of the problem types used in college-level general chemistry examinations have been reported in this Journal and were first reported in the "Journal of Chemical Education" in 1924. This study extends the findings from general chemistry to the problems of four college-level organic chemistry courses. Three problem…
Descriptors: Benchmarking, Organic Chemistry, Science Instruction, College Science
Ip, Edward H. – Applied Psychological Measurement, 2010
The testlet response model is designed for handling items that are clustered, such as those embedded within the same reading passage. Although the testlet is a powerful tool for handling item clusters in educational and psychological testing, the interpretations of its item parameters, the conditional correlation between item pairs, and the…
Descriptors: Item Response Theory, Models, Test Items, Correlation
Kim, Sooyeon; Livingston, Samuel A. – Journal of Educational Measurement, 2010
Score equating based on small samples of examinees is often inaccurate for the examinee populations. We conducted a series of resampling studies to investigate the accuracy of five methods of equating in a common-item design. The methods were chained equipercentile equating of smoothed distributions, chained linear equating, chained mean equating,…
Descriptors: Equated Scores, Test Items, Item Sampling, Item Response Theory
Revuelta, Javier – Psychometrika, 2010
A comprehensive analysis of difficulty for multiple-choice items requires information at different levels: the test, the items, and the alternatives. This paper introduces a new parameterization of the nominal categories model (NCM) for analyzing difficulty at these three levels. The new parameterization is referred to as the NE-NCM and is…
Descriptors: Classification, Short Term Memory, Multiple Choice Tests, Test Items
Haberman, Shelby J.; Sinharay, Sandip – Psychometrika, 2010
Recently, there has been increasing interest in reporting subscores. This paper examines reporting of subscores using multidimensional item response theory (MIRT) models (e.g., Reckase in "Appl. Psychol. Meas." 21:25-36, 1997; C.R. Rao and S. Sinharay (Eds), "Handbook of Statistics, vol. 26," pp. 607-642, North-Holland, Amsterdam, 2007; Beguin &…
Descriptors: Item Response Theory, Psychometrics, Statistical Analysis, Scores
Hooker, Giles; Finkelman, Matthew – Psychometrika, 2010
Hooker, Finkelman, and Schwartzman ("Psychometrika," 2009, in press) defined a paradoxical result as the attainment of a higher test score by changing answers from correct to incorrect and demonstrated that such results are unavoidable for maximum likelihood estimates in multidimensional item response theory. The potential for these results to…
Descriptors: Models, Scores, Item Response Theory, Psychometrics
Altun, Halis; Korkmaz, Özgen – Online Submission, 2012
The aim of this study is to adapt the Cooperative Learning Attitude Scale into Turkish and determine engineering students' attitudes towards the cooperative learning. The study is based on the descriptive scanning model. The study group consists of 466 engineering students. The validity of the scale is confirmed through exploration factor analysis…
Descriptors: Foreign Countries, Cooperative Learning, Attitude Measures, Engineering Education
Ho, Siew Yin; Lowrie, Tom – Mathematics Education Research Group of Australasia, 2012
This study describes Singapore students' (N = 607) performance on a recently developed Mathematics Processing Instrument (MPI). The MPI comprised tasks sourced from Australia's NAPLAN and Singapore's PSLE. In addition, the MPI had a corresponding question which encouraged students to describe how they solved the respective tasks. In particular,…
Descriptors: Foreign Countries, Academic Achievement, National Competency Tests, Mathematics Tests
Louisiana Department of Education, 2012
"Louisiana Believes” embraces the principle that all children can achieve at high levels, as evidenced in Louisiana's recent adoption of the Common Core State Standards (CCSS). "Louisiana Believes" also promotes the idea that Louisiana's educators should be empowered to make decisions to support the success of their students. In…
Descriptors: Student Evaluation, Achievement Tests, Grade 8, English
Taylor, Catherine S.; Lee, Yoonsun – Applied Measurement in Education, 2012
This was a study of differential item functioning (DIF) for grades 4, 7, and 10 reading and mathematics items from state criterion-referenced tests. The tests were composed of multiple-choice and constructed-response items. Gender DIF was investigated using POLYSIBTEST and a Rasch procedure. The Rasch procedure flagged more items for DIF than did…
Descriptors: Test Bias, Gender Differences, Reading Tests, Mathematics Tests
Pae, Hye K. – Educational Research and Evaluation, 2012
The aim of this study was to apply Rasch modeling to an examination of the psychometric properties of the "Pearson Test of English Academic" (PTE Academic). Analyzed were 140 test-takers' scores derived from the PTE Academic database. The mean age of the participants was 26.45 (SD = 5.82), ranging from 17 to 46. Conformity of the participants'…
Descriptors: Reliability, Second Language Learning, Field Tests, Psychometrics
Stairs, Agnes M.; Smith, Gregory T.; Zapolski, Tamika C. B.; Combs, Jessica L.; Settles, Regan E. – Assessment, 2012
The construct of perfectionism is related to many important outcome variables. However, the term "perfectionism" has been defined in many different ways, and items comprising the different existing scales appear to be very different in content. The overarching aim of the present set of studies was to help clarify the specific…
Descriptors: Factor Structure, Personality Traits, Factor Analysis, Test Construction
Green, Anthony; Hawkey, Roger – Language Testing, 2012
The important yet under-researched role of item writers in the selection and adaptation of texts for high-stakes reading tests is investigated through a case study involving a group of trained item writers working on the International English Language Testing System (IELTS). In the first phase of the study, participants were invited to reflect in…
Descriptors: Test Items, Semantics, Reading Tests, Language Tests
Weng, Ting-Sheng – Journal of Educational Technology Systems, 2012
This research applies multimedia technology to design a dynamic item generation method that can adaptively adjust the difficulty level of items according to the level of the testee. The method is based on interactive testing software developed by Flash Actionscript, and provides a testing solution for users by automatically distributing items of…
Descriptors: Feedback (Response), Difficulty Level, Educational Technology, Educational Games

Peer reviewed
Direct link
