Publication Date
| In 2026 | 0 |
| Since 2025 | 8 |
| Since 2022 (last 5 years) | 36 |
| Since 2017 (last 10 years) | 115 |
| Since 2007 (last 20 years) | 378 |
Descriptor
| Test Theory | 1166 |
| Test Items | 262 |
| Test Reliability | 252 |
| Test Construction | 246 |
| Test Validity | 245 |
| Psychometrics | 183 |
| Scores | 176 |
| Item Response Theory | 168 |
| Foreign Countries | 160 |
| Item Analysis | 141 |
| Statistical Analysis | 134 |
| More ▼ | |
Source
Author
Publication Type
Education Level
Location
| United States | 17 |
| United Kingdom (England) | 15 |
| Canada | 14 |
| Australia | 13 |
| Turkey | 12 |
| Sweden | 8 |
| United Kingdom | 8 |
| Netherlands | 7 |
| Texas | 7 |
| New York | 6 |
| Taiwan | 6 |
| More ▼ | |
Laws, Policies, & Programs
| No Child Left Behind Act 2001 | 4 |
| Elementary and Secondary… | 3 |
| Individuals with Disabilities… | 3 |
Assessments and Surveys
What Works Clearinghouse Rating
Slaney, Kathleen L.; Maraun, Michael D. – Psychological Methods, 2008
The authors argue that the current state of applied data-based test analytic practice is unstructured and unmethodical due in large part to the fact that there is no clearly specified, widely accepted test analytic framework for judging the performances of particular tests in particular contexts. Drawing from the extant test theory literature,…
Descriptors: Test Theory, Data, Test Validity, Models
Green-Gibson, Andrea – ProQuest LLC, 2011
This mixed, causal-comparative study was an investigation of culture infusion methods and AYP of two different public schools in Chicago, a school that infuses African culture and a school that does not. The purpose of the study was to identify if there was a significant causative relationship between culture infusion methods and Adequate Yearly…
Descriptors: Urban Schools, Public Schools, Correlation, Academic Achievement
Hubley, Anita M.; Zumbo, Bruno D. – Social Indicators Research, 2011
The vast majority of measures have, at their core, a purpose of personal and social change. If test developers and users want measures to have personal and social consequences and impact, then it is critical to consider the consequences and side effects of measurement in the validation process itself. The consequential basis of test interpretation…
Descriptors: Construct Validity, Social Change, Measurement, Test Interpretation
Lee, Young-Sun; Lembke, Erica; Moore, Douglas; Ginsburg, Herbert P.; Pappas, Sandra – Assessment for Effective Intervention, 2012
The present study examined the technical adequacy of curriculum-based measures (CBMs) of early numeracy. Six 1-min early mathematics tasks were administered to 137 kindergarten and first-grade students, along with an omnibus test of early mathematics. The CBM measures included Count Out Loud, Quantity Discrimination, Number Identification, Missing…
Descriptors: Numeracy, Curriculum Based Assessment, Mathematics Tests, Kindergarten
Kane, Michael – Educational Testing Service, 2010
The 12th annual William H. Angoff Memorial Lecture was presented by Dr. Michael T. Kane, ETS's (Educational Testing Service) Samuel J. Messick Chair in Test Validity and the former Director of Research at the National Conference of Bar Examiners. Dr. Kane argues that it is important for policymakers to recognize the impact of errors of measurement…
Descriptors: Error of Measurement, Scores, Public Policy, Test Theory
Darrah, Marjorie; Fuller, Edgar; Miller, David – Journal of Computers in Mathematics and Science Teaching, 2010
This paper discusses a possible solution to a problem frequently encountered by educators seeking to use computer-based or multiple choice-based exams for mathematics. These assessment methodologies force a discrete grading system on students and do not allow for the possibility of partial credit. The research presented in this paper investigates…
Descriptors: College Students, College Mathematics, Calculus, Computer Assisted Testing
Hew, Khe Foon; Cheung, Wing Sum – Electronic Journal of e-Learning, 2012
Contemporary discussions of education in blended-learning environments increasingly emphasize the social nature of learning which emphasizes interactions among students, or among students and instructors. These interactions can occur asynchronously using a text based discussion forum. A text-based discussion forum, however, may not work well for…
Descriptors: Computer Mediated Communication, Discussion Groups, Undergraduate Students, Test Theory
Steinmetz, Jean-Paul; Brunner, Martin; Loarer, Even; Houssemand, Claude – Psychological Assessment, 2010
The Wisconsin Card Sorting Test (WCST) assesses executive and frontal lobe function and can be administered manually or by computer. Despite the widespread application of the 2 versions, the psychometric equivalence of their scores has rarely been evaluated and only a limited set of criteria has been considered. The present experimental study (N =…
Descriptors: Computer Assisted Testing, Psychometrics, Test Theory, Scores
Reeve, Charlie L.; Heggestad, Eric D.; Lievens, Filip – Intelligence, 2009
The assessment of cognitive abilities, whether it is for purposes of basic research or applied decision making, is potentially susceptible to both facilitating and debilitating influences. However, relatively little research has examined the degree to which these factors might moderate the criterion-related validity of cognitive ability tests. To…
Descriptors: Test Anxiety, Familiarity, Cognitive Tests, Test Validity
Audette, Jennifer Gail – ProQuest LLC, 2011
Purpose: International service-learning (ISL) is popular in higher education, and many physical therapy educational programs are adding ISL opportunities to their curricula because doing so aligns with student interest and the increasingly global nature of the profession. The faculty leading these experiences have not been studied. Nearly all…
Descriptors: Group Membership, Higher Education, Teaching Styles, Teacher Characteristics
Caprara, Gian Vittorio; Alessandri, Guido; Eisenberg, Nancy; Kupfer, A.; Steca, Patrizia; Caprara, Maria Giovanna; Yamaguchi, Susumu; Fukuzawa, Ai; Abela, John – Psychological Assessment, 2012
Five studies document the validity of a new 8-item scale designed to measure "positivity," defined as the tendency to view life and experiences with a positive outlook. In the first study (N = 372), the psychometric properties of Positivity Scale (P Scale) were examined in accordance with classical test theory using a large number of…
Descriptors: Validity, Measures (Individuals), Psychological Testing, Test Theory
Lin, Kuan-Cheng; Wei, Yu Che; Hung, Jason C. – International Journal of Distance Education Technologies, 2012
Many studies demonstrate that Digital Game Based Learning (DGBL) can foster learning effect. The purpose of this study is to survey whether the online game in junior high school students can encourage learning effect in Taiwan's History. So, the research applied Interactive Game-based Learning System (IGLS) to junior high history teaching as an…
Descriptors: Academic Achievement, Learning Motivation, Student Attitudes, High School Students
O'Sullivan, Maureen – Psychological Bulletin, 2008
In 2006, C. F. Bond Jr. and B. M. DePaulo provided a meta-analysis of means and concluded that average lie detection accuracy was significantly greater than chance for most people. Now, they have presented an analysis of standard deviations (C. F. Bond Jr. & B. M. DePaulo, 2008), claiming that there are no reliable individual differences in lie…
Descriptors: Deception, Test Theory, Meta Analysis, Individual Differences
van der Linden, Wim J. – Measurement: Interdisciplinary Research and Perspectives, 2010
The traditional way of equating the scores on a new test form X to those on an old form Y is equipercentile equating for a population of examinees. Because the population is likely to change between the two administrations, a popular approach is to equate for a "synthetic population." The authors of the articles in this issue of the…
Descriptors: Test Format, Equated Scores, Population Distribution, Population Trends
Ellis, David P. – ProQuest LLC, 2011
The current version of the International Language Testing Association (ILTA) Guidelines for Practice requires language testers to pretest items before including them on an exam, or when pretesting is not possible, to conduct post-hoc item analysis to ensure any malfunctioning items are excluded from scoring. However, the guidelines are devoid of…
Descriptors: Item Response Theory, High Stakes Tests, College Entrance Examinations, Item Analysis

Peer reviewed
Direct link
