Publication Date
| In 2026 | 0 |
| Since 2025 | 197 |
| Since 2022 (last 5 years) | 1067 |
| Since 2017 (last 10 years) | 2577 |
| Since 2007 (last 20 years) | 4938 |
Descriptor
Source
Author
Publication Type
Education Level
Audience
| Practitioners | 653 |
| Teachers | 563 |
| Researchers | 250 |
| Students | 201 |
| Administrators | 81 |
| Policymakers | 22 |
| Parents | 17 |
| Counselors | 8 |
| Community | 7 |
| Support Staff | 3 |
| Media Staff | 1 |
| More ▼ | |
Location
| Turkey | 225 |
| Canada | 223 |
| Australia | 155 |
| Germany | 116 |
| United States | 99 |
| China | 90 |
| Florida | 86 |
| Indonesia | 82 |
| Taiwan | 78 |
| United Kingdom | 73 |
| California | 65 |
| More ▼ | |
Laws, Policies, & Programs
Assessments and Surveys
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 4 |
| Meets WWC Standards with or without Reservations | 4 |
| Does not meet standards | 1 |
New Meridian Corporation, 2020
The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics summative assessments in grades 3 through 8 and high school. The ELA/L assessments focus on reading and comprehending a range of sufficiently complex texts independently and…
Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation
New Meridian Corporation, 2020
The purpose of this report is to describe the technical qualities of the 2018-2019 operational administration of the English language arts/literacy (ELA/L) and mathematics assessments in grades 3 through 8 and high school. New Meridian, in coordination with multiple states and vendors, developed an alternate form of the summative assessment to…
Descriptors: Language Arts, Literacy Education, Mathematics Education, Summative Evaluation
Demars, Christine E. – Applied Measurement in Education, 2011
Three types of effects sizes for DIF are described in this exposition: log of the odds-ratio (differences in log-odds), differences in probability-correct, and proportion of variance accounted for. Using these indices involves conceptualizing the degree of DIF in different ways. This integrative review discusses how these measures are impacted in…
Descriptors: Effect Size, Test Bias, Probability, Difficulty Level
Facon, Bruno; Magis, David; Nuchadee, Marie-Laure; De Boeck, Paul – Intelligence, 2011
Standardized tests are used widely in comparative studies of clinical populations, either as dependent or control variables. Yet, one cannot always be sure that the test items measure the same constructs in the groups under study. In the present work, 460 participants with intellectual disability of undifferentiated etiology and 488 typical…
Descriptors: Intelligence Tests, Standardized Tests, Mental Retardation, Children
Ritter, Nicola; Kilinc, Emin; Navruz, Bilgin; Bae, Yunhee – Journal of Psychoeducational Assessment, 2011
This article reviews Test of Nonverbal Intelligence-Fourth Edition (TONI-4), an individually administered instrument created to assess intelligence. The distinguishing characteristic of the TONI-4 is the nonverbal, motor-reduced format that assesses common elements of intelligence without the confounding effects of motor or linguistic skills. The…
Descriptors: Nonverbal Tests, Intelligence Tests, Scoring, Test Items
Braeken, Johan – Psychometrika, 2011
Conditional independence is a fundamental principle in latent variable modeling and item response theory. Violations of this principle, commonly known as local item dependencies, are put in a test information perspective, and sharp bounds on these violations are defined. A modeling approach is proposed that makes use of a mixture representation of…
Descriptors: Test Construction, Item Response Theory, Models, Tests
O'Keefe, Robert D.; Hamer, Lawrence O.; Kemp, Philip R. – Journal of Learning in Higher Education, 2013
Assessment, or better stated, the assurance of student learning, has become a central issue in both the internal and the external evaluations of degree programs offered by colleges and universities. The continual importance of assurance of learning activities within institutions of higher education has generated a growing need to create empirical…
Descriptors: Alignment (Education), Program Evaluation, Course Evaluation, Learning Activities
Wang, Lin; Qian, Jiahe; Lee, Yi-Hsuan – ETS Research Report Series, 2013
The purpose of this study was to evaluate the combined effects of reduced equating sample size and shortened anchor test length on item response theory (IRT)-based linking and equating results. Data from two independent operational forms of a large-scale testing program were used to establish the baseline results for evaluating the results from…
Descriptors: Test Construction, Item Response Theory, Testing Programs, Simulation
Nevid, Jeffrey S.; McClelland, Nate – Journal of Education and Training Studies, 2013
We used a set of action verbs based on Bloom's taxonomy to assess learning outcomes in two college-level introductory psychology courses. The action verbs represented an acronym, IDEA, comprising skills relating to identifying, defining or describing, evaluating or explaining, and applying psychological knowledge. Exam performance demonstrated…
Descriptors: Verbs, Taxonomy, Introductory Courses, Psychology
Solano-Flores, Guillermo; Barnett-Clarke, Carne; Kachchaf, Rachel R. – Educational Assessment, 2013
We examined the performance of English language learners (ELLs) and non-ELLs on Grade 4 and Grade 5 mathematics content knowledge (CK) and academic language (AL) tests. CK and AL items had different semiotic loads (numbers of different types of semiotic features) and different semiotic structures (relative frequencies of different semiotic…
Descriptors: English Language Learners, Performance, Mathematics Tests, Semiotics
Raker, Jeffrey R.; Holme, Thomas A. – Journal of Chemical Education, 2013
Standardized examinations, such as those developed and disseminated by the ACS Examinations Institute, are artifacts of the teaching of a course and over time may provide a historical perspective on how curricula have changed and evolved. This study investigated changes in organic chemistry curricula across a 60-year period by evaluating 18 ACS…
Descriptors: Organic Chemistry, Science Education History, Curriculum Research, Educational Development
Young, Arthur; Shawl, Stephen J. – Astronomy Education Review, 2013
Professors who teach introductory astronomy to students not majoring in science desire them to comprehend the concepts and theories that form the basis of the science. They are usually less concerned about the myriad of
detailed facts and information that accompanies the science. As such, professors prefer to test the students for such…
Descriptors: Multiple Choice Tests, Classification, Astronomy, Introductory Courses
Li, Xueming; Sireci, Stephen G. – Educational and Psychological Measurement, 2013
Validity evidence based on test content is of essential importance in educational testing. One source for such evidence is an alignment study, which helps evaluate the congruence between tested objectives and those specified in the curriculum. However, the results of an alignment study do not always sufficiently capture the degree to which a test…
Descriptors: Content Validity, Multidimensional Scaling, Data Analysis, Educational Testing
Dagostino, Lorraine; Carifio, James; Bauer, Jennifer D. C.; Zhao, Qing – Current Issues in Education, 2013
The review of existing literature suggests that few researchers have adopted cross-language comparisons to explore how cultural background affects the assessment of reading comprehension of students. In this present study, the researchers independently reviewed and rated all the items of two reading comprehension tests translated from Malay into…
Descriptors: Cultural Background, Reading Comprehension, Models, Reading Tests
Taubner, Svenja; Horz, Susanne; Fischer-Kern, Melitta; Doering, Stephan; Buchheim, Anna; Zimmermann, Johannes – Psychological Assessment, 2013
The Reflective Functioning Scale (RFS) was developed to assess individual differences in the ability to mentalize attachment relationships. The RFS assesses mentalization from transcripts of the Adult Attachment Interview (AAI). A global score is given by trained coders on an 11-point scale ranging from antireflective to exceptionally reflective.…
Descriptors: Measures (Individuals), Attachment Behavior, Individual Differences, Adults

Peer reviewed
Direct link
