Publication Date
| In 2026 | 2 |
| Since 2025 | 441 |
| Since 2022 (last 5 years) | 1920 |
| Since 2017 (last 10 years) | 4492 |
| Since 2007 (last 20 years) | 6977 |
Audience
| Researchers | 454 |
| Practitioners | 319 |
| Teachers | 128 |
| Administrators | 73 |
| Policymakers | 33 |
| Counselors | 31 |
| Students | 17 |
| Parents | 10 |
| Community | 6 |
| Support Staff | 5 |
Location
| Turkey | 831 |
| Australia | 239 |
| China | 211 |
| Canada | 207 |
| Indonesia | 161 |
| Spain | 129 |
| United States | 123 |
| United Kingdom | 121 |
| Germany | 111 |
| Taiwan | 108 |
| Netherlands | 102 |
What Works Clearinghouse Rating
| Meets WWC Standards without Reservations | 2 |
| Meets WWC Standards with or without Reservations | 2 |
| Does not meet standards | 1 |
Simpson, Lucy; Baird, Jo-Anne – Oxford Review of Education, 2013
Over recent years, the credibility of public examinations in England has increasingly come to the fore. Government agencies have invested time and money into researching public perceptions of the reliability and validity of examinations. Whilst such research overlaps into the conceptual domain of trust, trust in examinations remains an elusive…
Descriptors: Foreign Countries, Exit Examinations, Trust (Psychology), Test Reliability
Gardner, John – Oxford Review of Education, 2013
Evidence from recent research suggests that in the UK the public perception of errors in national examinations is that they are simply mistakes; events that are preventable. This perception predominates over the more sophisticated technical view that errors arise from many sources and create an inevitable variability in assessment outcomes. The…
Descriptors: Educational Assessment, Public Opinion, Error of Measurement, Foreign Countries
Freed, Mark R. – ProQuest LLC, 2013
There were three primary purposes for this study. First, I investigated the psychometric properties of the Sources of Middle School Mathematics Self-Efficacy (SMMSE) scale (Usher, 2007; Usher & Pajares, 2009) with high school students. Validation for expanded use of the SMMSE scale was achieved by assessing the instrument's psychometric…
Descriptors: Middle School Students, High School Students, Mathematics, Self Efficacy
Brown, Corina E. – ProQuest LLC, 2013
This two-stage study focused on the undergraduate nursing course that covers topics in general, organic, and biological (GOB) chemistry. In the first stage, the central objective was to identify the main concepts of GOB chemistry relevant to the clinical practice of nursing. The collection of data was based on open-ended interviews of both nursing…
Descriptors: Nursing Education, Test Construction, Psychometrics, Organic Chemistry
Steed, Elizabeth A.; Webb, Mi-young L. – Journal of Positive Behavior Interventions, 2013
This report documents the reliability and validity of scores on the Preschool-Wide Evaluation Tool (PreSET), an assessment used to measure program-wide implementation of the universal level of positive behavior interventions and support (PBIS) in early childhood settings. Initial analyses of descriptive statistics, item, subscale, and total…
Descriptors: Psychometrics, Preschool Evaluation, Student Behavior, Intervention
Veronese, Guido; Pepe, Alessandro – Research on Social Work Practice, 2013
Objective: Professional social workers and emergency workers operating in war contexts may develop posttraumatic stress disorder (PTSD) following exposure to traumatic events. Impact of trauma must be accurately assessed by researchers via robust models of measurement. In this article, measurement models for the 13-item Children's Revised Impact…
Descriptors: Violence, Military Personnel, Military Service, Posttraumatic Stress Disorder
Li, Yanmei; Li, Shuhong; Wang, Lin – Educational Testing Service, 2010
Many standardized educational tests include groups of items based on a common stimulus, known as "testlets". Standard unidimensional item response theory (IRT) models are commonly used to model examinees' responses to testlet items. However, it is known that local dependence among testlet items can lead to biased item parameter estimates…
Descriptors: English, Language Tests, Reading Tests, Item Response Theory
Porter, Andrew C.; Polikoff, Morgan S.; Goldring, Ellen B.; Murphy, Joseph; Elliott, Stephen N.; May, Henry – Elementary School Journal, 2010
The Vanderbilt Assessment of Leadership in Education (VAL-ED) is a multirater assessment of principals' learning-centered leadership. The instrument was developed based on the Standards for Educational and Psychological Testing. In this article, we report on the validity and reliability evidence for the VAL-ED accumulated in a national field…
Descriptors: Psychological Testing, Test Validity, Leadership, Principals
Bechger, Timo M.; Maris, Gunter; Hsiao, Ya Ping – Applied Psychological Measurement, 2010
The main purpose of this article is to demonstrate how halo effects may be detected and quantified using two independent ratings of the same person. A practical illustration is given to show how halo effects can be avoided. (Contains 2 tables, 7 figures, and 2 notes.)
Descriptors: Performance Based Assessment, Test Reliability, Test Length, Language Tests
Barker, David H.; Lloyd, Thad Q.; Stewart, Peter K.; Wells, M. Gawain – Journal of Child and Family Studies, 2010
Developing normed treatment outcome measures is important to research addressing treatment effectiveness and to improved clinical care. The Preschool Outcome Questionnaire (POQ) is a new measure designed for use with preschool children aged two to six. Designed in collaboration with parents and clinicians, the POQ is brief, easy to administer,…
Descriptors: Outcomes of Treatment, Predictive Validity, Preschool Children, Measures (Individuals)
Mashaw, Bijan – Decision Sciences Journal of Innovative Education, 2012
As a result of this research, a quantitative model and a procedure have been developed to create an online mentoring effectiveness index (EI). To develop the model, mentoring and teaching effectiveness are defined, and then the constructs and factors of effectiveness are identified. The model's construction is based on the theory that…
Descriptors: Models, Online Courses, Statistical Analysis, Mentors
Kansopon, Venus – Language Testing in Asia, 2012
This study primarily investigated the validity and reliability of the writing assessments and their backwash effects on the undergraduates of the Institute of International Studies, Ramkhamhaeng University (IIS-RU). The English-major students had problems with academic writing skills, especially the non-native English speakers, whose writing ability…
Descriptors: Foreign Countries, Writing Tests, Writing Evaluation, Test Reliability
Goldhaber, Dan; Chaplin, Duncan – Center for Education Data & Research, 2012
In a provocative and influential paper, Jesse Rothstein (2010) finds that standard value added models (VAMs) suggest implausible future teacher effects on past student achievement, a finding that obviously cannot be viewed as causal. This is the basis of a falsification test (the Rothstein falsification test) that appears to indicate bias in VAM…
Descriptors: School Effectiveness, Teacher Effectiveness, Achievement Gains, Statistical Bias
Tosado, Luis Antonio, II – ProQuest LLC, 2012
Two overlapping issues have given rise to this study: the need for assessment instruments to use with Spanish-speaking Latinos and the need for normative data on current and future Spanish-language instruments. Numerous career assessment instruments exist for the English-speaking population. These instruments may be administered on computer-based…
Descriptors: Spanish Speaking, Hispanic Americans, Vocational Evaluation, Feasibility Studies
Ohuakanwa, Chijioke Ephraim; Omeje, Joachim Chinweike; Eskay, Michael – Online Submission, 2012
The study sought to investigate the relationship between pornography addiction and psychosocial and academic adjustment of students in universities in Lagos State. In order to achieve this objective, five research questions were formulated and two hypotheses postulated. The subjects for the study consisted of 616 full-time third-year undergraduate…
Descriptors: Test Reliability, Addictive Behavior, Undergraduate Students, Pornography

