NotesFAQContact Us
Collection
Advanced
Search Tips
Showing 6,541 to 6,555 of 9,530 results Save | Export
Peer reviewed Peer reviewed
Direct linkDirect link
Elliott, Robert; Fox, Christine M.; Beltyukova, Svetlana A.; Stone, Gregory E.; Gunderson, Jennifer; Zhang, Xi – Psychological Assessment, 2006
Rasch analysis was used to illustrate the usefulness of item-level analyses for evaluating a common therapy outcome measure of general clinical distress, the Symptom Checklist-90-Revised (SCL-90-R; Derogatis, 1994). Using complementary therapy research samples, the instrument's 5-point rating scale was found to exceed clients' ability to make…
Descriptors: Therapy, Rating Scales, Item Response Theory, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Swinkels, Sophie H. N.; Dietz, Claudine; van Daalen, Emma; Kerkhof, Ine H. G. M.; van Engeland, Herman; Buitelaar, Jan K. – Journal of Autism and Developmental Disorders, 2006
This article describes the development of a screening instrument for young children. Screening items were tested first in a non-selected population of children aged 8-20 months (n = 478). Then, parents of children with clinically diagnosed ASD (n = 153, average age 87 months) or ADHD (n = 76, average age 112 months) were asked to score the items…
Descriptors: Pervasive Developmental Disorders, Autism, Questionnaires, Testing
Peer reviewed Peer reviewed
Direct linkDirect link
Marks, Anthony M.; Cronje, Johannes C. – Educational Technology & Society, 2008
Computer-based assessments are becoming more commonplace, perhaps as a necessity for faculty to cope with large class sizes. These tests often occur in large computer testing venues in which test security may be compromised. In an attempt to limit the likelihood of cheating in such venues, randomised presentation of items is automatically…
Descriptors: Educational Assessment, Educational Testing, Research Needs, Test Items
Peer reviewed Peer reviewed
Direct linkDirect link
Rahimi, Mohammad – Reading Matrix: An International Online Journal, 2007
Test method facet has been considered as an important factor affecting the testee's performance on a test. That is, a test used to assess a particular ability would yield different results when different test methods are used to gauge the same trait. The language of presentation is an aspect of test method conceived of as affecting the performance…
Descriptors: Reading Comprehension, Second Language Learning, Language Proficiency, Indo European Languages
Fisher, Douglas; Frey, Nancy – Association for Supervision and Curriculum Development, 2007
If you ever have students who are reluctant to tell you when they don't understand something--or worse, tell you they understand when they really don't--then here's a book that gives you lots of ways to check for understanding. Learn why typical methods to check for understanding are usually ineffective. And explore formative assessment techniques…
Descriptors: Test Items, Student Evaluation, Student Reaction, Formative Evaluation
Lang, W. Steve; Chew, Alex L.; Crownover, Carol; Wilkerson, Judy R. – Online Submission, 2007
Determining the cross-cultural equivalence of multilingual tests is a challenge that is more complex than simple horizontal equating of test forms. This study examines the functioning of a trilingual test of preschool readiness to determine the equivalence. Different forms of the test have previously been examined using classical statistical…
Descriptors: Multilingualism, Reading Readiness Tests, Item Analysis, Item Response Theory
Kobrin, Jennifer L.; Melican, Gerald J. – College Board, 2007
This report synthesizes the research to date addressing the construct comparability of the SAT Reasoning Test and prior SAT I: Reasoning Test and the series of research studies addressing the equatability and subpopulation invariance of the SAT and SAT I.
Descriptors: College Entrance Examinations, Logical Thinking, Thinking Skills, Scores
Peer reviewed Peer reviewed
Direct linkDirect link
Abbott, Marilyn L. – Language Testing, 2007
In this article, I describe a practical application of the Roussos and Stout (1996) multidimensional analysis framework for interpreting group performance differences on an ESL reading proficiency test. Although a variety of statistical methods have been developed for flagging test items that function differentially for equal ability examinees…
Descriptors: Test Bias, Test Items, English (Second Language), Second Language Learning
Ferrara, Steven; And Others – 1995
A study was conducted to begin a process of validating hypothesized causes of local item dependence (LID) in large-scale performance assessments. Data for the study are item level scores from 26 science tasks from the 1993 edition of the Maryland School Performance Assessment Program. Causes of high LID were hypothesized from studies by Ferrara et…
Descriptors: Educational Assessment, Hands on Science, Performance Based Assessment, Prediction
Bejar, Isaac I. – 1991
Response generative modeling (RGM) is an approach to psychological measurement that involves a "grammar" capable of assigning a psychometric description to every item in a universe of items and is capable of generating all the items in that universe. The article discusses the rationale behind RMG and its roots, explores how it relates to…
Descriptors: Educational Assessment, Item Response Theory, Measurement Techniques, Models
Freedle, Roy; Kostin, Irene – 1992
This study examines the predictability of Graduate Record Examinations (GRE) reading item difficulty (equated delta) for the three major reading item types: main idea, inference, and explicit statement items. Each item type is analyzed separately, using 110 GRE reading passages and their associated 244 reading items; selective analyses of 285…
Descriptors: College Entrance Examinations, Correlation, Difficulty Level, Higher Education
Freedle, Roy; Kostin, Irene – 1993
Prediction of the difficulty (equated delta) of a large sample (n=213) of reading comprehension items from the Test of English as a Foreign Language (TOEFL) was studied using main idea, inference, and supporting statement items. A related purpose was to examine whether text and text-related variables play a significant role in predicting item…
Descriptors: Construct Validity, Difficulty Level, Multiple Choice Tests, Prediction
Carlson, Sybil B.; Ward, William C. – 1988
Issues concerning the cost and feasibility of using Formulating Hypotheses (FH) test item types for the Graduate Record Examinations have slowed research into their use. This project focused on two major issues that need to be addressed in considering FH items for operational use: the costs of scoring and the assignment of scores along a range of…
Descriptors: Adaptive Testing, Computer Assisted Testing, Costs, Pilot Projects
Schmitt, Alicia P.; And Others – 1992
Studies evaluating hypotheses about sources of differential item functioning (DIF) are classified into two categories: observational studies evaluating operational items and randomized DIF studies evaluating specially constructed items. For observational studies, advice is given for item classification, sample selection, the matching criterion,…
Descriptors: Causal Models, Classification, Effect Size, Estimation (Mathematics)
Blais, Jean-Guy; Laurier, Michel – 1995
This study deals with the assessment of the unidimensionality of a set of items through the procedure developed by W. Stout and others (1991) and implemented in the computer program DIMTEST. This study examines a special feature of DIMTEST: the possibility for the user to assess the unidimensionality of a set of items by specifying a subset…
Descriptors: Computer Software, Evaluation Methods, Foreign Countries, French
Pages: 1  |  ...  |  433  |  434  |  435  |  436  |  437  |  438  |  439  |  440  |  441  |  ...  |  636