HomeAbout MASCDMembershipEvents and TrainingCommunicationsBoard MeetingsResourcesJob OpportunitiesListservLinksContact Us - jscofield@gomasa.org

BASICS FOR USING STANDARDS AND ASSESSMENTS
Thomas R. Guskey

10/23/2002

Our focus today is standards and assessments. Next time is grading and reporting. Take a moment for yourself to think about two issues: given that the session is focused on standards and assessments, why did you come today? To be successful for you, what would you like to get out of today?

How do we bring efforts together? How do you change attitudes about assessment? How do we create an understanding of assessments? How does assessment drive instruction so that it’s understandable for everyone? How do we create a common rubric so that writing can be assessed across departments? How do you merge the two masters: standards and accountability? Are standardized assessments more valid than district assessments? How should we report assessment results? Contextured assessment vs decontextured assessment…

No Child Left Behind
A lot of what’s being done today in assessment, especially with respect to reform, happened in Kentucky. Kentucky has led the way in the nation in educational reform. Kentucky was the first state to have a statewide assessment program; it focused on language arts and math, had MC and a few CR questions. It generated an aggregated score for each school, and Kentucky used this to rank order all the schools in the state. This generated all kinds of problems because of the great diversity all over the state. This created all sorts of dilemmas. Educational funding is also inequitable. The state department decided not to rank the schools any more, but the newspapers did it. A group of 66 poor schools filed a class action suit to say that school funding was inequitable, and they won. The state’s system of education was judged inequitable (1988-1989). The KERA (Kentucky Education Reform Act) was passed and was very comprehensive. It was the first reform effort in the country: it dealt with finance, governance, curriculum and instruction. Standards were established for all children in six different domains: reading, writing, language arts, social studies, science, fine arts, practical living. A statewide assessment was also developed. Five people helped design the system: Jason Millman, Doris Redman, Pat Forgione, Edward Keiffer, Grant Wiggins. They developed a phenomenal assessment system that included three different components: there were portfolios in each domain; on-demand assessment with MC and CR items; performance events (a group of evaluators would come to your school and a group of children would be selected randomly to perform); non-cognitive indicators—dropout rates, ADA, and follow up information—were they gainfully employed, going to school, etc. They had to define success. They had to draw lines: what is and what isn’t success. This had to be tied to accountability. Technical issues are solvable. Accountability is a political issue. They had four levels: novice, apprentice, proficient, distinguished. All children were included in the assessments, no exceptions. If a child isn’t present on the day of assessment, they give the child the lowest score possible on the assessment. They averaged the school scores. You were given 20 years to get to 100%. The first year tests were given was 1992. The average score for schools was 30%. They bi-annual goal was an increase based on the difference divided by 10. If you went beyond this goal, you were awarded up to $3000/teacher. If you didn’t make it, the principal and teachers could be fired. Teacher tenure was eliminated in the State. If a school isn’t making it, there are intermediate steps and resources available. No school has reached that level of sanctions yet. The accountability is on the part of teachers, not students.

California created a similar system. In one school students didn’t want speedbumps and held the district hostage. The district lost $56,000 when the students blew off the test. Children must have adequate opportunity to learn. The Kentucky system has to do with improvement. If a school does nothing and stays the same, sanctions are in order. This eliminates the need to adjust for social, economic or demographic characteristics.

William Sanders in Tennessee thought the Kentucky model was worthwhile. He started analyzing results in terms of gain scores and called this “value added accountability.” It doesn’t matter where you are but how much you gain. He found that gain patterns were inconsistent and unrelated. It’s better to be in a rich district than a poor one; the big difference is between teachers. They’ve taken accountability to the individual classroom level. He’s presented this to the chief state school officers three times. They can attach a name to classroom level results. The teachers who get good results get good results year after year after year. The same thing goes for teachers who get poor results. He found it is somewhat related to who has had advanced training in their subject areas. It doesn’t matter what students the good teachers have; they always get good results.

Texas adopted this model. Texas used this to rank order teachers in a system. They also determined what teachers created the top third, middle third, bottom third. They examined children who started at the 40th %ile and if these children had teachers in the top third, they ended up in the 82% range. The effects are cumulative. Teachers make a difference! If we’re looking for models of excellence, we don’t have to look far. If we’re looking for teachers who need help, we don’t have to look far, either.

NCLB has similar parameters. We’re testing 3rd grade through 8th grade and one year beyond. We’re going to disaggregate the data. Our President and Secretary of Education are from Texas. We’re absolutely moving in this direction.

Systemic Change
• Change is a highly complex process
• Professional development is essential

Michael Fullan – Leading in a Culture of Change

Baselines are being established in every state based on the standards set by the state. Schools have until 2014 to reach 100%. In Kentucky, no school has reached the 100% level yet though some are close.

Change is a prerequisite for improvement! Everything is based on this premise. You can’t get better without any change. One of the efforts in the reform effort is to clarify what we want students to learn: standards.

In Standards-Based Education there are four premises:
1. These ideas are not new!
• Ralph W. Tyler – 1949
• “Basic Principles of Curriculum and Instruction”
• Two fundamental decisions:
a. What do I want students to learn?
b. What evidence would I accept to verify their learning?

Education isn’t based on student learning; it’s based on time. It’s Tyler who said, “I taught them, they just didn’t learn it” is the same as saying, “I sold it, they just didn’t buy it.” If they didn’t learn anything, how can you say anything was taught? We have to see teaching and learning as absolutely linked.

2. The Ideas are more important than the vocabulary we use!
In education today, we’ve become involved in a war of words, and it’s detracting us from important work we need to do. Confusing vocabulary:
Objective goal outcome standard benchmark
Competency proficiency performance expectation aspiration
New year’s resolution

It doesn’t matter what you call it! The ideas are more important than the vocabulary. We must make definite decisions about what we want students to learn and what evidence we’ll accept that they’ve learned it.

3. Good ideas can be implemented poorly!
• How do the ideas translate into practice?
• How will we know if they work?

Teachers don’t plan in terms of goals or standards or objectives; they plan in terms of materials and activities. What that says is that if we want these things to find their way into practice, we have to find a way into implementation. We must be concerned about implementation: how it’s enacted in practice.
4. Success in education hinges on what happens at the classroom level. Any successful change will occur at the smallest level. It doesn’t matter what happens at the national, state or district level; if it doesn’t happen at the classroom level, it will fail!

In the 1970s teachers started to be held accountable for performance on national standardized tests. Teachers started teaching to the test. When the powers that be learned this, they started changing the tests to be higher level. This only brought about slight changes in the classroom.

Carol Ann Tomlinson’s The Parallel Curriculum. EMU’s Pat Williams-Boyd is strong in differentiated instruction at the middle level. Deborah Wahlstrom is a good presenter, very practical, takes the mystery out of using data in the classroom.

If we want to make a difference at the classroom level, how can we do this? If we press on these issues on the teacher and ask what obstacles there are, there are two answers that are always given:
1. Student motivation
2. Diversity among students

In this case, diversity means the range of skills and abilities in the classroom. How can we have them all learn well since they’re starting at such different points?

Student motivation:
There one thing more closely tied to motivation than anything else: The key to motivation is success! They avoid with a passion those things that they feel they can’t be successful at or will fail. Walk into a video arcade. We’re told in methods classes to make things relevant and interesting. But Pac Man is about gobbling up dots—it’s repetitive and isn’t relevant. It’s not the reward but the success. Every time they play, they can improve their score. They play with passion and intensity. Success is more important than interest or relevance.

Diversity among students:
Teachers often lack the time, support and strategies to assist all learners.
Individualized programs are often impractical in group-based classroom.

If kids are going at their own rate, how are they going to learn everything they need to learn. They can go at their own rate, but by the end of the year, they must be here. Next year’s teachers won’t be happy if they aren’t at this place. How can they learn what they’re expected to learn if they go at their own rate?

Needed: A practical strategy to help all students succeed!

Solution: Mastery Learning – A practical strategy to help all students succeed!
This helps all students succeed in a standards-based format.

Mastery Learning: An Overview
How to get started – Mid 1960s we really tried to individualize instruction. As teachers started using them, they discovered hardships. Bloom came along and he was concerned with the same issue. He walked into schools and watched teachers teach. He found consistency across grade levels and subject areas; there was lots of consistency, even in assessment. They took the materials and divided them up into learning units that corresponded with chapters in textbooks. The teacher introduced and taught a unit, they reviewed, they took a test and went over the test, then went on to the next unit. How many really learn excellently? Teachers thought 30%. Students who didn’t learn the first unit well, didn’t learn the second unit well, particularly if it was sequential. Bloom found that only 20% of the students really learned excellently. Bloom discovered the Bell curve for these students. The Bell curve is the distribution of achievement in traditional classrooms. Bloom said, “Hold on! This is the random distribution of achievement without any intervention!” If we intervene, we should have more students get better scores. He said, “Teaching is an intervention. It’s designed to make a difference.” If we get a Bell curve, we haven’t made a difference.

The basis of mastery learning:
1. Tutoring instructional techniques are the best.
2. Learning strategies of successful students.

With these two sources of information in mind, he went back to the classroom. He kept the idea of things being divided into units. At the quiz point, he said we need to make a change. We’ll use this as a source of information. This will communicate to kids what’s important to know. He called it formative.

Assessment information or feedback
1. What students are expected to learn.
2. What each student has learned well.
3. What each student needs to learn better.

Formative should inform the student about what’s important. Mystery tests teach two things: you can’t trust the teacher and it doesn’t matter how hard you work. Learning for most kids is a guessing game. We were good at school because we were good a guessing what the teacher thought was important.

We need to pair with the feedback those things we deem important. This is not reteaching, meaning we repeat the same thing but louder. Bloom suggested a process that looked like this:

Instructional process in mastery learning classrooms

Unit 1 – First formative assessment
Feedback and correctives
Second formative assessment

At the end you’d get a J curve. You aren’t lowering standards or dumbing down the curriculum. This is called the mastery learning process.

Unit 1 – Formative Assessment A
Enrichment -> Unit 2
Correctives -> Formative Assessment B (parallel to the first but not identical) -> Unit 2

Enrichment is a challenging, exciting learning opportunity for those who get it the first time.

1. One-to-one tutoring is just the process described. You constantly ask the student questions. When s/he makes a mistake, you stop and point out the error and try to explain it in a different way, then ask another question to see if the student understands before going on. This is what an excellent coach does.

2. Strategies of successful students
They look at what they do wrong, correct the test, save the test to review.

Essential Elements of Mastery Learning
1. Feedback, correctives, and enrichment
2. Congruence among instructional components (alignment)

Implementing Mastery Learning is in its second printing and it’s practical for teachers.

In order to implement this, we must incorporate the feedback loop. In order for a teacher to incorporate these, a teacher must offer regular and specific feedback on their progress. If they do this only once a year, you’re going to lose. We have to go over assessment results weekly. Any time any kind of assessment is administered, it’s an opportunity to learn. How many kids missed each assessment item? If it’s a great percentage, it’s a teacher problem. If kids don’t get it, it didn’t work! Whether it works is defined by what our students can do. We have to set aside ego issues. If over half the kids in class miss something, it’s something I’m doing as teacher. Principals should ask every Friday as teachers leave the parking lot what they learned as a teacher this week. We should aslk students every day what they learned in school today; if they reply, “Nothing,” we send them back into the school.

Correctives must include:
1. Different presentation
2. Different involvement
Children don’t learn in the same way.

Correctives include:
• Different learning styles
• Different learning modalities
• Different types of intelligence
• NOT repetition of the initial instruction

Enrichment must be:
1. Rewarding
2. Challenging (valuable learning experience and need not be related directly to the unit content)

Best resources can be found in materials for G/T. They’re gamelike and challenging. They involve high level skills.

G/T – Midwest Publications; Dale Seymour; Games magazine. Apt initials—take a famous person’s initials and develop a phrase using the initials. Resignation Made News = Richard M. Nixon. Wonderkid and Musician = Wolfgang Amadeus Mozart. After you’re done, take your own initials and develop a phrase that describes you.

Mastery learning is not: teach > test > reteach > retest. Don’t use a 2-test format! The feedback corrective enrichment process is essential. You can use the first test to predict the score on the second test unless you do correctives/enrichment.

Essential element #2
Congruence among instructional components or alignment.
Teaching and learning are composed of three major parts:
Learning objectives > instruction > feedback and correctives > competent learners (evaluation)

Among these components there must be congruence. Mastery learning is not: teaching to the test. Not so for this reason: If a test dictates to the teacher what is to be taught, then you must teach to the test. This isn’t the same thing. Every teacher at every level in every subject level must clearly identify what’s to be learned. Teaching is directional. Teaching is taking students from where they are to where we want them to be. We’re not teaching to a test but testing what we teach. Every time you give a test, you’re teaching students what’s important to learn.

Essential Elements of Mastery Learning show:
1. Implementation is flexible
2. Applications are broad

Implementation is extremely flexible; there’s no one best way to teach it. Format of tests can be very different. Mastery learning takes the guesswork out of learning. The formative assessment permits individualization. They only work on what they didn’t do well on. The correctives are your ticket to the second assessment. He gives his students guidance and feedback on their learning errors. He clearly communicates the standards to his students.

Common Concerns Regarding Mastery Learning:
1. Mastery Learning and Grading Standards?
2. The fairness of a second chance?
3. Mastery learning’s relation to real life?
4. Motivation and the second chance?

If I do this, if I extend my units or instruction, I’m going to have to limit the amount of content I cover. This is true in the early units; it will require more time. Students need time to be oriented to this process. All of this is new. They’ll ask, Where’s the trick? Some orientation is necessary. When they first begin, teachers must take time for the correctives to be done in the classroom under the guidance of the teacher. If not, they won’t do it. It’s done in class and under my direction and guidance. I want them to see they get something out of it. When I first begin, it’s highly structured. First thing I do with the correctives group is reteach. When they’re working on something, I go up and down the rows. Enrichment activities must be rewarding, challenging, things the students can work on fairly independently, self-directional, and can be completed within the time I’ve allotted. After awhile, that structure can be relaxed. Start out with correctives as 1 ½ hours per unit, then reduce it to 1 per unit, then 1 per two units, ½ per four units, and finally I give it to them as homework. Review is now shifted to the correctives phase. The kids who don’t need to review can do enrichment. You must see pacing your units in more flexible terms. My first units are 7 day; my later units are 5 days. At mid-term I’m one unit behind, but by the final, I’m one unit ahead!

That’s exactly what Kindergarten’s like: you get there in different ways and at different times, but you all get there.

What happens if the child doesn’t make it the second time or third time or fourth time—if you keep going back, you’ll teach the same unit all year. Move on. This process is student-based and teacher-paced. If you insist on 100% before going on to the next thing, you’re going for perfection. Kids learn things differently and in different sequences. You should move on if something isn’t developmentally
appropriate for a child at that time. You need to spiral in the curriculum. Refer to items you taught in unit one during unit 2. It’s one way to capture what’s difficult even if you did move on. Do the same on the exam. If you miss this, you’ll do correctives on it again. When we move on, we have to see progress in different ways. Some kids need to master skills before they can move on; others don’t.

To most kids multiplication is a new thing they have to memorize. They don’t see it as an extension of addition. Remember where you want to be at the end and see your pathway and students’ pathways in a flexible way. Multiple opportunities to learn means you spiral and provide opportunities at different times.

Do I have to change my grading standards with mastery learning? No, unless you grade kids on the curve = you don’t know if anyone learned anything. You have to grade based on standards.

The fairness of a second chance is understood by writing teachers. First draft and get feedback; revise according to the feedback; rewrite and get more feedback, etc. Am I supposed to give them the same high grade if they get a good score on the second test? This isn’t like what life is like at all. We let all other professionals practice before they actually do what it is they’ve learned (pilots, surgeons, etc.). However, we expect kids to perform perfectly the first time. Can kids learn successfully without making mistakes? Thomas Edison had over 1000 failed attempts to make the light bulb. He says he learned 1000 ways how not to make the light bulb. These are the kinds of skills we want to develop in lifelong learners.

If I tie it to the grade, they’ll try harder. Did you pass your driver’s license test on the first time? What do we say to those who don’t? They can’t drive in the rain? Maybe the reason they didn’t get it the first time isn’t there fault; maybe it was mine? What’s the motivation for the first if they get a second? Bloom emphasized enrichment. Getting the certificate of mastery proved motivational. It’s tangible recognition of success! It means someone recognizes they’ve done a good job.

Another concern that was raised is, does this imply we have to start every unit with a pre-test? Bloom said not to use them. Pre-tests have value if we use them at the start of the school year. Don’t use them at the beginning of every formal unit. It only confirms what the teacher already knows (they don’t know this material). It also starts their experience with a failure. Use the pre-test as formative A and the post-test as formative B, and your concern is how to construct the enrichment and correctives in between. Rather than Monday, give the pre-test on Wednesday after two days of teaching and the post-test on Friday after days of enrichment and correctives.

If we’re going to use this well, here are the steps I think we should take. Discuss.

Developing a Table of Specifications (Outlining Learning Goals)

Two Important Questions:
1. What do I want students to learn?
2. What do I want students to do with what they have learned?
When you look at implementation, think big but start small. Start at the individual classroom level/school level. If you approach implementation voluntarily, not everyone will do it. If you get 30-40% to do it, you’re doing well. Those who are likely to volunteer are the best, most effective, the most talented you have on your staff. Those most likely to resist are those who don’t want to change anyway. Go with the thoroughbreds. Publish their results. You won’t get up to 100% even in the 2nd year. You’ll still have holdouts in the 3rd year. Three years into the process 90% are using it. The principal/department or grade level chair sits down with them. “I think you should get similar results.” Share the results of the teachers who are using these techniques. Otherwise, this person is hurting kids; isn’t professional. Not everyone will have the will at the start. Make sure you collect the evidence! Think five years down the line.

One of the major challenges you’ll face: you have standards, goals, benchmarks. How do you translate these into instructional practice? In instructional planning, ask teachers “How far will you go before you’re sure students are with you?” In this unit, you must ask, “What do you want students to learn? What do you want students to do with what they have learned?” Use the Table of Specifications. Are there any vocabulary terms they’ll need to learn? What facts do you want them to learn? What rules & principles do you want them to learn? What processes & procedures (a series of steps) you want students to know as a result of this unit? The first four columns answers, What do you want students to know?

What do yu want students to do with what they know? Do you want them to explain it in their own words? This is translations. Give them a paragraph and pick out errors = translations. If they write the paragraph and pick out the errors = application. See what we’re doing in math that has relevance in science = analysis and synthesis. This is Bloom’s taxonomy. Teachers use their instructional materials as a guide to develop this instructional table. This will only fill out the first four columns. Teachers get nervous when they see empty boxes, so they fill out the “translations,” “applications,” and “analysis and synthesis” boxes themselves. Marzano refers to this as unpacking the standard. This is a unit plan. The most important part is to pull out the assessment you use for the unit and see how well it matches. This is a very humbling experience. They’ll discover that all of their assessment is in the knowledge area. They see that they have to change their assessments to more performance. This helps bring about congruence. You don’t have to have something in every column! Sometimes early units are heavily laden with lower level skills. Later units deal with problem solving and not vocabulary, for example. This is a basic framework for bridging the gap between what the standards and benchmarks provide and the materials deliver and the assessments evaluate. This helps us unpack, combine units. This is the way teachers plan.

Guidelines for Success
If assessments are going to make a difference, if you look at schools where they’ve used assessments to make a difference, here’s what they’ve done:
1. Think big, but start small. Don’t require too much, too soon from teachers and administrators. Do not try using these ideas in all the subjects in the very first year because if you do, you’ll die! Have a clear vision of where you want to be 3-4 years from now. Start with this first year, add something else second year, add something new third year. Teachers consistently get better results the second year as they work the bugs out. Have a vision, long term.
2. Ensure that assessments become an integral part of the instructional process. Quizzes and tests should be learning tools, not simply evaluation devices that mark the end of learning.
Implication #1: Assessments must be sources of information for students and teachers.
Implication #2: Assessments must be followed by high quality corrective instruction. Just going over the test isn’t enough.
Implication #3: Students must be given a second chance to show improvement! It gives students a second chance at success.

Quote: “Spectacular achievements are always preceded by unspectacular preparation.” Roger Staubach. You don’t get a second chance in the Super Bowl, but up to that point, you do.

3. Link new assessments with existing classroom assessment practices. Blend traditional approaches with alternative assessments. This year we’re going to do portfolios. Why? Make sure form follows function. Adapt assessments to fit the learning goals/standards/whatever you’re trying to achieve.

A complex problem:

2/3 + ¾ = 5/7

Assessment formats – Traditional assessments (1. T/F; 2. Matching; 3. Multiple choice; 4. Completion; 5. Essay/short answer.

Do you really understand the concept of adding fractions with unlike denominators?

No matter how clever we are, students will always give us answers that are difficult to score. There is no such thing as a perfect assessment.

Essay/short answer are now called constructed responses. Traditional assessments all have advantages and disadvantages.

1. T/F are good, particularly if you have low level skills. Put T/F in the left hand margin rather than have them write a T or an F. They’re restricted to low level skills. Don’t ask kids to correct it to make it true. They can guess and get 50% right.

2. Matching. They’re restricted to low level skills. One important aspect of matching: how you format them on the page is important. Don’t put answers on the left. We’ve taught them to read left to right. They can read a long response and scan a list of words. It takes 1/3 the time. The less time you take for this, the more time you have for instruction. Publishers usually format them wrong. If it’s formatted by the publisher wrong, work backwards. Read the long series and scan the long series of words.

3. Multiple choice: Include common errors to diagnose learning problems. If a child answers 1.2 = .23 = either 3.5 or .35, the child has a problem with placement of decimals. You’re not trying to be tricky but to diagnose a problem.
4. Convergence theory: Stephen Douglas, Abraham Lincoln, James Monroe, Robert E. Lee. What’s the question? Often just by analyzing the relationship between the answers, you can identify where they converge. Some share a Civil War history. Some share the presidency. Lincoln has two of these, thus he’s the answer. High school students can be trained in convergence theory. They can get 60-70% right without having the questions! The usefulness as a learning tool is dependent on immediate feedback. They can get this by creating a double answer sheet. As they take the assessment, they write the answers twice. They hand one in. We go over it immediately and they correct their papers. They know what they got right and what they got wrong. They know who’s in enrichment and who’s in correctives. Teacher makes an assessment analysis (# of errors/item). First look at the questions themselves. Are they poorly worded? If no, then I didn’t teach it very well. If many students chose the item in error, they’re guessing. This means I need to make a change here.

Alternative assessments:
5. Skill demonstrations
6. Oral presentations
7. Task performances and complex problems
8. compositions and writing samples
9. laboratory experiments
10. projects and reports
11. group tasks or activities
12. portfolios

The key to success with alternative assessments: Clearly specified performance criteria or scoring rubrics. These need to be clear enough so that the student knows how he can get a 10 (of 10).

Rubrics:
1. List the criteria for a piece of work or “what counts.”
2. Articulate graduations of quality for each criterion from “Excellent” to “Poor.”

Heidi Goodrich. The magic numbers are between 3-5. Understanding rubrics. (1996) Educational Leadership, 54 (4), 14-17.

Simple guideline for developing graduations of quality:
4 Yes
3. Yes, but…
2 No, but
1. No

Why use rubrics?
1. They’re powerful tools for teaching and assessment.
2. They help students become more thoughtful judges of their own work.
3. They reduce the time teachers spend evaluating students’ work.
4. They allow teachers to accommodate differences in heterogeneous classes.
5. They’re easy to use and explain.
6. They improve objectivity in scoring.

Tips for designing rubrics:
1. Begin with models of excellence
2. Avoid unclear language (e.g., “A Creative Beginning”)
3. Avoid unnecessary negative language
4. Involve students in the process


 


 

Michigan ASCD ©
1001 Centennial Way, Suite 300
Lansing, MI 48917-9279
jscofield@gomasa.org