Musings and Ruminations

"I write to find out what I think." Joan Didion. "Qu'est ce que je sais"-What do you know? "a fox knows many things, but a hedgehog know one big thing" Archilochus I studied most of my life for credentials, now I study as a Polymath. This blog is my personal ruminations. I invite you along to explore many things. I won't promise that it will all be interesting, but I promise that the thoughts are honest. I realized, relatively late, that life is for the living. So, it was time to live.

Showing posts with label Learning and Teaching. Show all posts

Saturday, April 15, 2023

Learning and Teaching-Cognitive Load Theory: Part 2. Minimizing Extrinsic Load by Honing the Presentation.

In Part 1 on Cognitive Load Theory (https://polymathtobe.blogspot.com/2023/02/learning-and-teaching-cognitive-load.html), the framework of WHAT Cognitive Load Theory is was laid out in principle, following Oliver Lovell’s book on the subject (Lovell 2020).

Part 3 is on how teachers can minimize the extrinsic load on the learner through structuring their practices and lessons. (https://polymathtobe.blogspot.com/2023/05/learning-and-teaching-cognitive-load.html)

Part 4 is on how teachers can optimize intrinsic loads on the student. (https://polymathtobe.blogspot.com/2023/08/learning-and-teaching-cognitive-load.html)

This article follows the book in examining how the teacher or coach can apply cognitive load theory to minimize the extrinsic loading on the working memory by honing their presentation. What follows is my interpretation of what was laid out in Lovell’s book, any misrepresentation is entirely my fault.

The definitions of intrinsic and extrinsic loads are reiterated below.

The extrinsic cognitive loads are:

· A part of the manner and structure of how the information is conveyed to the learners.

· Disruptive to the learning task because it distracts the learner from learning by occupying valuable working memory space.

Whereas the intrinsic cognitive loads are those that are critical to learning whatever it is that we need to learn. They are:

· Part of the nature of the information that we are learning.

· Core learning.

· Information that we WANT the learner to have in their working memory.

The critical limitation is that the working memory has a finite capacity; that is, the intrinsic and the extrinsic loads are vying for the same finite resource. One emphasis should be placed on minimizing the extrinsic load; that is, to offload unnecessary extrinsic cognitive load, to make space in the working memory before optimizing the intrinsic loads.

Note that even though Lovell’s book is relatively short, he presents quite a bit of results, and information, and examples from many different subjects, so it is worthwhile to read through the book.

Since I come from two different but related points of view: teaching at a university level and coaching, I will apply the rules for honing the presentation within both contexts whenever they are applicable.

Minimize Extraneous Load

My experience in both teaching and coaching showed me that de-cluttering the learning experience and eliminating the non-essential components is critical to helping my students/players learn. The actions of the teacher/coach in teaching/coaching often distract the student/player, which dilutes their attention. Minimizing the extrinsic load on the student/player is a large job for the teacher/coach. It was a hard lesson to learn.

Lovell split the task of minimizing the extraneous load into two parts: Honing the presentation and structuring the practice. This part is about honing the presentation.

Honing the Presentation

How the information is presented is critical as they need to focus all of their working memory on the learning task rather than any unintended extrinsic loads added by the presentation modes of the teacher/coaches which distracts the learning experience.

Here are the key points presented in Lovell’s book.

· Redundancy Effect: Eliminate unnecessary information and avoid replicating necessary information.

Most teachers/coaches feel that anything they present is critical to learning; that is, they don’t recognize that the way they present the information can be redundant. An example from the author is: When information is presented simultaneously in written and spoken forms, both forms of presentation are competing for the same working memory resources, which results in interference with each other.

o In the case of coaching, it isn’t the mixed medium for presentation which is redundant, it is that coaches often get into the habit of repeating themselves because coaches think that they need to fill in the silence with chatter to make sure that the players get the point. In my experience, the incessant repetition of information and instruction has just the opposite effect: it pulls the player’s attention away from processing the information to actively thinking about the things that the coach is saying.

o The definition given in the book does apply in the classroom teaching context. We tend to present written information and we also have a tendency to read that information to the students. This goes back to the cardinal rule of making presentations: do not read the slides verbatim during a presentation.

· Bullet-Proof definitions: Clearly and simply define each concept introduced. The book recommends a “bullet proof definition”, which is a one sentence summary of the key concept. Detailed definitions and long well-thought-out explanations, while complete, will overload the student’s working memory because they are overcome with the amount of information being presented and will try to process the all the detailed information in real time, an impossible proposition. It is critical to keep it simple but to the point, at least while introducing new concepts.

o In the coaching context, the coach needs to avoid giving all the instructions at once. Coaches will sometimes give the players instructions for the complete skill or tactics rather than feeding the instructions to the players in pieces. While presenting whole and complete pictures are important, they are more important to the experienced players who are refining their understanding of the skills after having achieved a level of competency. The nuances and holistic understanding come later in the player’s skill evolution, not while they are first exposed to the task.

o The idea is identical in the teaching context, the teacher needs to learn to construct a basic scaffolding for the student, who needs to establish their fundamental understanding. Each piece of bullet-proof definition serves to serve as the foundation for their future development. These bullet-proof definitions also need to be placed in relation to the other bullet-proof definitions to show students the linkages between the parts to facilitate the synthesis of the coherent structure. This is especially critical in teaching the student the problem-solving process, because it is never obvious how one problem-solving step leads to the next step.

· Expertise Level Reverse Effect: What may appear to be vital to the expert is not vital to the neophyte.

Successful learning must be based on the level of the learner. Details matter to the learning process of the expert, they don’t matter to the neophyte, yet.

Guadagnoli and Lee’s explains in their paper (Guadagnoli 2004) that those learners who are learning new skills and concept without any previous experience are more likely to be lost and confused while learning the basics because their long-term memory is do not contain any tangible memory to retrieve and extend into new knowledge. The presentation must be honed to meet the level of the student and not that of the teacher.

In both the coaching and teaching context, it is incumbent on the teacher/coach to understand that they need to anticipate the difficult concepts which may confuse a beginner and steer them through the complexities. It is better to underestimate the amount of knowledge that the beginner may have rather than to over-estimate.

· Split Attention: “Information should be placed together in space and time, if and only IF the information cannot be understood in isolation and is essential rather than redundant.” In other words, when presenting complex information to students, the information needs to be presented together if presenting them separately would lead to confusion. This is more applicable in the classroom teaching context.

Rather than presenting complex information separated by space and time, putting them close together in space and time is more effective in helping the student to learn.

o Split Attention in Space: This is more applicable in the classroom context. The recommended practice is to put all the information in close proximity to one another so that the student’s visual field contains all the necessary information so that the complete set of information can be seen and absorbed together. The key is to keep the information within the visual field of the student so that they don’t have to search for pertinent information, while also making sure that the information presented together is not redundant.

o Split Attention in Time: Refers to the timing of the information presentation. Rather than presenting in a manner which makes logical sense to the presenter; for example, presenting in an orderly bullet list form first and then going returning to the list after the initial presentation to define each item, it is better to present each term and follow up with the definition immediately. Oftentimes, orderly presentation writing practices can cause the student’s working memory to be overloaded.

· Transient Information Effect: When information disappears, and students must hold it in working memory, this will also cause extraneous cognitive load. This effect occurs almost exclusively in a classroom setting when the teacher improvises and passes on information verbally without presenting that information in a tangible way. This is where improvisation needs to be regulated and digressions avoided unless the students have significant experience and background to process the digressions in parallel.

In the coaching context, the coach should suppress the urge to extemporaneously progress beyond the specific instruction on the fly.

· Modality Effect: Modality effect is defined as presenting information via the auditory and visual channels in tandem via to eliminate visual split attention and expand working memory capacity. The idea is that the visual and auditory channels work separately, so that both channels can be deployed together to make more effective learning. The auditory and visual channels can be used to enhance the student’s learning process; they are greater than the sum of their parts.

The deployment of the modality effect may seem to be contradictory to the redundancy effect, it is a nuanced difference between being redundant and advantageous. They are only contradictory if the teacher/coach falls into two distinct redundancy traps.

1. Simultaneous presentation of written and spoken words. They both involve the use of the language processing center of the brain, which overloads the working memory.

2. Simultaneous presentation of redundant information.

To successfully deploy the dual modality presentation the information presented with the auditory and visual channels are essentially complementary, i.e. if they can be understood separately, then they need to be presented separately.

An example of dual modality presentation in coaching is the time-honored tradition of skills or tactics demo as the coach explains what the coach wants the player to see. The players should be absorbing the visual demo while also getting essential reminders of what they are seeing. The information from the visual and the auditory channels should complement one another rather than contradict or repeat one another.

· Keep the number of independent subjects limited. This does not come from Lovell, this comes from Doug Lemov’s The Coaches Guide to Coaching (Lemov 2020).

o Anecdotally, the working memory can handle up to seven separate pieces of information at a time. Lemov recommends, especially with beginners, no more than three instructions. It is better to have free space in the working memory so that the student/player can apply more working memory resources to those three instructions.

Note that while these points are seemingly basic, it is difficult to execute because they are often contradictory to what the teacher/coach are accustomed to in practice. I have come to practice understated presentation or explanation to avoid overloading the student/player’s working memory. But just to be sure, I also check for understanding more often and change presentation modes rapidly at the first sign of confusion.

Summary

· Redundant information is the main reason for an unnecessarily overloaded working memory.

· Redundancy comes in many forms

o Presenting the same information using the same means at the same time, e.g. using the written word and oral instructions at the same time because both involve the language processing part of the brain.

o Presenting too many pieces of information together.

· Keep it simple. Have bullet-proof definitions for the student/player to hang on to. Something that is simple and to the point.

o Keep the experience and skill level of the student/player in mind when formulating the definitions. It is easier to initially underestimate their experience and skill level and ratchet up your expectations as they show more aptitude and higher experience and skill levels.

· To avoid splitting the student/player’s attention,

o Combine information in the presentation Only If the information is so closely related that it would cause more confusion if they were presented separately.

o Define the information at the same time as when you present the topic, not returning to the list after the list is presented for the first time .

· Use the language processing and visual processing channels together.

o Visual processing meaning picture and videos.

o Reading slides uses language processing part of the brain, same as listening.

· Present up to three main points at a time and no more. Avoid improvisations and digressions.

The Rest of the Story

Cognitive Load Theory- Part 3: Reducing Extrinsic Load by Structuring Practice

Cognitive Load Theory-Part 4: Optimizing Intrinsic Load

Works Cited

Guadagnoli, Mark and Timothy D. Lee. "Challenge Point: a Framework for Conceptualizing the Effects of Various Practice Conditions in Motor Learning." Journal of Motor Behavior, June 2004: 212-224.
Lemov, Doug. The Coaches Guide to Teaching. Clearwater, FL: John Catt Educational Ltd., 2020.
Lovell, Oliver. Sweller's Cognitive Load Theory in Action. Melton: John Catt Educational Ltd, 2020.

Friday, February 3, 2023

Learning and Teaching-Cognitive Load Theory Part 1

When I started to teach engineering classes after many years away from teaching at a college level, I was looking for ways to buttress my teaching and I was looking for a solid reference to give me confidence to overcome my own fear of failure in the classroom.

My first instinct was to research the latest teaching literature for help. Most of what I was reading was focused on the techniques and tools that were available to the teacher or coach, not exactly a how-to approach.

As I broadened my investigation, the literature eventually steered me towards the cognitive sciences: how learning and retention works neurologically, a different direction than what I had expected. Instead of focusing on strategies and tactics for the teacher/coach to be more effective, this approach focuses on how the instructor should adjust their teaching philosophy to conform to how students/players can best learn.

Cognitive Load Theory (CLT) shows up continually on my radar, I first learned about it from the books that I had read upon the recommendation of Coach Vern Gambetta. (Brown, 2014) (Lemov, Teach Like a Champion: 49 Techniques that Put Students On The Path to College, 2010) (Lemov, The Coaches Guide to Teaching, 2020) (Lemov, Woolway, & Yezzi, Practice Perfect: 42 Rules for Getting Better at Getting Better, 2012). Even as I was intrigued by those books, I still felt lacking in background on CLT, so I read a couple of papers, both by the originator of the concept, John Sweller. His original paper was published in 1998. (John Sweller, Cognitive Architecture and Instructional Design, 1998). He also wrote another paper celebrating the 20^th anniversary of the original paper, discussing the developments that had augmented and evolved from the original theory after its initial introduction (John Sweller, Cognitive Architecture and Instructional Design: 20 Years Later, 2019).

Somewhere along the line, a monograph Cognitive Load Theory in Action by Oliver Lovell (Lovell, 2020) came onto my radar, which focused on not only the theory but has neatly summarized teaching strategies which turns CLT from concept into effective and tangible practice. Lovell had worked in close consultation with Sweller to organize and summarize the CLT ideas that were in the original paper while also broadening the original idea with additional work that was done in the intervening years.

What follows is a discussion structured along Lovell’s monograph, after I added some thoughts on the topic that are specific to my situation: teaching college engineering classes, coaching junior volleyball, and applying the method to my own learning practice. I do this s a means to help me understand topics that I felt were important. It aligns with the following quote from Joan Didion: “I write entirely to find out what I’m thinking, what I’m looking at, what I see and what it means.”

What is CLT?

Cognitive Load Theory is primarily based on what we know of the structure of the human memory and how that structure affects how humans learn and more importantly, recall all the information, knowledge and experience which makes up our learning arsenal.

There are five main principles which underlie CLT.

1.     Memory has Architecture.
2.     Knowledge is Categorized as Biologically Primary or Secondary
3.     Working Memory can be Categorized as either Intrinsic or Extrinsic
4.     The Knowledge Can be Either Domain-General or Domain-Specific
5.     Element Interactivity is the Source of Cognitive Load

Memory has Architecture.

The first essential principle of CLT states that humans’ memories have an architecture which consists of three major resources: environment, working memory and long-term memory.

The environment is everything that exists outside of our brain, all the external resources that we can consult for information: anything and everything that surrounds us which can be used to augment our working and long-term memory. Both Barbara Tversky’s (Tversky, 2019) and Annie Murphy Paul’s (Paul, 2021) books have chapters devoted to how humans use the space surrounding us in extending our memories to that space.

The second architecture is the working memory:

·       It has limited capacity, able to keep 3-7 elements of information at one time.
·       It is where all the thinking takes place.
·       It is the domain of conscious thought: that is, we are actively managing the memory consciously.
·       It is the limiting factor of human thinking, the bottleneck to our ability to learn effectively.
·       It is where the unfamiliar and unknown at that instant takes up more of the working memory capacity than the familiar and known.
·       It uses chunking and automating the information to reduce the cognitive load on the working memory.

The last architecture is the long-term memory:

· It is unlimited in capacity although the information that is most often recalled is those that are most easily recalled, i.e., those that are most often recalled or had been recalled most recently.
· It is divided into three key form of knowledge:

o   Episodic: refers to life events,
o   Semantic: refers to factual information
o   Procedural: refers to process memories.

An analogy can be drawn comparing the human memory definition to computer system configuration. Working memory is analogous to Random Access Memory (RAM), long term memory is analogous to the hardware memory: HDD’s or SSD’s. Whereas the CPU is the brain itself, serving as the central traffic control for the knowledge that is being passed around. Even though this analogy can only carry us so far, as the human brain and nervous system are not as clearly defined as a computer system because the computer system is essentially a Von Neumann machine designed to be a simple imitation of what we think human cognition functions. Another place where the analogy falls apart is the transfer of knowledge from the working memory to the long-term memory. Human memory is such that it takes many iterations of transferring the knowledge from working to long-term memory before it is made permanent, whereas the computer has a specific storage function to more the data from the working memory to the long-term memory.

The analogy holds true for our purposes. The working memory actively using the brain to consciously process all the new and unknown information, knowledge, and experiences; while recalling the familiar and known information, knowledge, and experiences without using the brain because the information has been chinked and automated so that the recall is done subconsciously, without adding to the cognitive load.

This architecture is the foundational model for CLT, the arguments for implementing the strategy and tactics suggested by CLT are built upon the bedrock assumption that the three-resource model is correct.

Knowledge is Categorized as Biologically Primary or Secondary

The second essential principle states that knowledge can be categorized as either biologically primary or secondary.

Biologically primary knowledge refers to knowledge that:

·       Are unconscious, fast, frugal, and implicit.
·       Are acquired by humans through evolution.
·       Are Knowledge that cannot be taught.

Biologically secondary knowledge refers to knowledge that:

·       Are slow, effortful, deliberate, and conscious.
·       Have evolved through the last 1000 years.
·       Needs to be taught.

Working Memory Load can be Categorized as either Intrinsic or Extrinsic

The third essential principle states that our working memory can be categorized as either intrinsic or extrinsic cognitive loads.

Intrinsic cognitive loads are those that are critical to learning whatever it is that we need to learn. They are:

· Part of the nature of the information that we are learning.

· Core learning.

· Information that we WANT the learner to have in their working memory.

Whereas the extrinsic cognitive loads are:

· A part of the manner and structure of how the information is conveyed to the learners.

· Disruptive to the learning task because it distracts the learner from learning by occupying valuable working memory space.

The crux of the problem is that the working memory capacity is finite; that is, the intrinsic and the extrinsic loads are vying for the same finite resource: the working memory. Ideally, the learner needs to optimize the use of the working memory for intrinsic loads and minimize the use of the working memory for extrinsic loads. Note that it is desirable to optimize the intrinsic load rather than also minimizing the intrinsic load as we want to minimize the extrinsic load. The reason for the difference in goals is that we wish to devote as much of the working memory to the intrinsic (productive) learning mode, what is variable is whether an appropriate amount of intrinsic load is placed on the working memory for optimal learning, or whether too much, or too little intrinsic load is placed on the working memory.

The Knowledge Can be Either Domain-General or Domain-Specific

A fourth essential principle states that the knowledge that the learner is being exposed to can be divided into domain-general versus domain-specific.

Domain-general skills are:

·       Biologically primary
·       General capabilities
·       Generally applicable
·       Transferable
·       Examples are:

o Problem solving

o Creativity

o Communication

o Teamwork

o Critical thinking

Domain-general skills are assumed to exist, can be taught, learned and transferred.

Domain -specific skills are:

· Biologically secondary
· Applicable to only specific domains.

The difference between novices and experts in each domain is that experts have more relevant domain-specific knowledge. Which means that the novice uses more thinking in performing tasks while experts use more knowledge. The novices use up more of the working memory to think (conscious) rather than recall and the experts use up more of the working memory to recall knowledge rather than to think (subconscious).

An expert:

·       Has a larger collection of situations and associated actions stored in their long-term memory.
·       Can explain why these situations imply taking the specific actions,
·       Can derive the solutions from foundational principles,
·       Can explain the mechanism behind them,
·       Can recognize the situations and execute an appropriate action,

The payoff is that the expert can recognize a larger cache of problems and situations and the necessary actions to deal with each problem and situation, whereas the novice has a much smaller cache of memory.

If the biologically secondary domain-specific knowledge is not transferable, how does the learner go about improving their performance in a specific domain? The answer, according to Lovell and Sweller, is to increase domain specific knowledge in the long-term memory to expand the number of potential solutions available from the long term memory which can be recalled into the working memory.

Conversely, how does one leverage the biologically primary, domain-general, and transferrable knowledge to improve the biologically secondary, domain specific, and non-transferable knowledge? The answer is to apply transferable knowledge within the context and the constraints of the specific domain. It is an indirect way of leveraging what is already resident within us when we are born to acquire new knowledge that is new to us?

Element Interactivity, the Source of Cognitive Load

The final principle is that the number of elements in a given problem and situation and the amount of interactivity each element has with the other elements determine the amount of cognitive load being placed on the working memory. Element interactivity depends on the nature of the activity as well as the background knowledge stored in the long-term memory inherent in the learner: novice learners will have minimal knowledge base to draw upon, while the expert will have an extensive knowledge base. In addition, the expert also is experienced, so that they can effectively “chunk”, i.e. consolidate and automate the knowledge to effectively bypass thinking and allow their knowledgeable self to react automatically.

Perniciously, the amount of interactivity rises geometrically as the number of elements increases linearly, this kind of growth in interactivity will overwhelm the working memory in short order. The elements and the interactivity of the elements can also be divided into intrinsic and extrinsic loads following the second principle stated.

Indeed, this is the main idea that saves our working memory from constantly being overwhelmed: we can separate the intrinsic and the extrinsic; optimizing the learning from the former and minimizing the chatter from the latter.

According to Lovell, intrinsic load is optimized through well-crafted curriculum sequencing and the extrinsic load is minimized through good instructional design. All of which is also covered Lovell’s book, digging deeper into the granularity of

· Strategies and tools to optimize the intrinsic load.
· Strategies and tools to minimize the intrinsic load.

Which will be covered in a separate article, CLT Part 2, because this first part has optimized my intrinsic memory, and I need a break.

Part 2 is on how teachers can minimize extrinsic load on the learner through honing their presentation. (https://polymathtobe.blogspot.com/2023/04/learning-and-teaching-cognitive-load.html)

Part 4 is on how teachers can optimize intrinsic loads on the student. (https://polymathtobe.blogspot.com/2023/08/learning-and-teaching-cognitive-load.html)

References

Abrahams, D. (2022). Retrieved from Daniel Abrahams: Helping People Perform: https://danabrahams.com/blog/

Brown, P. C. (2014). Make It Stick: The Science of Successful Learning. Canbridge MA: Belknap Press.

John Sweller, J. J. (1998). Cognitive Architecture and Instructional Design. Educational Psychology Review, 251-296.

John Sweller, J. J. (2019). Cognitive Architecture and Instructional Design: 20 Years Later. Educational Psychology Review, 261-292.

Lemov, D. (2010). Teach Like a Champion: 49 Techniques that Put Students On The Path to College. San Francisco: Jossy-Bass Teacher.

Lemov, D. (2020). The Coaches Guide to Teaching. Clearwater, FL: John Catt Educational Ltd.

Lemov, D., Woolway, E., & Yezzi, K. (2012). Practice Perfect: 42 Rules for Getting Better at Getting Better. San Francisco CA: Jossey-Bass.

Lovell, O. (2020). Sweller's Cognitive Load Theory In Action. Melton, Woodtidge UK: JohnCatt Educational Ltd.

Paul, A. M. (2021). The Extended Mind-The Power of Thinking Outside the Brain. New York: Houghton Mifflin Harcourt.

Tversky, B. (2019). Mind In Motion-How Action Shapes Thought. New York: Hachette Book Group.