Prof. Dr. Ben H. Juurlink
Joined 12 Jul 2017
The Embedded Systems Architecture (Architektur eingebetteter Systeme, AES) group of Technical University of Berlin (Technische Universität Berlin, TUB) investigates and teaches the field of computer architecture, ranging from low-power embedded systems to massively parallel high-performance systems. We focus on the design, implementation, and optimization of high-performance embedded systems, taking into account the interactions between applications, tools, and architectures. In addition to high performance, we also aim to improve the energy efficiency, programmability, predictability, and error resilience of emerging computer systems, among other properties.
www.aes.tu-berlin.de/menue/home_aes/
Videos
1 2 2 MIPS64 Addressing Modes and Instruction Formats
9K views · 6 years ago
1 3 3 MIPS Pipeline Features and Pipeline Hazards
23K views · 6 years ago
Test 1 5 1 Caches and the Principle of Locality
2K views · 6 years ago
Test 1 5 2 Direct mapped Cache Organization
958 views · 6 years ago
Test 1 5 4 Basic Cache Optimizations to Reduce Miss Rate
828 views · 6 years ago
Test 1 5 5 Cache Equations for Set Associative Caches
472 views · 6 years ago
Test 1 5 6 Cache Metrics and Improving AMAT
322 views · 6 years ago
Test 1 5 7 Reduce Miss Penalty by Multilevel Cache
571 views · 6 years ago
Test 1 5 8 Give Priority to Read Misses
281 views · 6 years ago
Test 2 3 2 SIMD Register File, Data Types, and Instructions
417 views · 6 years ago
Test 2 3 3 SIMD Multiplication Instructions
293 views · 6 years ago
Test 2 3 4 Special Purpose Instructions & Data Conversions
165 views · 6 years ago
Test 2 3 5 Data Alignment and Reordering
266 views · 6 years ago
Test 2 4 1 TLP Motivation and Introduction
233 views · 6 years ago
Test 2 4 3 Introduction to Block Multithreading
108 views · 6 years ago
Test 2 4 5 Introduction to Interleaved Multithreading
113 views · 6 years ago
Test 2 4 6 Examples of Interleaved Multithreading
84 views · 6 years ago
Test 2 4 7 Introduction to Simultaneous Multithreading
218 views · 6 years ago
Test 2 4 8 Examples of Simultaneous Multithreading
136 views · 6 years ago
1 3 8 Scheduling Instructions for Branch Delay Slot
17K views · 6 years ago
Thanks, Prof. This lesson has ingrained the concept in my brain; it is probably the most important theoretical limitation of concurrent programming.
I think you could fix the unoptimized code just by placing lw Rf, f after lw Rc, c; this moves the stall to the first load, and Rf is already loaded that way.
Serial vs. parallel is basically what affects everything, not just chip logic. That is where you can see how more serial-oriented CPU cores cannot scale as well as massively parallel GPUs. It all comes back to the basics of processing: some tasks must be performed in series and others can be heavily parallelized. The sweet spot between them is constantly being challenged and pushed to achieve the best possible results. In my opinion, the level of true parallelization and processing optimization will only increase, also because of the slowdowns and limits of chip shrinking. The golden years of just adding more, smaller transistors and increasing clock speed are over; they will still play an important role, but surely not as important as in the past.
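Amdahl's law captures that serial-vs-parallel limit: the serial fraction caps the achievable speedup no matter how many parallel units are added. A small illustration with assumed numbers:

#include <cstdio>

int main() {
    const double serial_fraction = 0.10;            // assume 10% of the work cannot be parallelized
    const int procs[] = {1, 4, 16, 64, 1024};
    for (int p : procs) {
        // Amdahl's law: speedup = 1 / (s + (1 - s) / p)
        const double speedup = 1.0 / (serial_fraction + (1.0 - serial_fraction) / p);
        std::printf("p = %4d  speedup = %6.2f\n", p, speedup);
    }
    return 0;
}

With a 10% serial fraction the speedup saturates near 10x, regardless of how many processors are thrown at the problem.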
actually, 40 is not an odd number😁
🙏 thank you
How do you find the size of the block offset? Is that the size of the cache line in bits?
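To the question above: in the usual byte-addressed scheme the block offset is log2(block size in bytes) bits, i.e. it selects a byte within the line; it is not the line size in bits. A minimal sketch with made-up cache parameters:

#include <cmath>
#include <cstdio>

int main() {
    const unsigned block_size_bytes = 64;   // assumed cache line size
    const unsigned num_sets         = 128;  // assumed number of sets
    const unsigned addr_bits        = 32;

    const unsigned offset_bits = static_cast<unsigned>(std::log2(block_size_bytes)); // 6: byte within the line
    const unsigned index_bits  = static_cast<unsigned>(std::log2(num_sets));         // 7: selects the set
    const unsigned tag_bits    = addr_bits - index_bits - offset_bits;               // 19: the rest is the tag

    std::printf("offset = %u, index = %u, tag = %u bits\n", offset_bits, index_bits, tag_bits);
    return 0;
}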
It looks so ugly. Just use assembly and don't f**k your brain, guys.
Thank you, Professor
Thank you very much, sir. This is one of the best lecture presentations I have seen.
I enjoyed this a lot! Your way of teaching is so engaging and enlightening! Thanks for sharing with us these wonderful videos!
My quiz is today in a couple of hours; this saved my bacon. It's only worth 100% of my final grade anyway, nothing crazy.
Thank you so much, you are better than those Indian teachers and my university teacher.
Thank you for this lecture. It helped me understand the basics of dynamic scheduling for loops by breaking them down.
R9 is 1024, as it is the value compared against to jump out of the loop.
Thank you Sir
goat
marinos antoniou x goggins big biceps
The clearest explanation of this topic!! Thank you.
I'm a little confused here. The BHT is used to store the history of individual branches. Is there another table to record the prediction state (whether the prediction was wrong or correct)?
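On the question above: in the common 2-bit scheme (which may differ in detail from the lecture's), each BHT entry is a saturating counter that is itself the prediction state, so no separate correct/incorrect table is needed; updates just nudge the counter toward or away from "taken". A minimal sketch of that scheme:

#include <cstddef>
#include <cstdint>
#include <vector>

// Sketch of a common BHT organization (assumption: one 2-bit saturating counter
// per entry). Counter values 0-1 predict not-taken, 2-3 predict taken.
class BranchHistoryTable {
public:
    explicit BranchHistoryTable(std::size_t entries) : table_(entries, 1) {}

    bool predict(std::uint32_t pc) const {
        return table_[index(pc)] >= 2;          // predict taken if counter is 2 or 3
    }

    void update(std::uint32_t pc, bool taken) {
        std::uint8_t& c = table_[index(pc)];
        if (taken  && c < 3) ++c;               // strengthen "taken"
        if (!taken && c > 0) --c;               // strengthen "not taken"
    }

private:
    std::size_t index(std::uint32_t pc) const { return (pc >> 2) % table_.size(); }
    std::vector<std::uint8_t> table_;
};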
Excellent explanation and summary!
Thank you, this has helped me a lot! One thing, though: the quiz questions seem very vaguely and confusingly worded. "What is the ideal speedup due to pipelining?" Well, from what is said here, pipelining WILL speed up executed instructions, and that is the ideal point of it; it does so because of how the stages are organized and utilized. So it's both, unless the question was meant to be "What is the ideal speedup due to, in pipelining?"
Let's say a single-cycle CPU can run at 1 MHz on a given process; could a 5-stage pipelined CPU then run at 5 MHz?
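Roughly, yes, under ideal assumptions: splitting the single-cycle datapath into k balanced stages lets the clock run k times faster, so the ideal speedup equals the number of stages; pipeline-register overhead, unbalanced stages, and hazards reduce it. A small illustrative calculation with assumed numbers:

#include <cstdio>

int main() {
    const double single_cycle_ns = 1000.0; // 1 MHz single-cycle CPU -> 1000 ns per instruction
    const int    stages          = 5;
    const double latch_overhead  = 20.0;   // assumed pipeline-register delay per stage, in ns

    const double stage_time    = single_cycle_ns / stages + latch_overhead;  // 220 ns per stage
    const double ideal_speedup = static_cast<double>(stages);                // hazard-free, no overhead
    const double real_speedup  = single_cycle_ns / stage_time;               // ~4.55x

    std::printf("clock: %.2f MHz, ideal speedup: %.1fx, with overhead: %.2fx\n",
                1000.0 / stage_time, ideal_speedup, real_speedup);
    return 0;
}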
Little Fords
Well done, Professor, you do a great job.
Too many loops. I have an easier method called "slicing", where I take slices of matrices A and B and multiply them in such a way that I access rows of A and rows of B. The matrices are stored as typed arrays (contiguous memory blocks), abstracted into 2D arrays with values stored in row-major order. Even though they are single arrays, it's unlikely that the whole array will fit into a CPU cache, so cache misses are inevitable. But I'm going to solve the problem of column access causing extra cache misses without having to transform matrix B, and without adding extra operations (especially multiplications).
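For reference, one standard way to get row-wise access to both inputs without transposing B or adding multiplications is the i-k-j loop order; the commenter's "slicing" scheme may differ, this only sketches the idea:

#include <cstddef>
#include <vector>

// A, B, C are row-major N x N matrices stored in flat arrays.
// The i-k-j order walks B and C along rows in the inner loop, so column-wise
// strided access (and the extra cache misses it causes) is avoided.
void matmul_ikj(const std::vector<double>& A, const std::vector<double>& B,
                std::vector<double>& C, std::size_t N) {
    for (std::size_t i = 0; i < N; ++i)
        for (std::size_t k = 0; k < N; ++k) {
            const double a = A[i * N + k];         // reused across the whole inner loop
            for (std::size_t j = 0; j < N; ++j)
                C[i * N + j] += a * B[k * N + j];  // B and C both accessed row by row
        }
}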
It's very clear. Thank you for your work : ]
Perfect for a networking scheduler. If you're counting the load on each thread to distribute new requests to the less loaded threads, then you really don't need an atomic variable that wastes CPU cycles by waiting for the threads to be synchronized; you could just read, and even if the scheduler reads an old value that was updated 10 seconds ago by the thread, who cares? It would work perfectly for distributing the load. I exaggerated, of course; it would most likely be a few nanoseconds of delay in CPU time between threads, so even better. This relaxed consistency model is preferred here over atomic models that require strict consistency. It also works because there would only be one thread ever writing to the variable, while the scheduler merely reads it, so no undefined behavior.
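A minimal sketch of the single-writer load counter the comment describes; in C++ the idiomatic way to express it is a relaxed atomic (names below are assumptions), which avoids the ordering cost while keeping the program free of data races:

#include <atomic>
#include <chrono>
#include <cstdio>
#include <thread>

// Hypothetical per-thread load counter: the worker is the only writer, the
// scheduler only reads, and a slightly stale value is acceptable, so
// memory_order_relaxed is enough.
std::atomic<unsigned> pending_requests{0};

void worker() {
    for (int i = 0; i < 1000; ++i) {
        pending_requests.fetch_add(1, std::memory_order_relaxed);  // request arrives
        std::this_thread::sleep_for(std::chrono::microseconds(10));
        pending_requests.fetch_sub(1, std::memory_order_relaxed);  // request done
    }
}

int main() {
    std::thread t(worker);
    for (int i = 0; i < 5; ++i) {
        // The scheduler polls the counter; it may observe a slightly stale value,
        // which is fine for load balancing.
        std::printf("load = %u\n", pending_requests.load(std::memory_order_relaxed));
        std::this_thread::sleep_for(std::chrono::milliseconds(1));
    }
    t.join();
    return 0;
}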
00:11 Introduction to computer architecture and objectives
01:20 Computer architecture is the science and art of selecting and interconnecting hardware components.
02:39 Introduction to ENIAC and course objectives
03:59 Understanding and overcoming cache challenges
05:18 Core i5 processors utilize key components for out-of-order execution
06:38 Branch prediction and SIMD instructions are important concepts for accelerating applications.
07:57 Nehalem microarchitecture highlights dual multi-threading support by Intel
09:15 Server processors contain multiple cores and execute multiple processes simultaneously.
Crafted by Merlin AI.
At the end, the demonstration of the mnemonics wasn't shown in the background.
Extraordinary....simple and clear....Thank you very much
Thank you for an amazing lecture.
Slides don't work 😢
Very good.. Thanks
The microphone is really killing this great series!
Many thanks!
Very good video!
Best explanation on the internet! I will never forget Amdahl's law.
At 6:42, "The two inner loops access an N times N/B submatrix of x": I think he should instead have said "The two inner loops access an N times B submatrix of x"?
thankssss for this video
thank you for sharing knowledge selflessly!
Clear, concise, precise - what I'd expect from a German
Sir, you are great, but I do not understand.
Thanks very much
Thanks, king, I've got an exam tomorrow.
Amazing and simple
Thanks for the great explanation.
Thx Germany 🇩🇪 I need this for finals next week. From 🇺🇸
Prof. Dr. Juurlink, thank you so much for your informative and crisp videos! They are very helpful for my computer architecture course here in the U.S.!
Thank you!
At 6:35: valid only if a doesn't overlap with b or c.
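A minimal sketch of that aliasing concern, assuming the loop at 6:35 is something like c[i] = a[i] + b[i] (the names and shapes here are assumptions):

#include <cstddef>

// A compiler (or a SIMD rewrite) may only vectorize or reorder this loop if it
// can prove the arrays do not overlap; if c aliases a or b, an iteration may
// read a value that another iteration has already overwritten.
void add(const float* a, const float* b, float* c, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i)
        c[i] = a[i] + b[i];
}

// Example of the problematic case (overlapping arguments):
//   float buf[8] = {1, 1, 1, 1, 1, 1, 1, 1};
//   add(buf, buf, buf + 1, 7);   // writes buf[i+1] = 2 * buf[i]
// Here iteration i reads the value written by iteration i - 1, a loop-carried
// dependence, so the parallel/SIMD reading of the loop is no longer valid.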