外文资料
Hardware and Computer Organization
From: Hardware and Computer Organization, Arnold S.Berger Figure 6.21is an excerpt form te data sheet for an SDRAM memory device from Micron Technology,Inc, a miconductor memory manufacturer located in Boi ,ID.The timing diagram is for the MT48LC128MXA22family of SDRAM memories.The devices are 513 Mbit parts organized as 4,8or 16-bit wide data paths . The ‘X’is a placeholder for the organization (4,8or 16 bit wide).Thus ,the MT48LC128M4A2 32m* is organized as 32M*4,while the MT48LC128M4A2is organized as 8M*16.
The devices are far more complicated in their operation then the simple SRAM memories we’ve looked at so far. However ,we can e the fundamental brust behavior in Figure 6.21.
The fields marked COMMAND ,ADDRESS and DQ are reprented as band of data,rather than individual bits .This is a simplification that allows us to show a group of signals ,such as 14address bits ,without having to show the state of each individual signal.The band is ud to show where the signal must be stable and where it is allowed to chang .Notice how the signals are all synchronized to the rising edge of the clock .Once the READ command is issued and the address is provided for where the
statueoflibertyburst is to originate ,there is a two clock cycle latency an quentially stored data in the chip will then be available on every successive clock cycle.Clearly,this is far more efficient then reading one byte at a time .
上海商务英语培训
When we consider cache memories in greater detail,we’ll e that the on-chip caches are also designed to be filled from external memory in bursts of data.Thus ,we incur a penalty in having to t-up the initial conditions for the data transfer from external memory to the on-chip caches,but once the data transfer parameters are loaded ,the memory to memory data transfer can take place quite rapidly.For this family of devices the data transfer takes place at a maximum clock rate of 133MHz.
Newer SDRAM devices, called double data rate,or DDR chips,can transfer data on both the rising and falling edges of the clock .Thus ,a DDR chip with a 133 MHz clock input can transfer data at a speedy 266MHZ.The parts are designated ,for reasons unknown,,as PC2700 devices .Any SDRAM chip capable of conforming to a 266MHZ clock rate are PC2700.
Modern DRAM design takes many different forms.We’ve been discussing SDRAM becau this is the most common form of DRAM in a modern PC.Your graphics card contains video DRAM.Older P
C’s contained extended data out ,or EDO DRAM.Today ,the most common type of SDRAM is DDR SDRAM .The amazing thing about all of this is the incredibly low cost of this type of memory .At this writing (summer of 2004),you can
chilled
purcha 512 Mbytes of SDRAM for about 10 cents per megabyte .A memory with the same capacity ,built in static RAM would cost well over $2,000.
Memory -to -Processor Interface
The last typic that we’ll tackle in this chapter involves the details of how the memory system and the processor communicate with each other .Admittedly ,we can only scratch the surface becau there are so many variation on a theme when there are over 300 commercially available microprocessor families in the world today ,but let's try to take general overview without getting too deeply enmeshed in individual differences .
In general, most microprocessor-bad systems contain three major bus groupings : Address bus :A unidirectional bus from the process out memory .
Data bus : A bi-directional bus carrying data from memory to the processor during read operations and from the processor to memory during write operations.
毕竟英语
Status bus : A heterogeneous bus comprid of the various control and houkeeping signals need to coordinate t operation of the processor,its memory and other peripheral devices .Typical status bus signal include :
a. RESET
b. interrupt management
c. clock signals
d. read and write signals
e. read and write signals
This is shown schematically in Figure 6.22 for the Motorola MC68000 processor .The 68000 has a 24-bit address bus and a 16-bit external data bus .However ,internally ,both address and data can be up to 32 bits in length.We’ll discuss the interrupt system and bus management system later on in this ction.
The Data Bus is also homogeneous, but it is bidirectional.Data goes out from memory to the process
or on a read operation and from the processor to memory on a write operation, Thus, data can flow in either direction , depending upon the instruction being executed.
The Status Bus is heterogeneous.It is made up of different kinds of signal,so we can’t group the some are bidirectional.The Status Bus is the “houkeeping”bus.All of the signals that are also needed to control system operation are grouped into the Status Bus.
Let’s now look at how the signals on the bus work together with memory so that we may read and write.Figure 6.23 shows us the processor side of the memory interface.
Now we can e how the processor and the clock work together to quence the accessing of the memory data.While it may em quite bewildering at fires .It is actually many additional signals that may prent or abnt in various processor designs and tried to restrict our discussion to the bare esntials .
The Y-axis shows the various signals coming from the processor.In order to simplify things ,we’ve grouped all the signals for the address bus and the data bus into a “band”of signals.That way ,at any given time ,we can assume that some are 1 and some are 0,but the key is that we must specify when they are valid.The crossings,or X’s in the
address and data bus may be changing ,such as an address changing to a new value,or data coming from the processor .
Since the microprocessor is a state machine,everything is synchronized with the edges of the clock.Some events occur on the positive going edges and some may be synchronized with the negative going edges .Also.for convenience.we’ll divide the bus cycles into identifiable time signatures called ‘T states.” Not all processors work this way ,but this is a reasonable approximation of how may processors actually work.Keep in mind that the processor is always running the bus cycles.The operations form it Fundamental method of data exchange between the processor and memory.Therefore ,we can answer a question that was pod at the beginning of this chapter.Recall that the state machine truth table for the operation ,ADD B,A left out any explanation of how the data got into the registers in the first place.and how the instruction itlf got into the computer.
Thus, before we look at the timing diagram for the processor/memory interface, we need to remind ourlves that the control of this interface is handled by another part of our state machine.In algo-rithmic terms,we do a “function call”to the portion of the state machine that handles the memory interface,and the data is read or written by that algorithm.
stand
Let’s start with a READ cycle .During the falling edge of the clock in T1the address becomes stable and the ADDR VLA signal is asrted LOW.Also ,the RD signal goes LOW to indicate that this is a read operation .During the falling edge of T3 the READ and ADDRESS V ALID signals are de-asrted indicating to memory that that the cycle is ending and the data from memory is being read by the processor.Thus ,the memory must be able to provide the data to the processor within two full clock cycles(all of T2 plus half of T1 and half of T3).
Suppo the memory isn’t fast enough to guarantee that the data will be ready in time.We discusd this situation for the ca of the NEC static RAM chip and decided that a possible solution would be to slow the processor clock until the access time requirements for the memory could be guaranteed to be within specs.Now we will consider another alternative.In this scenario,the memory system may asrt the WAIT signal back to the processor.The processor checks the state of the WAIT signal on the on the falling edge of the clock during T2cycle.If the WAIT sinnal is asrted ,the processor generates another T2 cycle and checks again.As long as the WAIT signal is LOW,the processor keeps marking time in T2,Only when WAIT goes high will the processor complete the bus cycle.This is called a wait state ,and is ud to synchronize slower memory to faster processors .
The write cycle is similar to the read cycle.During the falling edge of the clock in T1 the address beco
mes valid.During the rising edge of the clock in T2 the data to be written is put on the data bus and the write signal goes low,indicating a memory write operation .WATI signal has the same function in T2 on the write cycle.During the falling edge of the clock in T3 the WRsignal is de-asrted ,giving the memory a rising edge to store the data .ADDR V AL also is de-asrted and the write cycle ends .
There are veral interesting concepts buried in the previous discussion that require some explanation before we move on .The first is the idea of a state machine that operates on both edges of the clock.so let’s consider that first .When we input single clock signal to the processor in order to synchronize tis internal operations,we d on’t really e what happens to the internal clock .
Many processors will internally convert te clock to a 2-pha clock .A timing diagram for a 2-pha clock is shown in Figure 6.24.
The input clock ,which is generated by an external oscillator ,is converted to a 2-pha clock,labeled 1 and 2 .The two clock phas now 180 degrees out of pha from each other,so that every rising or falling edge of the CLKIN signal generates an internal rising clock edge.How could we generate a 2-pha clock?You actually already know how to do it, but there’s a piece of information that we first need to place in context.Figure 6.25 is a circuit that can be ud to generate a 2-pha clock .
dear basketball>bkb是什么The 4 XOR gates are convenient to u becau there is a common integrated circuit part which contains 4 Xor gates in one package.The circuit makes u of the propagation delays that are inherent in a logic gate .Suppo that each XOR gate has a propagation delay of 10ns .Assume that the clock input is LOW.One input of XOR gates 1 through 3 is permanently ties to ground (logic LOW).Since both inputs of gate 1 are LOW ,tis output is also LOW.This situation carries through to gates 2,3and 4.Now ,the CLKIN input goes to logic state HIGH.The output of gate 4goeshigh 10ns later and toggles the D-FF to change state.Since the Qand Q outputs are opposite each other,we conveniently have a source of two alternationg clock phas by nature of the divide-by-two wiring of the D-FF.
After a propagation delay of 30ns the output of tate 3 also goes HIGH,which caus the output of XOR gate 4 to go LOW again becau the output of an XOR gate is LOW if both inputs are the same and HIGH if the inputs are different .At some time later,the clock input goes low again and we generate nother 30ns wide positive going pul at the output of gate 4 becau for 30ns both outputs are different .This cau the D-FF to toggle at both edges of the clock and the Q and Q outputs give us the alternating phas that we need.Figure6.26 shows the relevant waveforms .
This circuit works for any clock frequency that has a period greater than 4 XOR gate delays .Also ,by
专业四级成绩查询
using both outputs of the D-FF,we are guaranteed a two-pha clock output that is exactly 180 degrees out of pha from each other.
弗里特Now we can revisit Figure6.23 and e the other subtle point that was buried in the diagram.Since we are apparently changing states on the rising and falling edges of the clock ,we now know that the internal state machine of the processor is actually using a 2-pha clock and each of the ‘T’states is ,in reality,two states.Thus ,we can redraw the timing diagram for a READ cycle as a state diagram.This will clearly demonstrate the READ pha of the bus cycle,reprent as a state diagram.
Referring to Figure 6.27 we can clearly e that in state T20the processor tests the state of the WAIT input.If the input is asrted LOW ,the processor remains in state T20 ,effectively lengthening the total time for the bus cycle.The advantage of the wait state
over decreasing the clock frequency is that we can design our system such that a wait penalty is incurred only when the processor access certain memory regions,rather than slowing it for all operations.We can now summarize the entire bus READ cycle as follows.
T10: READ cycle begins.Processor outputs new memory address for READ operation .
T11: Address is now stable and ADV AL goes LOWRD goes low indicating that a READ cycle is beginning
T20: READ cycle continues.
T21: Process samples WAIT input .If asrted T21 cycle continues.
T30: READ cycle continues.
T31: READ cycles terminates.ADV AL and RD are de-asrted and processor inputs the data form memory .
Direct Memory Access(DMA)
We’ll conclude Chapter 6with a brief discussion of another form of memory access called DMA,or direct memory access.The need for a DMA system is a result of the fact that memory system and the processor are connected to each other by bus.Since the bus is the only path in and out of flicts will ari when peripheral devices,such as disk drives or network cards have data for the processor,but the processor is busy executing program code .
japanegirlswet 16In many systems, the peripheral devices and memory share the same bus with the processor .When a device,such as a hard disk drive needs to transfer data to the processor,we could imagine two scenarios .