ADSP-TS201 TigerSHARC® Processor
Hardware Reference

Revision 1.1, December 2004
Part Number
82-000815-01

Analog Devices, Inc.
One Technology Way
Norwood, Mass. 02062-9106
Copyright Information

© 2004 Analog Devices, Inc., ALL RIGHTS RESERVED. This document may not be reproduced in any form without prior, express written consent from Analog Devices, Inc.

Printed in the USA.

Disclaimer

Analog Devices, Inc. reserves the right to change this product without prior notice. Information furnished by Analog Devices is believed to be accurate and reliable. However, no responsibility is assumed by Analog Devices for its use; nor for any infringement of patents or other rights of third parties which may result from its use. No license is granted by implication or otherwise under the patent rights of Analog Devices, Inc.

Trademark and Service Mark Notice

The Analog Devices logo, Blackfin, EZ-ICE, SHARC, TigerSHARC, the TigerSHARC logo, and VisualDSP++, and are registered trademarks of Analog Devices, Inc.

Static Superscalar is a trademark of Analog Devices, Inc.

All other brand and product names are trademarks or service marks of their respective owners.
CONTENTS

PREFACE

Purpose of This Manual ................................................................. xxi
Intended Audience ......................................................................... xxi
Manual Contents .......................................................................... xxii
What’s New in This Manual ......................................................... xxiv
Technical or Customer Support ..................................................... xxv
Supported Processors .................................................................... xxvi
Product Information ................................................................... xxvii
  MyAnalog.com ...................................................................... xxvii
  Processor Product Information ................................................. xxvii
Related Documents .................................................................... xxviii
  Online Technical Documentation ............................................. xxix
  Accessing Documentation From VisualDSP++ ....................... xxx
  Accessing Documentation From Windows ................................. xxx
  Accessing Documentation From the Web ................................. xxxi
Printed Manuals ....................................................................... xxxi
CONTENTS

VisualDSP++ Documentation Set ........................................... xxxi
Hardware Tools Manuals .................................................... xxxii
Processor Manuals ............................................................ xxxii
Data Sheets ...................................................................... xxxii
Conventions ........................................................................ xxxiii

PROCESSOR ARCHITECTURE

Processor Core ........................................................................ 1-8
  Compute Blocks .................................................................... 1-10
    Arithmetic Logic Unit (ALU) ............................................ 1-12
    Communications Logic Unit (CLU) ................................. 1-12
    Multiply Accumulator (Multiplier) ............................... 1-12
    Bit Wise Barrel Shifter (Shifter) .................................... 1-13
  Integer Arithmetic Logic Unit (IALU) .............................. 1-13
  Program Sequencer .......................................................... 1-15
 Processor Core Controls ...................................................... 1-17
  Clock Domains ................................................................ 1-17
  Operation Modes ............................................................ 1-18
    User Mode ................................................................. 1-19
    Supervisor Mode .......................................................... 1-20
    Emulator Mode ............................................................ 1-20
  Boot Modes ................................................................. 1-22
 Memory, Registers, and Buses ............................................. 1-22
  Internal Buses .................................................................. 1-23
  Internal Registers ........................................................... 1-23
## CONTENTS

Flag Control (FLAGREG) Register ........................................ 2-28
Sequencer Control (SQCTL) Register .................................. 2-29
Static Flags (SFREG) Register .......................................... 2-30
Sequencer Control Set Bits (SQCTLST) Register ................. 2-31
Sequencer Control Clear Bits (SQCTLCL) Register ............. 2-31
Sequencer Status (SQSTAT) Register ................................ 2-31
Cache Register Groups ...................................................... 2-34
Cache/Memory System Command/Status
  (CACMDx, CCAIRx, and CASTATx) Registers ....................... 2-36
Interrupt Register Groups .............................................. 2-41
  Interrupt Vector Table Register Groups .......................... 2-41
  Interrupt Control (INTCTL) Register ............................ 2-45
  Interrupt Latch (ILATL/ILATH) Registers ....................... 2-46
  Interrupt Pointer Mask (PMASKL/PMASKH) Registers ....... 2-47
  Interrupt Mask (IMASKL/IMASKH) Registers .................... 2-47
  Timer Interrupt (TIMERxH/L) Registers ......................... 2-50
  Timer Initial Value (TMRINxH/L) Registers .................... 2-50
DMA Control and Status Register Group ............................. 2-51
  DMA Control (DCNT) Register .................................... 2-51
  DMA Status (DSTAT) Register .................................... 2-53
External Port DMA TCB Register Group ............................... 2-56
Link Port Transmit DMA TCB Register Group ....................... 2-57
Link Port Receive DMA TCB Register Group ....................... 2-58
AutoDMA Register Group ............................................... 2-60
<table>
<thead>
<tr>
<th>Section</th>
<th>Page</th>
</tr>
</thead>
<tbody>
<tr>
<td>DMA Index (DI), X-Dimension (DX), Y-Dimension (DY), and Pointer (DP) TCB Registers</td>
<td>2-61</td>
</tr>
<tr>
<td>Link Port Register Groups</td>
<td>2-65</td>
</tr>
<tr>
<td>Link Receive Control (LRCTLx) Registers</td>
<td>2-68</td>
</tr>
<tr>
<td>Link Transmit Control (LTCTLx) Registers</td>
<td>2-69</td>
</tr>
<tr>
<td>Link Receive Status (LRSTATx) Registers</td>
<td>2-70</td>
</tr>
<tr>
<td>Link Transmit Status (LTSTATx) Registers</td>
<td>2-71</td>
</tr>
<tr>
<td>External Bus Interface Register Groups</td>
<td>2-72</td>
</tr>
<tr>
<td>System Configuration (SYSCON) Register</td>
<td>2-73</td>
</tr>
<tr>
<td>Bus Lock (BUSLOCK) System Control Register</td>
<td>2-76</td>
</tr>
<tr>
<td>SDRAM Configuration (SDRCON) Register</td>
<td>2-76</td>
</tr>
<tr>
<td>System Status (SYSTAT) Register</td>
<td>2-77</td>
</tr>
<tr>
<td>Bus Master Maximum (BMAX) Register</td>
<td>2-80</td>
</tr>
<tr>
<td>Bus Master Maximum Current Count (BMAXC) Register</td>
<td>2-80</td>
</tr>
<tr>
<td>JTAG Test and Emulation Register Groups</td>
<td>2-81</td>
</tr>
<tr>
<td>Emulation Control (EMUCTL) Register</td>
<td>2-82</td>
</tr>
<tr>
<td>Emulation Status ( EMUSTAT ) Register</td>
<td>2-83</td>
</tr>
<tr>
<td>Silicon Version and ID Code (IDCODE) Register</td>
<td>2-83</td>
</tr>
<tr>
<td>Debug Register Groups</td>
<td>2-84</td>
</tr>
<tr>
<td>Watchpoint Control (WPxCTL) Registers</td>
<td>2-86</td>
</tr>
<tr>
<td>Watchpoint Status (WPxSTAT) Registers</td>
<td>2-89</td>
</tr>
<tr>
<td>Watchpoint Address Pointer (WPxL/WPxH) Registers</td>
<td>2-89</td>
</tr>
<tr>
<td>Performance Monitor Mask (PRFM) Register</td>
<td>2-89</td>
</tr>
<tr>
<td>Performance Monitor Counter (PRFCNT) Register</td>
<td>2-92</td>
</tr>
<tr>
<td>Cycle Counter (CCNTx) Registers</td>
<td>2-92</td>
</tr>
</tbody>
</table>
Contents

Trace Buffers and Pointer (TRCBx/TRCBPTR) Registers  2-93

SOC Interface

SOC Interface Operations  3-5
SOC OFIFO Transactions  3-5
SOC IFIFO/OBUF Transactions  3-7
SOC Interface Programming  3-8

Timers

Timer Operations  4-2
Timer Programming  4-4
Timer Expired (TMR0E) Pin and Signal Timing  4-8

Flags

Flag Operations  5-1
Flag Programming  5-3
Flag (FLAG3-0) Pin and Signal Timing  5-10

Interrupts

Interrupt Operations  6-5
Interrupt Service Routines  6-8
Handling Interrupts  6-13
Returning From an Interrupt  6-14
Handling Exceptions  6-15
Interrupt Vector Table  6-17
Timer Expired Interrupts  6-19
CONTENTS

Link Port Service Request Interrupts ........................................... 6-20
DMA Complete Interrupts .......................................................... 6-20
External (IRQ3–0) Input Interrupts .......................................... 6-20
Vector Interrupt ..................................................................... 6-21
Bus Lock Interrupt Register .................................................... 6-21
 Hardware Error Interrupt ....................................................... 6-22
Software Exception Interrupts ................................................. 6-23
Emulation Exception Interrupt ............................................... 6-24
Interrupt Programming ............................................................. 6-25
Interrupt (IRQ3-0) Pin and Signal Timing ................................. 6-32

DIRECT MEMORY ACCESS

DMA Controller Features .......................................................... 7-8
Cluster Bus Transfers ............................................................... 7-8
AutoDMA Transfers ............................................................... 7-10
Link Port Transfers ............................................................... 7-10
Two-Dimensional DMA .......................................................... 7-11
Chained DMA ...................................................................... 7-11
DMA Architecture ................................................................. 7-12
DMA Request (DMAR3-0) Pins .............................................. 7-13
Terminology .......................................................................... 7-14
Setting Up DMA Transfers ....................................................... 7-15
DMA Transfer Control Block Registers .................................. 7-15
DMA Channel Control ............................................................ 7-16
Transfer Control Block (TCB) Registers ................................. 7-16
CONTENTS

DIx Register ................................................................. 7-18
DXx Register ................................................................. 7-18
DYx Register ................................................................. 7-19
DPx Register ................................................................. 7-19
DMA Control and Status Registers ............................................. 7-23
DMA Status Register (DSTAT/DSTATC) .................................. 7-23
DMA Control Registers .......................................................... 7-26
DCNT Register ................................................................. 7-26
DCNTST Register .............................................................. 7-27
DCNTCL Register .............................................................. 7-27
DMA Control Register Restrictions ........................................... 7-27
DMA Controller Operations ....................................................... 7-31
Link Port DMA Control ....................................................... 7-31
External Port DMA Control ..................................................... 7-32
AutoDMA Register Control ..................................................... 7-32
DMA Transfers ................................................................. 7-33
Internal Memory Buses ......................................................... 7-33
DMA Channels ................................................................. 7-34
DMA Memory Addressing ....................................................... 7-34
DMA Channel Prioritization ..................................................... 7-36
Internal Memory Bus Priority ..................................................... 7-36
DMA Channel Priority ......................................................... 7-36
Rotating Priority ............................................................... 7-38
DMA Chaining ................................................................. 7-40
Enabling and Disabling Chaining ........................................... 7-42
Generating a DMA Complete Interrupt While Chaining .... 7-42
Transfer Control Blocks and Chain Loading ....................... 7-42
Setting Up and Starting the Chain ........................................ 7-43
Chain Insertion .................................................................... 7-43
Two-Dimensional DMA .......................................................... 7-44
Two-Dimensional DMA Channel Organization ................. 7-45
Two-Dimensional DMA Operation ...................................... 7-46
Examples of Two Dimensional DMA Transfers .......... 7-48
DMA Interrupts .................................................................... 7-49
Starting and Stopping DMA Sequences ............................... 7-51
Starting a DMA Sequence .................................................. 7-51
Ending a DMA Sequence ................................................... 7-52
Suspending a DMA Sequence ............................................. 7-52
Resuming a DMA Sequence ............................................... 7-52
External Port DMA ............................................................... 7-52
Internal and External Address Generation ....................... 7-53
External Port DMA Transfer Types ..................................... 7-54
External to Internal Memory ............................................. 7-54
Internal to External Memory ............................................ 7-57
External I/O Device to External Memory (Flyby) ............... 7-60
External Memory to External I/O Device (Flyby) .......... 7-62
DMA Semaphores ............................................................. 7-63
Handshake Mode ............................................................... 7-64
CONTENTS

Flyby Mode ................................................................. 7-65
Link Port DMA ............................................................. 7-66
Link Ports DMA Transfer Types ............................... 7-66
  Link Port to Internal/External Memory ............... 7-66
  Internal/External Memory to Link ...................... 7-68
  Receiving Link Port to Link Port ...................... 7-69
DMA Throughput ......................................................... 7-71
  Internal Memory DMA ....................................... 7-71
  External Memory DMA ....................................... 7-71
DMA Operation on Boot .............................................. 7-72
DMA Examples ........................................................... 7-73

EXTERNAL PORT AND SDRAM INTERFACE

  External Bus Features ........................................... 8-2
  Bus Interface I/O Pins ........................................... 8-3
  Processor Microarchitecture ............................... 8-5
  SYSCon Register Programming ............................ 8-13
    Bus Width ....................................................... 8-13
    Slow Device Protocol for Bus ......................... 8-14
    Pipelined Protocol for Bus .............................. 8-14
    Initial Value for Bus Operation ..................... 8-15
    Pipelined Protocol Interface ......................... 8-15
    Control Signals for Pipelined Transactions ........ 8-16
    Using Basic Pipelined Transactions ................ 8-16
    Using Burst Pipelined Transactions ................. 8-18
## CONTENTS

- **Wait Cycles** ................................................................. 8-23
- **Slow Device Protocol Interface** ........................................ 8-26
- **EPROM Interface** .......................................................... 8-32
- **Multiprocessing Interface** ................................................. 8-34
  - **Bus Arbitration Protocol** .................................................. 8-38
    - **Core Priority Access (CPA) Pin** ..................................... 8-40
    - **DMA Priority Access (DPA) Pin** ..................................... 8-42
    - **Bus Fairness (BMAX) Register** ....................................... 8-44
    - **Bus Lock (BUSLOCK) Register** ...................................... 8-44
    - **Software (Processor Core) Reset Operation** ..................... 8-45
- **Host Processor Interface** .................................................. 8-46
- **Back Off (BOFF) Pin** ...................................................... 8-48
- **SDRAM Interface** ............................................................ 8-50
  - **SDRAM Interface I/O Pins** .............................................. 8-55
  - **SDRAM Physical Connection** ............................................ 8-56
    - **Bank Select Pins** .......................................................... 8-59
    - **Internal Address and SDRAM Physical Connection** ............ 8-60
- **SDRAM Programming** ...................................................... 8-67
  - **Enabling SDRAM** .......................................................... 8-68
  - **Selecting the CAS Latency Value – CAS** ........................ 8-69
  - **Setting the SDRAM Buffering Option – Pipeline Depth** ....... 8-69
  - **Selecting the SDRAM Page Size – Page Boundary** .............. 8-70
  - **Setting the Refresh Counter Value – Refresh Rate** .............. 8-70
  - **Selecting the Precharge to RAS Delay** ............................ 8-72
LINK PORTS

Link Architecture ................................................................. 9-2
Link I/O Pins ........................................................................ 9-4
Link Transmit and Receive Data ........................................... 9-7
Link DMA ............................................................................. 9-9
Link Block Completion ....................................................... 9-9
Link Interrupts ................................................................. 9-10
Link Reset Initialization and Boot ..................................... 9-10
Link Alternate (Software) Initialization ......................... 9-11
Link Port Communication Protocol ............................... 9-12
Link Port Transmission Delays ......................................... 9-17
Link Port Error Detection Mechanisms ......................... 9-19
Link Transmitter Timeout ................................................. 9-20
Link Receiver Timeout .................................................... 9-20
Link Verification Error ..................................................... 9-20
Link Transmit/Receive Write Error ................................. 9-21
Link Port Control Registers ............................................. 9-21
Link Port Status Registers .............................................. 9-26

JTAG PORT AND TEST/DEBUG INTERFACE

Operating Modes ............................................................. 10-4
Debug Resources ............................................................ 10-5
Special Instructions ......................................................... 10-5
Watchpoints .................................................................. 10-5
## Contents

**Programming – Control and Address Pointer Registers** ........................................ 10-5  
Watchpoint Operation ........................................................................................................ 10-6  
Watchpoint Status (WPiSTAT) ......................................................................................... 10-7  
Instruction Address Trace Buffer (TBUF) ....................................................................... 10-8  
Performance Monitors ..................................................................................................... 10-8  
Cycle Counter (CCNT1–0) .............................................................................................. 10-9  
Performance Monitor Mask – PRFM ............................................................................. 10-9  
Performance Monitor Counter ....................................................................................... 10-10  
JTAG Functionality ......................................................................................................... 10-11  
JTAG Port I/O Pins ......................................................................................................... 10-11  
JTAG Instruction Register .............................................................................................. 10-13  
Data Registers .................................................................................................................. 10-13  

### System Design

Processor Booting Methods ............................................................................................... 11-2  
Using Boot Modes and Boot Loader Kernels .................................................................. 11-3  
Executing a Processor Boot Operation .......................................................................... 11-3  
EPROM Boot .................................................................................................................... 11-4  
Host Boot .......................................................................................................................... 11-5  
Link Port Boot .................................................................................................................. 11-6  
No Boot ............................................................................................................................... 11-6  
Booting References ......................................................................................................... 11-8  
Hardware System Design Guidelines ............................................................................ 11-8  
Power Supplies ................................................................................................................ 11-9  
System Clock (SCLK) Pin ............................................................................................... 11-10
CONTENTS

Considerations for SCLK Design .............................................. 11-10
Considerations for SCLK Distribution .............................. 11-11
Boundary Scan and Emulator Pins ..................................... 11-15
External Ports Pins .............................................................. 11-15
Link Ports Pins ................................................................. 11-17
Hardware Design References ............................................. 11-17
Processor Thermal Relief Methods .................................. 11-18
Thermal Overview ............................................................... 11-19
Thermal Calculations .......................................................... 11-21
Heat Sink Basics ................................................................. 11-23
Pin Fins versus Rectangular Fins ......................................... 11-23
PCB Design for Thermal Dissipation .................................. 11-24
Thermal Simulations ........................................................... 11-25
Thermal Design Terminology .......................................... 11-26
Thermal Design References ............................................... 11-27
Other Design and Test References ................................... 11-28
ADSP-TS201 Processor EZ-KIT Lite ................................. 11-28
ADSP-TS201S MP System Simulation and Analysis
Cluster Bus Topology – Signal Integrity and PCB Design
Considerations ................................................................. 11-29
IBIS Models ................................................................. 11-29
FPGA Link Port Support .................................................. 11-30
Recommended Reading References ........................... 11-31

INDEX
Thank you for purchasing and developing systems using TigerSHARC® processors from Analog Devices.

Purpose of This Manual

The *ADSP-TS201 TigerSHARC Processor Hardware Reference* contains information about the DSP architecture and DSP system design for TigerSHARC processors. These are 32-bit, fixed- and floating-point digital signal processors from Analog Devices for use in computing, communications, and consumer applications.

The manual provides information on how the processor core and I/O peripherals operate in the TigerSHARC processor’s architecture along with reference information about I/O peripheral features.

Intended Audience

The primary audience for this manual is a system developer who is familiar with Analog Devices processors. This manual assumes that the audience has a working knowledge of the appropriate processor architecture and microprocessor system design. Programmers who are unfamiliar with Analog Devices processors can use this manual, but should supplement it with other texts (such as the appropriate programming reference manuals and data sheets) that describe your target architecture.
Manual Contents

The manual consists of:

- **Chapter 1, Processor Architecture**
  This chapter provides an architectural overview of the TigerSHARC processor.

- **Chapter 2, Memory and Registers**
  This chapter defines the memory map of the ADSP-TS201 TigerSHARC processor. The memory space defines the location of each element on the TigerSHARC processor.

- **Chapter 3, SOC Interface**
  This chapter discusses clocking inputs, including the three different types of operating modes in which the ADSP-TS201 TigerSHARC processor can operate and the boot modes from which the TigerSHARC processor initiates.

- **Chapter 4, Timers**
  This chapter discusses clocking inputs, including the three different types of operating modes in which the ADSP-TS201 TigerSHARC processor can operate and the boot modes from which the TigerSHARC processor initiates.

- **Chapter 5, Flags**
  This chapter discusses clocking inputs, including the three different types of operating modes in which the ADSP-TS201 TigerSHARC processor can operate and the boot modes from which the TigerSHARC processor initiates.

- **Chapter 6, Interrupts**
  This chapter discusses the various types of interrupts supported by the ADSP-TS201 TigerSHARC processor. Some of the interrupts are generated internally or externally.
Preface

- Chapter 7, Direct Memory Access
  This chapter describes how the ADSP-TS201 TigerSHARC processor’s on-chip DMA controller acts as a machine for transferring data without core interruption.

- Chapter 8, External Port and SDRAM Interface
  This chapter focuses on the external bus interface of the ADSP-TS201 TigerSHARC processor, which includes the bus arbitration logic and the external address, data and control buses, and interface to SDRAM devices.

- Chapter 9, Link Ports
  This chapter describes how link ports provide point-to-point communications between ADSP-TS201 TigerSHARC processors in a system. The Link ports can also be used to interface with any other device that is designed to work in the same protocol.

- Chapter 10, JTAG Port and Test/Debug Interface
  This chapter describes features of the ADSP-TS201 TigerSHARC processor that are useful for performing software debugging and services usually found in Operating System (OS) kernels.

- Chapter 11, System Design
  This chapter describes system features of the ADSP-TS201 TigerSHARC processor. These include Power, Reset, Clock, JTAG, and Booting, as well as pin descriptions and other system level information.

This hardware reference is a companion document to the *ADSP-TS201 TigerSHARC Processor Programming Reference*. 
What’s New in This Manual

Revision 1.0 of the *ADSP-TS201 TigerSHARC Processor Hardware Reference* differs in a number of ways from the revision 0.2 book. In revision 1.0, the following additions and corrections have been made:

- The Processor Architecture chapter has replaced the previous Introduction chapter. This new chapter provides a more detailed road map to the processor architecture and processor core mode controls.

- The SOC Interface, Timers, and Flags chapters have been added. A description of the operation of all I/O peripherals as bus masters or slaves on the SOC bus has been added in all chapters.

- The Interrupts chapter has been re-ordered to provide more guidance on using interrupts, and a consolidated interrupt vector table has been added.

- The System Design chapter has been expanded. Many of the Engineer-to-Engineer (EE) Notes to which the revision 0.2 book referred have been added to the revision 1.0 book. The topics added include booting, system design guidelines, and thermal design guidelines.

- The index has been enhanced.

- Errata reports against the revision 0.2 book have been corrected.
Technical or Customer Support

You can reach Analog Devices, Inc. Customer Support in the following ways:

- E-mail tools questions to dsptools.support@analog.com
- E-mail processor questions to dsp.support@analog.com
- Phone questions to 1-800-ANALOGD
- Contact your Analog Devices, Inc. local sales office or authorized distributor
- Send questions by mail to:
  Analog Devices, Inc.
  One Technology Way
  P.O. Box 9106
  Norwood, MA 02062-9106
  USA
Supported Processors

The following is the list of Analog Devices, Inc. processors supported in VisualDSP++®.

**TigerSHARC (ADSP-TSxxx) Processors**

The name “TigerSHARC” refers to a family of floating-point and fixed-point [8-bit, 16-bit, and 32-bit] processors. VisualDSP++ currently supports the following TigerSHARC processors:

ADSP-TS101, ADSP-TS201, ADSP-TS202, and ADSP-TS203

**SHARC® (ADSP-21xxx) Processors**

The name “SHARC” refers to a family of high-performance, 32-bit, floating-point processors that can be used in speech, sound, graphics, and imaging applications. VisualDSP++ currently supports the following SHARC processors:

ADSP-21020, ADSP-21060, ADSP-21061, ADSP-21062, ADSP-21065L, ADSP-21160, ADSP-21161, ADSP-21261, ADSP-21262, ADSP-21266, ADSP-21267, ADSP-21363, ADSP-21364, and ADSP-21365

**Blackfin® (ADSP-BFxxx) Processors**

The name “Blackfin” refers to a family of 16-bit, embedded processors. VisualDSP++ currently supports the following Blackfin processors:

ADSP-BF531, ADSP-BF532 (formerly ADSP-21532), ADSP-BF533, ADSP-BF535 (formerly ADSP-21535), ADSP-BF561, AD6532, and AD90747
Product Information

You can obtain product information from the Analog Devices Web site, from the product CD-ROM, or from the printed publications (manuals).

Analog Devices is online at www.analog.com. Our Web site provides information about a broad range of products—analogue integrated circuits, amplifiers, converters, and digital signal processors.

MyAnalog.com

MyAnalog.com is a free feature of the Analog Devices Web site that allows customization of a Web page to display only the latest information on products you are interested in. You can also choose to receive weekly e-mail notifications containing updates to the Web pages that meet your interests. MyAnalog.com provides access to books, application notes, data sheets, code examples, and more.

Registration

Visit www.myanalog.com to sign up. Click Register to use MyAnalog.com. Registration takes about five minutes and serves as a means to select the information you want to receive.

If you are already a registered user, just log on. Your user name is your e-mail address.

Processor Product Information

For information on embedded processors and DSPs, visit our Web site at www.analog.com/processors, which provides access to technical publications, data sheets, application notes, product overviews, and product announcements.
Product Information

You may also obtain additional information about Analog Devices and its products in any of the following ways.

- E-mail questions or requests for information to dsp.support@analog.com
- Fax questions or requests for information to 1-781-461-3010 (North America) 089/76 903-557 (Europe)

Related Documents

The following publications that describe the ADSP-TS201 TigerSHARC processor (and related processors) can be ordered from any Analog Devices sales office:

- ADSP-TS201S TigerSHARC Embedded Processor Data Sheet
- ADSP-TS202S TigerSHARC Embedded Processor Data Sheet
- ADSP-TS203S TigerSHARC Embedded Processor Data Sheet
- ADSP-TS201 TigerSHARC Processor Hardware Reference
- ADSP-TS201 TigerSHARC Processor Programming Reference

For information on product related development software and Analog Devices processors, see these publications:

- VisualDSP++ C/C++ Compiler and Library Manual for TigerSHARC Processors
Preface

- *VisualDSP++ Assembler and Preprocessor Manual for TigerSHARC Processors*
- *VisualDSP++ Linker and Utilities Manual for TigerSHARC Processors*

Visit the Technical Library Web site to access all processor and tools manuals and data sheets:

http://www.analog.com/processors/resources/technicalLibrary

**Online Technical Documentation**

Online documentation comprises the VisualDSP++ Help system, software tools manuals, hardware tools manuals, processor manuals, the Dinkum Abridged C++ library, and Flexible License Manager (FlexLM) network license manager software documentation. You can easily search across the entire VisualDSP++ documentation set for any topic of interest. For easy printing, supplementary .PDF files of most manuals are also provided.

Each documentation file type is described as follows.

<table>
<thead>
<tr>
<th>File</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>.CHM</td>
<td>Help system files and manuals in Help format</td>
</tr>
<tr>
<td>.HTM or .HTML</td>
<td>Dinkum Abridged C++ library and FlexLM network license manager software documentation. Viewing and printing the .HTML files requires a browser, such as Internet Explorer 4.0 (or higher).</td>
</tr>
<tr>
<td>.PDF</td>
<td>VisualDSP++ and processor manuals in Portable Documentation Format (PDF). Viewing and printing the .PDF files requires a PDF reader, such as Adobe Acrobat Reader (4.0 or higher).</td>
</tr>
</tbody>
</table>
Product Information

If documentation is not installed on your system as part of the software installation, you can add it from the VisualDSP++ CD-ROM at any time by running the Tools installation. Access the online documentation from the VisualDSP++ environment, Windows® Explorer, or the Analog Devices Web site.

Accessing Documentation From VisualDSP++

From the VisualDSP++ environment:

- Access VisualDSP++ online Help from the Help menu’s Contents, Search, and Index commands.
- Open online Help from context-sensitive user interface items (toolbar buttons, menu commands, and windows).

Accessing Documentation From Windows

In addition to any shortcuts you may have constructed, there are many ways to open VisualDSP++ online Help or the supplementary documentation from Windows.

Help system files (.CHM) are located in the Help folder, and .PDF files are located in the Docs folder of your VisualDSP++ installation CD-ROM. The Docs folder also contains the Dinkum Abridged C++ library and the FlexLM network license manager software documentation.

Using Windows Explorer

- Double-click the vdsp-help.chm file, which is the master Help system, to access all the other .CHM files.
- Double-click any file that is part of the VisualDSP++ documentation set.
Using the Windows Start Button

- Access VisualDSP++ online Help by clicking the Start button and choosing Programs, Analog Devices, VisualDSP++, and VisualDSP++ Documentation.

- Access the .PDF files by clicking the Start button and choosing Programs, Analog Devices, VisualDSP++, Documentation for Printing, and the name of the book.

Accessing Documentation From the Web

Download manuals at the following Web site:
http://www.analog.com/processors/resources/technicalLibrary/manuals

Select a processor family and book title. Download archive (.ZIP) files, one for each manual. Use any archive management software, such as WinZip, to decompress downloaded files.

Printed Manuals

For general questions regarding literature ordering, call the Literature Center at 1-800-ANALOGD (1-800-262-5643) and follow the prompts.

VisualDSP++ Documentation Set

To purchase VisualDSP++ manuals, call 1-603-883-2430. The manuals may be purchased only as a kit.

If you do not have an account with Analog Devices, you are referred to Analog Devices distributors. For information on our distributors, log onto http://www.analog.com/salesdir/continent.asp.
Product Information

Hardware Tools Manuals

To purchase EZ-KIT Lite™ and In-Circuit Emulator (ICE) manuals, call 1-603-883-2430. The manuals may be ordered by title or by product number located on the back cover of each manual.

Processor Manuals

Hardware reference and instruction set reference manuals may be ordered through the Literature Center at 1-800-ANALOGD (1-800-262-5643), or downloaded from the Analog Devices Web site. Manuals may be ordered by title or by product number located on the back cover of each manual.

Data Sheets

All data sheets (preliminary and production) may be downloaded from the Analog Devices Web site. Only production (final) data sheets (Rev. 0, A, B, C, and so on) can be obtained from the Literature Center at 1-800-ANALOGD (1-800-262-5643); they also can be downloaded from the Web site.

To have a data sheet faxed to you, call the Analog Devices Faxback System at 1-800-446-6212. Follow the prompts and a list of data sheet code numbers will be faxed to you. If the data sheet you want is not listed, check for it on the Web site.
# Conventions

Text conventions used in this manual are identified and described as follows.

<table>
<thead>
<tr>
<th>Example</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Close command</td>
<td>Titles in reference sections indicate the location of an item within the VisualDSP++ environment’s menu system (for example, the Close command appears on the File menu).</td>
</tr>
<tr>
<td>{this</td>
<td>that}</td>
</tr>
<tr>
<td>[this</td>
<td>that]</td>
</tr>
<tr>
<td>[this,...]</td>
<td>Optional item lists in syntax descriptions appear within brackets delimited by commas and terminated with an ellipse; read the example as an optional comma-separated list of this.</td>
</tr>
<tr>
<td>.SECTION</td>
<td>Commands, directives, keywords, and feature names are in text with letter gothic font.</td>
</tr>
<tr>
<td>filename</td>
<td>Non-keyword placeholders appear in text with italic style format.</td>
</tr>
</tbody>
</table>

| Note:            | A Note: provides supplementary information on a related topic. In the online version of this book, the word Note appears instead of this symbol.           |

| Caution:          | Incorrect device operation may result if ...                                                                                                     |
|                  | A Caution: identifies conditions or inappropriate usage of the product that could lead to undesirable results or product damage. In the online version of this book, the word Caution appears instead of this symbol. |

| Warning:         | Injury to device users may result if ...                                                                                                       |
|                  | A Warning: identifies conditions or inappropriate usage of the product that could lead to conditions that are potentially hazardous for devices users. In the online version of this book, the word Warning appears instead of this symbol. |

Additional conventions, which apply only to specific chapters, may appear throughout this document.
Conventions
1 PROCESSOR ARCHITECTURE

The *ADSP-TS201 TigerSHARC Processor Hardware Reference* describes the ADSP-TS201 TigerSHARC processor architecture and hardware system support features. These descriptions provide the information required for designing and configuring TigerSHARC processor systems.

As shown in Figure 1-1 and Figure 1-2, the processor architecture consists of two divisions: the processor core (where instructions execute) and the I/O peripherals (where data is stored and off-chip I/O is processed).

This chapter provides a high level description of the processor core and peripherals architecture. More detailed descriptions appear in related chapters. This chapter introduces the following section on processor architecture:

- “Processor Core” on page 1-8
- “Memory, Registers, and Buses” on page 1-22
- “SOC Interface” on page 1-24
- “Timers” on page 1-24
- “Flags” on page 1-25
- “Interrupts” on page 1-26
- “Direct Memory Access” on page 1-26
- “External (Address and Data) Port” on page 1-28
Figure 1-1. ADSP-TS201 TigerSHARC Processor Core Diagram
Figure 1-2. ADSP-TS201 TigerSHARC Processor Peripherals Diagram
The ADSP-TS201 processor is a 128-bit, high performance TigerSHARC processor. The ADSP-TS201 processor sets a new standard of performance for digital signal processors, combining multiple computation units for floating-point and fixed-point processing as well as very wide word widths. The ADSP-TS201 processor maintains a ‘system-on-chip’ scalable computing design philosophy, including 24M bit of on-chip DRAM, six 4K word caches (one per memory block), integrated I/O peripherals, a host processor interface, DMA controllers, LVDS link ports, and shared bus connectivity for glueless multiprocessing.

In addition to providing unprecedented performance in DSP applications in raw MFLOPS and MIPS, the ADSP-TS201 processor boosts performance measures such as MFLOPS/Watt and MFLOPS/square inch in multiprocessing applications.

As shown in Figure 1-1 and Figure 1-2, the processor has the following architectural features:

- Dual computation blocks: X and Y – each consisting of a multiplier, ALU, CLU, shifter, and a 32-word register file
- Dual integer ALUs: J and K – each containing a 32-bit IALU and 32-word register file
- Program sequencer – Controls the program flow and contains an instruction alignment buffer (IAB) and a branch target buffer (BTB)
- Three 128-bit buses providing high bandwidth connectivity between internal memory and the rest of the processor core (compute blocks, IALUs, program sequencer, and SOC interface)
Processor Architecture

- A 128-bit bus providing high bandwidth connectivity between internal memory and external I/O peripherals (DMA, external port, and link ports)

- External port interface including the host interface, SDRAM controller, static pipelined interface, four DMA channels, four LVDS link ports (each with two DMA channels), and multiprocessing support

- 24M bits of internal memory organized as six 4M bit blocks—each block containing 128K words x 32 bits; each block connects to the crossbar through its own buffers and a 128K bit, 4-way set associative cache

- Debug features

- JTAG Test Access Port

Figure 1-3 illustrates a typical single processor system. A multiprocessor system is illustrated in Figure 1-4 and is discussed later in “Multiprocessing” on page 1-30.

The TigerSHARC processor includes several features that simplify system development. The features lie in three key areas:

- Support of IEEE floating-point formats

- IEEE Standard 1149.1 Joint Test Action Group (JTAG) serial scan path and on-chip emulation features

- Architectural features supporting high level languages and operating systems
The features of the TigerSHARC processor architecture that directly support high level language compilers and operating systems include:

- Simple, orthogonal instruction allowing the compiler to efficiently use the multi-instruction slots
- General-purpose data and IALU register files
- 32-bit (IEEE Standard 754/854) and 40-bit floating-point and 8-, 16-, 32-, and 64-bit fixed-point native data types
Processor Architecture

Figure 1-4. Multiprocessing Cluster Configuration

- Large address space
- Immediate address modify fields
- Easily supported relocatable code and data
- Fast save and restore of processor registers onto internal memory stacks
Processor Core

The processor core is the part of the processor that executes instructions. The following discussion provides some details on the processor core architecture. For more information on the processor core, see related chapters in the ADSP-TS201 TigerSHARC Processor Programming Reference. This section describes:

- “Compute Blocks” on page 1-10
  - “Arithmetic Logic Unit (ALU)” on page 1-12
  - “Communications Logic Unit (CLU)” on page 1-12
  - “Multiply Accumulator (Multiplier)” on page 1-12
  - “Bit Wise Barrel Shifter (Shifter)” on page 1-13
- “Integer Arithmetic Logic Unit (IALU)” on page 1-13
- “Program Sequencer” on page 1-15
- “Processor Core Controls” on page 1-17

High performance is facilitated by the ability to execute up to four 32-bit wide instructions per cycle. The TigerSHARC processor uses a variation of a Static Superscalar™ architecture to allow the programmer to specify which instructions are executed in parallel in each cycle. The instructions do not have to be aligned in memory so that program memory is not wasted.

The 24M bit internal memory is divided into six 128K word memory blocks. Each of the four internal address/data bus pairs connect to all of the six memory blocks via a crossbar interface. The six memory blocks support up to four accesses every cycle where each memory block can perform a 128-bit access in a cycle. Each block’s cache and prefetch mechanism improve access performance of internal memory (embedded DRAM).
The external port cluster bus is 64 bits wide. The high I/O bandwidth complements the high processing speeds of the core. To facilitate the high clock rate, the ADSP-TS201 processor uses a pipelined external bus with programmable pipeline depth for interprocessor communications and for Synchronous Flow-through SRAM (SSRAM) and SDRAM.

The four LVDS link ports support point-to-point high bandwidth data transfers. Each link port supports full-duplex communication.

The processor operates with a two cycle arithmetic pipeline. The branch pipeline is four to ten cycles. A branch target buffer (BTB) is implemented to reduce branch delay.

During compute intensive operations, one or both integer ALUs compute or generate addresses for fetching up to two quad operands from two memory blocks, while the program sequencer simultaneously fetches the next quad instruction from a third memory block. In parallel, the computation units can operate on previously fetched operands while the sequencer prepares for a branch.

While the core processor is doing the above, the DMA channels can be replenishing the internal memories in the background with quad data from either the external port or the link ports.

The processing core of the ADSP-TS201 processor reaches exceptionally high DSP performance through using these features:

- Computation pipeline
- Dual computation units
- Execution of up to four instructions per cycle
- Access of up to eight words per cycle from memory

The two identical computation units support floating-point as well as fixed-point arithmetic. These units (compute blocks) perform up to 6 floating-point or 24 fixed-point operations per cycle.
Processor Core

Each multiplier and ALU unit can execute four 16-bit fixed-point operations per cycle, using Single-Instruction, Multiple-Data (SIMD) operation. This operation boosts performance of critical imaging and signal processing applications that use fixed-point data.

Compute Blocks

The TigerSHARC processor core contains two computation units called compute blocks. Each compute block contains a register file and four independent computation units—an ALU, a CLU, a multiplier, and a shifter. For meeting a wide variety of processing needs, the computation units process data in several fixed- and floating-point formats.

These formats are listed here and shown in Figure 1-5:

- **Fixed-point format**
  These include 64-bit long word, 32-bit normal word, 32-bit complex (16-bit real and 16-bit imaginary), 16-bit short word, and 8-bit byte word. For short word fixed-point arithmetic, quad parallel operations on quad-aligned data allow fast processing of array data. Byte operations are also supported for octal-aligned data.

- **Floating-point format**
  These include 32-bit normal word and 40-bit extended word. Floating-point operations are single or extended precision. The normal word floating-point format is the standard IEEE format, and the 40-bit extended-precision format occupies a double word (64 bits) with eight additional least significant bits (LSBs) of mantissa for greater accuracy.

Each compute block has a general-purpose, multiport, 32-word data register file for transferring data between the computation units and the data buses and storing intermediate results. All of these registers can be accessed as single-, double-, or quad-aligned registers. For more information on the register file, see Chapter 2, “Compute Block Registers” in the ADSP-TS201 TigerSHARC Processor Programming Reference.
The TigerSHARC processor internal data buses are 128 bits (one quad word) wide. In a quad word, the processor can move 16 byte words, 8 short words, 4 normal words, or 2 long words over the bus at the same time.

---

**Figure 1-5. Word Format Definitions**

1 The TigerSHARC processor internal data buses are 128 bits (one quad word) wide. In a quad word, the processor can move 16 byte words, 8 short words, 4 normal words, or 2 long words over the bus at the same time.
Processor Core

Arithmetic Logic Unit (ALU)

The ALU performs arithmetic operations on fixed-point and floating-point data and logical operations on fixed-point data. The source and destination of most ALU operations is the compute block register file. For more information on ALU features, see Chapter 3, “ALU” in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Communications Logic Unit (CLU)

On the ADSP-TS201 processor, there is a special purpose compute unit called the communications logic unit (CLU). The CLU instructions are designed to support different algorithms used for communications applications. The algorithms that are supported by the CLU instructions are:

- Viterbi Decoding
- Turbo code Decoding
- Despreading for code division multiple access (CDMA) systems
- Cross correlations used for path search

For more information on CLU features, see Chapter 4, “CLU” in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Multiply Accumulator (Multiplier)

The multiplier performs fixed-point or floating-point multiplication and fixed-point multiply/accumulate operations. The multiplier supports several data types in fixed- and floating-point formats. The floating-point formats are float and float-extended, as in the ALU. The source and destination of most operations is the compute block register file.

The ADSP-TS201 processor’s multiplier supports complex multiply-accumulate operations. Complex numbers are represented by a pair of 16-bit short words within a 32-bit word. The LSBs of the input operand repre-
sent the real part, and the most significant bits (MSBs) of the input operand represent the imaginary part. For more information on multiplier features, see Chapter 5, “Multiplier” in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Bit Wise Barrel Shifter (Shifter)

The shifter performs logical and arithmetic shifts, bit manipulation, field deposit, and field extraction. The shifter operates on one 64-bit, one or two 32-bit, two or four 16-bit, and four or eight 8-bit fixed-point operands. Shifter operations include:

- Shifts and rotates from off-scale left to off-scale right
- Bit manipulation operations, including bit set, clear, toggle and test
- Bit field manipulation operations, including field extract and deposit, using register BFOTMP (which is internal to the shifter)
- Bit FIFO operations to support bit streams with fields of varying length
- Support for ADSP-21000 DSP family compatible fixed-point/floating-point conversion operations (such as exponent extract, number of leading ones or zeros)

For more information on shifter features, see Chapter 6, “Shifter” in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Integer Arithmetic Logic Unit (IALU)

The IALUs can execute standard standalone ALU operations on IALU register files. The IALUs also execute register load, store, and transfer operations, providing memory addresses when data is transferred between memory and registers. The processor has dual IALUs (the J-IALU and the K-IALU) that enable simultaneous addresses for two transactions of up to
128 bits in parallel. The IALUs allow compute operations to execute with maximum efficiency because the computation units can be devoted exclusively to processing data.

Each IALU has a multiport, 32-word register file. All IALU calculations are executed in a single cycle. The IALUs support pre-modify with no update and post-modify with update address generation. Circular data buffers are implemented in hardware. The IALUs support the following types of instructions:

- Regular IALU instructions
- Move Data instructions
- Load Data instructions
- Load/Store instructions with register update
- Load/Store instructions with immediate update

For indirect addressing (instructions with update), one of the registers in the register file can be modified by another register in the file or by an immediate 8- or 32-bit value, either before (pre-modify) or after (post-modify) the access. For circular buffer addressing, a length value can be associated with the first four registers to perform automatic modulo addressing for circular data buffers; the circular buffers can be located at arbitrary boundaries in memory. Circular buffers allow efficient implementation of delay lines and other data structures, which are commonly used in digital filters and Fourier transformations. The ADSP-TS201 processor circular buffers automatically handle address pointer wraparounds, reducing overhead and simplifying implementation.

The IALUs also support bit reverse addressing, which is useful for the FFT algorithm. Bit reverse addressing is implemented using a reverse carry addition that is similar to regular additions, but the carry is taken from the upper bits and is driven into lower bits.
Processor Architecture

The IALU provides flexibility in moving data as single, dual, or quad words. Every instruction can execute with a throughput of one per cycle. IALU instructions execute with a single cycle of latency. Normally, there are no dependency delays between IALU instructions, but if there are, four cycles of latency can occur.

For more information on IALU features, see Chapter 7, “IALU” in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Program Sequencer

The program sequencer supplies instruction addresses to memory and, together with the IALUs, allows compute operations to execute with maximum efficiency. The sequencer supports efficient branching using the branch target buffer (BTB), which reduces branch delays for conditional and unconditional instructions. The two responsibilities of the sequencer are to decode fetched instructions—separating the instruction slots of the instruction line and sending each instruction to its execution unit (compute blocks, IALUs, or sequencer)—and to control the program flow. The sequencer’s control flow instructions divide into two types:

- **Control flow instructions.** These instructions are used to direct program execution by means of jumps and to execute individual instructions conditionally.

- **Immediate extension instructions.** These instructions are used to extend the numeric fields used in immediate operands for the sequencer and the IALU.
Control flow instructions divide into two types:

- Direct jumps and calls based on an immediate address operand specified in the instruction encoding. For example, ‘if <cond> jump 100;’ always jumps to address 100, if the <cond> evaluates as true.

- Indirect jumps based on an address supplied by a register. The instructions used for specifying conditional execution of a line are a subcategory of indirect jumps. For example, ‘if <cond> cjmp;’ is a jump to the address pointed to by the CJMP register.

The control flow instruction must use the first instruction slot in the instruction line.

Immediate extensions are associated with IALU or sequencer (control flow) instructions. These instructions are not specified by the programmer, but are implied by the size of the immediate data used in the instructions. The programmer must place the instruction that requires an immediate extension in the first instruction slot and leave an empty instruction slot in the line (use only three slots), so the assembler can place the immediate extension in the second instruction slot of the instruction line.

Note that only one immediate extension may be in a single instruction line.

The ADSP-TS201 processor achieves its fast execution rate by means of a ten-cycle pipeline.

Two stages of the sequencer’s pipeline actually execute in the computation units. The computation units perform single cycle operations with a two-cycle computation pipeline, meaning that results are available for use two cycles after the operation is begun. Hardware causes a stall if a result is not available in a given cycle (register dependency check). Up to two com-
putation instructions per compute block can be issued in each cycle, instructing the ALU, multiplier, or shifter to perform independent, simultaneous operations.

The branch penalty in a deeply pipelined processor, such as the ADSP-TS201 processor, can be compensated for by using a branch target buffer (BTB) and branch prediction. The branch target address is stored in the BTB. When the address of a jump instruction, which is predicted by the user to be taken in most cases, is recognized (the tag address), the corresponding jump address is read from the BTB and is used as the jump address on the next cycle. Thus, the latency of a jump is reduced from five to nine wasted cycles to zero wasted cycles. If this address is not stored in the BTB, the instruction must be fetched from memory.

Other instructions also use the BTB to speed up these types of branches. These instructions are interrupt return, call return, and computed jump instructions.

For more information on the sequencer, BTB, and immediate extensions, see Chapter 8, “Program Sequencer” in the ADSP-TS201 TigerSHARC Processor Programming Reference.

**Processor Core Controls**

There are few modes and controls required for ADSP-TS201 TigerSHARC processor operation because most operational features are specified as part of instruction syntax. This section describes operational modes for the processor that are not controlled as part of instruction syntax. These controls include clock domain controls, operating mode controls, and boot mode controls.

**Clock Domains**

The processor uses calculated ratios of the input system clock (SCLK) clock to operate as shown in Figure 1-6. The instruction execution rate is equal to the processor core clock (CCLK). A PLL from SCLK generates
CCLK which is phase locked. The link port clocks (\(LxCLKOUT\) pins) are generated from CCLK from a software programmable divisor, and the SOC bus clock (SOCCLK) operates at 1/2 CCLK. Memory transfers to external and link port buffers operate at the SOCCLK rate. The SCLK also provides the clock input for the external bus interface and defines the AC specification reference for the external bus signals. The external bus interface runs at the SCLK frequency.

The \(SCLKRATx\) pins define the clock multiplication of SCLK to CCLK. For information on the \(SCLKRATx\) pins settings and the maximum SCLK frequency, see the \textit{ADSP-TS201 TigerSHARC Embedded Processor Data Sheet}.

The clock domains particularly influence the performance of operations that move data between domains over the processor’s SOC bus. For more information, see “SOC Interface” on page 1-24 and “SOC Interface” on page 3-1.

**Operation Modes**

The TigerSHARC processor operates in one of three modes—user, supervisor, and emulator. In user and supervisor modes, \textit{all} instructions are executed normally. In user mode, however, the register access is limited. Regardless of the operation mode (emulation, supervisor, or user), \textit{all} instructions are executed normally.
The current operating mode of the TigerSHARC processor affects which components of the processor are active and can be accessed. The mode also affects which exceptions are taken and how they are handled. The mode priorities from lowest priority to highest priority are: User Mode, Supervisor Mode, and Emulator Mode.

**User Mode**

The user mode operation is used for algorithms and control code that does not require the manipulation of system resources. Many system resources are not accessible in user mode. If the TigerSHARC processor attempts to access these resources, an exception occurs.

User mode is often used when running code out of an operating system. The operating system kernel runs in supervisor mode, but the user code is restricted to user mode.

The registers that may be accessed by core program in user mode are:

- Register groups 0x00 to 0x09 – compute block registers
- Register groups 0x0C to 0x0F – IALU registers
- Sequencer registers – CJMP, loop counter registers LC0 and LC1, and the static flag register (SFREG)

All other registers cannot be written by the core program in user mode. An attempt to write to a protected register causes an exception. These registers can still be accessed by another master (DMA, external host, and others).
Supervisor Mode

Most code that is not intended to run under an operating system should be designed to run in supervisor mode. Supervisor mode allows the program to access all processor resources. The TigerSHARC processor is in supervisor mode when one of these two conditions is true:

- The NMOD bit in SQCTL is set. For more information, see “Sequencer Control (SQCTL) Register” on page 2-29.
- An interrupt routine is executed—indicated by non-zero PMASK. For more information, see “PMASK Register” on page 6-6.)

Normally when the TigerSHARC processor is reset (via hardware or software), it goes into idle state. It exits the idle state to a running state when an interrupt is issued. The interrupt puts the TigerSHARC processor into supervisor mode.

If the NMOD bit in SQCTL is cleared, the processor enters user mode after leaving an interrupt routine (unless it is nested inside another interrupt routine).

Emulator Mode

Emulator mode is used when controlling the processor with an emulator tool via the JTAG port. The TigerSHARC processor enters emulation mode when an emulation exception is generated. An emulation exception is generated after any one of these events:

- EMUTRAP instruction
- Watchpoint when programmed to cause emulation trap
- JTAG private instruction
- TMS rising while TEME bit in EMUCTL is set

Emulation exceptions are the highest priority exceptions or interrupts.
Processor Architecture

While the TigerSHARC processor is in emulator mode, the only source of instructions is the EMUIR register. The EMUIR register is loaded via the JTAG Test Access Port (TAP). When entering this mode, the external JTAG controller (Analog EZ-ICE or other customer hardware) must be enabled. For more information, see “Sequencer Control (SQCTL) Register” on page 2-29.

When the emulation features are enabled and an emulation exception is encountered, the TigerSHARC processor enters emulation mode. When the TigerSHARC processor is operating in emulator mode, the only way it can exit emulator mode is by executing a return from interrupt (RTI).

In emulator mode, the debug registers (register group 0x1B) can be accessed only by move register-to-register or immediate data load instructions. These registers cannot be loaded from or stored to memory directly.

In emulation mode, the program access to the debug registers (register groups 0x1B and 0x3D) may be executed only using the instruction:

\[ Ureg = Ureg ; \]

The BTB, cycle count, performance monitors, and trace buffer are inactive in emulation mode. Except for if true—RTI (np), all control flow instructions (jump, call, RDS, conditional, and so on.) and the IDLE instruction cannot be used in emulator mode. The RTI instruction can only be used without condition.

If the external reset pin is being asserted, and an external emulation exception is generated, the TigerSHARC processor core exits reset internally, enters emulator mode, and waits to fetch the value of EMUIR, which is loaded from the JTAG. In this case, only the core processor operates. The external interfaces (external port, link ports, DMA, and others) are still held in reset, although their internal registers can be accessed by the instructions inserted via JTAG and EMUIR. This feature enables an EZ-ICE to initialize the TigerSHARC processor internal registers and memory while it is still in reset and start its run from a known state.
**Memory, Registers, and Buses**

**Boot Modes**

The internal memory of the TigerSHARC processor can be boot loaded from an 8-bit EPROM, a host processor, or a link port using a boot mechanism at system power up. The TigerSHARC processor can also be run (no boot) from a memory location at startup. Selection of the boot source is controlled by the BMS strap (external) pins. For more information on boot modes, see “Processor Booting Methods” on page 11-2.

**Memory, Registers, and Buses**

The on-chip memory consists of six blocks of 4M bits each. Each block is 128K words, thus providing high bandwidth sufficient to support both computation units, the instruction stream and external I/O, even in very intensive operations. The ADSP-TS201 processor provides access to program, two data operands, and a system access (over the SOC bus) to different memory blocks without memory or bus constraints. The memory blocks can store instructions and data interchangeably.

Each memory block is organized as 128K words of 32 bits each. The accesses are pipelined to meet one clock cycle access time needed by the core, DMA, or by the external bus. Each access can be up to four words. The six memory blocks are a resource that must be shared between the compute blocks, the IALUs, the sequencer, the external port, and the link ports. In general, if during a particular cycle more than one unit in the processor attempts to access the same memory, one of the competing units is granted access, while the other is held off for further arbitration until the following cycle—see “Bus Arbitration Protocol” on page 8-38. This type of conflict only has a small impact on performance due to the very high bandwidth afforded by the internal buses.
An important benefit of large on-chip memory is that by managing the movement of data on and off chip with DMA, a system designer can realize high levels of determinism in execution time. Predictable and deterministic execution time is a central requirement in DSP and real-time systems.

**Internal Buses**

The processor core has three buses (I-bus, J-bus, and K-bus), each connected to all of the internal memory blocks via a crossbar interface. These buses are 128 bits wide to allow up to four instructions, or four aligned data words, to be transferred in each cycle on each bus. On-chip system elements use the SOC bus and S-bus to access memory. Only one access to each memory block is allowed in each cycle, so if the application accesses a different memory segment for each purpose (instruction fetch, load/store J-IALU and K-IALU instructions and external accesses), all transactions can be executed with no stalls.

Most registers of the ADSP-TS201 processor are classified as universal registers ($Ureg$). Instructions are provided for transferring data between any two $Ureg$ registers, between a $Ureg$ and memory, or for the immediate load of a $Ureg$. This includes control registers and status registers, as well as the data registers in the register files. These transfers occur with the same timing as internal memory load/store. All registers can be accessed by register-move instructions or by external master access (another ADSP-TS201 on the same cluster bus or host), but only the core registers can be accessed by load/store instructions or load-immediate instructions.

**Internal Registers**

Most registers of the ADSP-TS201 processor are classified as universal registers ($Ureg$). Instructions are provided for transferring data between any two $Ureg$ registers, between a $Ureg$ and memory, or for the immediate
load of a \textit{u}reg. This includes control registers and status registers, as well as the data registers in the register files. These transfers occur with the same timing as internal memory load/store.

**SOC Interface**

A separate system on chip (SOC) bus connects the external interfaces (external port, DMA, link ports, JTAG port, and others) to the memory system via the SOC interface and S-bus. This bus has a 128-bit wide data bus and a 32-bit wide address bus. It works at half the processor core clock rate. All data transferred between internal memory or core, and external port (cluster bus, link port, and others) passes through this bus.

As shown in Figure 1-7, the SOC bus and other buses operate at the speed of the clock in their clock domain. Data that moves from one clock domain to another (for example, a DMA to or from internal memory) must pass through arbitration and synchronization at the border of each domain. If multiple masters are requesting a bus, this arbitration can become an important factor in system performance. For more information, see “Clock Domains” on page 1-17 and “SOC Interface” on page 3-1.

**Timers**

The TigerSHARC processor has two general-purpose 64-bit timers—Timer 0 and Timer 1. The timers are free-running counters that are loaded with an initial value and give an indication when expiring. The indication is normally an interrupt, but could also be an external pin (TMR0E) for Timer 0. For more information, see “Timers” on page 4-1.
Flags

There are four input/output flag pins. Each pin can be individually configured to be an input or output. When they are configured as input flag pins, they can be used either as a condition or as a value in the SQSTAT reg-
Interrupts

After powerup reset, \texttt{FLAG3–0} are inputs where static 5 k\textOmega pull-up resistors hold them at logic high value. For more information, see “Flags” on page 5-1.

Interrupts

The ADSP-TS201 processor has four general-purpose external interrupts, \texttt{IRQ3–0}. The processor also has internally generated interrupts for the two timers, DMA channels, link ports, arithmetic exceptions, multiprocessor vector interrupts, and user-defined software interrupts. Interrupts can be nested through instruction commands. Interrupts have a short latency and do not abort currently executing instructions. Interrupts vector directly to a user-supplied address in the interrupt table register file, removing the overhead of a second branch. For more information, see “Interrupts” on page 6-1.

Direct Memory Access

The TigerSHARC processor on-chip DMA controller allows zero-overhead data transfers without processor intervention. The TigerSHARC processor can simultaneously fetch instructions and access two memories for data without relying on data or instruction caches. The DMA controller operates independently of the processor core, supplying addresses for internal and external memory access. The DMA channels, therefore, are not part of the core processor from a programming point of view.

The processor core has four buses, each one connected to one of the internal bus masters (J-IALU, K-IALU, program sequencer, and SOC interface). Each bus can connect to any memory block. These buses are 128 bits wide to allow up to four instructions, or four aligned data words, to be transferred in each cycle on each bus. On-chip system elements also use these buses to access memory. Only one access to each memory block is allowed in each cycle, so DMA or external port transfers must compete
Processor Architecture

with core accesses on the same block. Because of the large bandwidth available from each memory block, not all the memory bandwidth can be used by the core units, which leaves some memory bandwidth available for use by the processor’s DMA processes or by the bus interface to serve other DSPs’ external cluster bus master transfers to the TigerSHARC processor’s memory.

Both code and data can be downloaded to the TigerSHARC processor using DMA transfers, which can occur between the following.

- TigerSHARC processor internal memory and external memory, external peripherals or a host processor
- External memory and external peripheral devices
- External memory and link ports or between two link ports

Six DMA channels (four external port DMA channels and two autoDMA channels) are available on the TigerSHARC processor for data transfers through the external port. Eight DMA channels are available for link data transfers (two per link).

Asynchronous off-chip peripherals can control any one of four DMA channels using DMA request lines (DMAR3–0). Other DMA features include flyby (for channel 0 only), interrupt generation upon completion of DMA transfers, and DMA chaining for automatically linked DMA transfers.

For more information on DMA, see “Direct Memory Access” on page 7-1.
External (Address and Data) Port

The TigerSHARC processor external port provides an interface to external memory, to memory-mapped I/O, to host processor, and to additional TigerSHARC processors. The external port performs external bus arbitration and supplies control signals to shared, global memory, SDRAM, and I/O devices.

The external port cluster bus can be configured to be either 32 or 64 bits wide. The high I/O bandwidth complements the high processing speeds of the core. To facilitate the high clock rate, the TigerSHARC processor uses a pipelined external bus protocol with programmable pipeline depth for external memory access and host communication. For more information on the external port, see “External Port and SDRAM Interface” on page 8-1.

External Bus and Host Interface

The TigerSHARC processor external port (EP) provides an interface between the core processor and the 32/64-bit parallel external bus. The external port contains FIFOs that maintain the throughput of an external bus that is shared with multiple processors and peripherals—each of which may operate at speeds other than that of the core.

The most effective way to access external data in the TigerSHARC processor is through the DMA. This runs in the background, allowing the core to continue processing while new data is read in or processed data is written out. Multiple DMA data streams can occur simultaneously, and the use of FIFOs helps to maintain throughput in the system.

Burst accesses are provided through the BRST pin, which allows a slave device on the bus to accept the first address and then automatically increment that address as successive data words arrive.
External Memory

The TigerSHARC processor external port provides the processor interface to off-chip memory and peripherals. The off-chip memory and peripherals are included in the TigerSHARC processor unified address space. The separate on-chip buses are multiplexed at the external port to create an external system bus with a single 32-bit address bus and a single 64-bit data bus. External memory and devices can be either 32 or 64 bits wide. The TigerSHARC processor automatically packs external data into either 32-, 64-, or 128-bit word widths, the latter being more efficient for reducing memory access conflicts.

On-chip decoding of high order address lines (to generate memory block select signals) facilitates addressing of external memory devices. Separate control lines are also generated for simplified addressing of page mode DRAM.

The TigerSHARC processor uses the address on the external port bus to pipeline the data. This allows interfacing to synchronous DRAM and speeds up interprocessor accesses. An option allows asynchronous operation for slower devices.

External data can be accessed by DMA channels or by the core. For core accesses, the read latency can be significant—eight or more cycles. The core provides I/O buffering by stalling if the data is accessed before the data is loaded in a universal register (Ureg).

Programmable memory wait states permit peripherals with different access, hold, and disable time requirements.

External shared memory resources are assigned between processors by using semaphore operations.
Multiprocessing

The ADSP-TS201 processor, like the ADSP-TS101 processor, is designed for multiprocessing applications. The primary multiprocessing architecture supported is a cluster of up to eight TigerSHARC processors that share a common bus, a global memory, and an interface to either a host processor or to other clusters. In large multiprocessing systems, this cluster can be considered an element and connected in configurations such as torroid, mesh, tree, crossbar, or others. The user can provide a personal interconnect method or use the on-chip communication ports.

The ADSP-TS201 processor includes the following multiprocessing capabilities:

- On-chip bus arbitration for glueless multiprocessing
- Globally accessible internal memory and registers
- Semaphore support
- Powerful, in-circuit multiprocessing emulation

The TigerSHARC processor offers features tailored to multiprocessing systems:

- The unified address space allows direct interprocessor accesses of each TigerSHARC processor internal memory and resources.
- Distributed bus arbitration logic is included on chip for glueless connection of systems containing up to eight TigerSHARC processors and a host processor.
- Bus arbitration rotates, except for host requests that always hold the highest priority.
- Processor bus lock allows indivisible read-modify-write sequences for semaphores.
Processor Architecture

- A vector interrupt capability is provided for interprocessor commands.
- Broadcast writes allow simultaneous transmissions of data to all TigerSHARC processors.

Host Interface

Connecting a host processor to a cluster of TigerSHARC processors is simplified by the memory-mapped nature of the interface bus and the availability of special host bus request signals.

A host that is able to access a pipelined memory interface can be easily connected to the parallel TigerSHARC processor bus. All the internal memory, Uregs, and resources within the TigerSHARC processor, such as the DMA control registers and the internal memory, are accessible to the host.

The host interface is through the TigerSHARC processor external address and data bus, with additional lines being provided for host control. The protocol is similar to the standard TigerSHARC processor pipelined bus protocol.

The host becomes bus master of the cluster by asserting the Host Bus Request (HBR) signal. Host Bus Grant (HBG) is returned by the TigerSHARC processors when the current master grants bus by asserting HBR. The host interface is synchronous, and can be delayed a number of cycles to allow slow host access. The host can also access external memory directly.

All DMA channels are accessible to the host interface, allowing code and data transfers to be accomplished with low software overhead. The host can directly read and write the internal memory of the TigerSHARC processor and can access the DMA channel setup. Vector interrupt support is provided for efficient execution of host commands and burst-mode transfers.
Link Ports

The TigerSHARC processor has four link ports that provide four-bit receive and four-bit transmit I/O capabilities in multiprocessing systems. The link ports have the following characteristics.

- Link clock speed is selectable as either x1/4, x1/2, x2/3, or x1 of internal clock frequency.
- Link port data is packed into 128-bit words for DMA transfer to on- or off-chip memory.
- Each link port has its own buffer registers.
- Link port transfers are controlled by acknowledge handshaking.
- Link ports support full-duplex transfer and transfers to/from the external port or other links.

For more information on the link ports, see “Link Ports” on page 9-1.

JTAG Port and Debug Interface

The ADSP-TS201 processor supports the IEEE Standard 1149.1 Joint Test Action Group (JTAG) port for system test. This standard defines a method for serially scanning the I/O status of each component in a system. The JTAG serial port is also used by the TigerSHARC processor emulator to gain access to the processor’s on-chip emulation features.

For more information, see “JTAG Port and Test/Debug Interface” on page 10-1.
Programming Information

Refer to the *ADSP-TS201 TigerSHARC Processor Programming Reference* for more programming information. Information available in the programming reference includes:

- Detailed chapters on each computation unit (ALU, CLU, multiplier, and shifter), IALU, program sequencer, and embedded DRAM (internal memory) operation
- Complete reference information on all processor instructions
- Complete reference information on instruction parallelism rules
- All available reference information on other programming topics
Programming Information
This chapter describes the ADSP-TS201 TigerSHARC processor memory and register map. For information on using registers for computations and memory for register loads and stores, see the ADSP-TS201 TigerSHARC Processor Programming Reference. For information on using registers for configuring the TigerSHARC processor’s peripherals, use the applicable chapters in this book.

The ADSP-TS201 TigerSHARC processor has six internal memory blocks as shown in the memory map (Figure 2-1). Each memory block consists of 4M bits of memory space, and is configured as 128K words, each 32 bits in width. There are four separate internal 128-bit data buses, each can access any of the memory blocks. Memory blocks can store instructions and data interchangeably, with one access per memory block per cycle. If the programmer ensures that program and data are in different memory blocks, then data access can occur at the same time as program fetch. The I/O processor has its own internal bus, so the I/O processor does not compete with the core for use of an internal bus. Thus in one cycle, up to four 128-bit transfers can occur within the core (two data transfers, one program instruction transfer, and one I/O processor interface transfer).

The TigerSHARC processor’s 32-bit address bus provides an address space of four gigawords. This address space is common to a cluster of TigerSHARC processors that share the same cluster bus. This chapter defines the memory map of each TigerSHARC processor in the system.
Figure 2-1. ADSP-TS201 Memory Map
Memory and Registers

and indicates where the memory space defines the location of each element. The zones in the memory space are made up of the following regions.

- External memory bank space—the region for standard addressing of off-chip memory, including SDRAM, bank 0 (\(MS0\)), bank 1 (\(MS1\)), and Host (\(MS\))
- External multiprocessor space—the on-chip memory of all other TigerSHARC processors connected in a multiprocessor system
- Internal address space—the region for standard internal addressing

The global memory map is shown in Figure 2-1.

In addition to the direct accesses (normal - one 32-bit word, long - two 32-bit words, and quad - four 32-bit words), several other efficient methods are available. These include Broadcast Write, Merged Distribution, and Broadcast Distribution. Broadcast Write is an external write to other DSPs in a multiprocessor cluster. Merged and Broadcast Distribution are internal access methods. For additional information on memory access methods, see “IALU” in the ADSP-TS201 TigerSHARC Processor Programming Reference.
Host Address Space

The host address space is the space defined for the host when it is accessed as a slave. When referring to this space, the pipelined or asynchronous protocol is used according to the host bits in the SYSCON register—for additional information on the SYSCON register see Figure 2-35 on page 2-74 and Figure 2-36 on page 2-75. The backoff signal is also effective on this zone—see “Back Off (BOFF) Pin” on page 8-48. The host address is two gigawords and is divided into fields, as illustrated in Table 2-1.

Table 2-1. Host Address Space

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Definition</th>
</tr>
</thead>
<tbody>
<tr>
<td>ADDR30–0</td>
<td>Address</td>
<td>Address in host range</td>
</tr>
<tr>
<td>ADDR31</td>
<td>Host Select</td>
<td>Determines the type of address.</td>
</tr>
<tr>
<td></td>
<td></td>
<td>If ADDR31=1, the address is in host address space</td>
</tr>
</tbody>
</table>

External Memory Bank Space

This memory space corresponds to off-chip memory and memory-mapped I/O devices (SDRAM, I/O peripherals, and other standard memory devices).

Normal external accesses are split into banks. One set of banks is for SDRAM and is accessed using external memory select pins MSSDO, MSSDT, MSSDZ, and MSSH; the access is executed in SDRAM protocol. Another set of banks is identified by external memory select pins MSO and MST, where the access protocol is user-configurable as pipelined or slow device protocol, and the parameters are defined by the SYSCON register value—see section “External Bus Interface Register Groups” on page 2-72.
The external memory address is divided into fields, as illustrated in Table 2-2.

### Table 2-2. External Memory Bank Space

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Definition</th>
</tr>
</thead>
<tbody>
<tr>
<td>ADDR25–0</td>
<td>Address</td>
<td>Internal address space, as described in “Internal Address Space” on page 2-7</td>
</tr>
<tr>
<td>ADDR30–26</td>
<td>MS/PRID</td>
<td>Memory Select—select an external/multiprocessor memory space:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>01100 – Bank 0 (MS0)</td>
</tr>
<tr>
<td></td>
<td></td>
<td>01110 – Bank 1 (MST)</td>
</tr>
<tr>
<td></td>
<td></td>
<td>10000 – SDRAM Bank 0 (MSSD0)</td>
</tr>
<tr>
<td></td>
<td></td>
<td>10001 – Reserved</td>
</tr>
<tr>
<td></td>
<td></td>
<td>10100 – SDRAM Bank 1 (MSSD1)</td>
</tr>
<tr>
<td></td>
<td></td>
<td>10101 – Reserved</td>
</tr>
<tr>
<td></td>
<td></td>
<td>11000 – SDRAM Bank 2 (MSSD2)</td>
</tr>
<tr>
<td></td>
<td></td>
<td>11001 – Reserved</td>
</tr>
<tr>
<td></td>
<td></td>
<td>11100 – SDRAM Bank 3 (MSSD3)</td>
</tr>
<tr>
<td></td>
<td></td>
<td>11101 – Reserved</td>
</tr>
<tr>
<td>ADDR31</td>
<td>Host Select</td>
<td>Determine the type of address. If ADDR31=0, the address can be in external, internal, or multiprocessor address space.</td>
</tr>
</tbody>
</table>
The multiprocessing space maps the internal memory space of each TigerSHARC processor in the cluster into any other TigerSHARC processor. This allows one processor to easily write to and read from other processors in a multiprocessor system. Broadcast space allows write access to all TigerSHARC processors in the cluster. Each TigerSHARC processor in the cluster is identified by its ID. Valid processor ID values are 0 through 7.

The external multiprocessor address is divided into fields, as illustrated in Table 2-3.

### Table 2-3. Multiprocessor Space

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Definition</th>
</tr>
</thead>
<tbody>
<tr>
<td>ADDR25–0</td>
<td>Address</td>
<td>Internal address space, as described in &quot;Internal Address Space&quot; on page 2-7</td>
</tr>
<tr>
<td>ADDR30–26</td>
<td>MS/PRID</td>
<td>Memory Select—select an external/multiprocessor memory space:</td>
</tr>
<tr>
<td></td>
<td></td>
<td>00011 – Multiprocessor Broadcast</td>
</tr>
<tr>
<td></td>
<td></td>
<td>00100 – Multiprocessor ID0</td>
</tr>
<tr>
<td></td>
<td></td>
<td>00101 – Multiprocessor ID1</td>
</tr>
<tr>
<td></td>
<td></td>
<td>00110 – Multiprocessor ID2</td>
</tr>
<tr>
<td></td>
<td></td>
<td>00111 – Multiprocessor ID3</td>
</tr>
<tr>
<td></td>
<td></td>
<td>01000 – Multiprocessor ID4</td>
</tr>
<tr>
<td></td>
<td></td>
<td>01001 – Multiprocessor ID5</td>
</tr>
<tr>
<td></td>
<td></td>
<td>01010 – Multiprocessor ID6</td>
</tr>
<tr>
<td></td>
<td></td>
<td>01011 – Multiprocessor ID7</td>
</tr>
<tr>
<td>ADDR31</td>
<td>Host Select</td>
<td>Determine the type of address. If ADDR31=0, the address can be in external,</td>
</tr>
<tr>
<td></td>
<td></td>
<td>internal, or multiprocessor address space.</td>
</tr>
</tbody>
</table>
Memory and Registers

The TigerSHARC processor’s own internal space (including Ureg registers) can be accessed via the multiprocessing space (including broadcast space) for write transactions only. There are no interlocks. Because this is performed through the external bus it should only be used in special cases where data must pass through the TigerSHARC processor bus interface.

Internal Address Space

The internal address space corresponds to that processor’s own internal address space, Ureg registers. The internal address space is illustrated in Figure 2-1 on page 2-2.

Internal space is the space for transactions within the TigerSHARC processor and access to this memory space is not reflected on the cluster bus. Internal address space is used to access the internal memory blocks, or any of the Universal registers (Ureg). Universal registers are internal registers that are mapped to the TigerSHARC processor memory map. Most software accessible registers are Ureg registers.

Access to Ureg registers as memory locations is only available through multiprocessing space. Internal access to registers cannot be memory mapped—in load/store instructions, for example, the address cannot point to a register through the internal space. The DMA also cannot access a register directly, although there is an exception—link receive DMA channels may write to other link transmit registers.
Universal (Ureg) Register Space

Frequently, this text refers to universal register groups and access techniques or restrictions that apply to particular groups. The processor’s memory-mapped, universal registers are grouped according to their relationship to the processor’s architecture. For example, data registers (XR31–0, YR31–0, or XYSR31–0) for the compute blocks are in one register group, while data registers (J30–0 or K30–0) for the integer ALUs (J-IALU or K-IALU) are in another register group. Each register group corresponds to a range of memory addresses because all registers in these groups are memory mapped, universal registers.

The term universal (Ureg) register indicates several important features about the register. First, the register is memory-mapped, indicating that the register can be accessed by other processors in a multiprocessor system through the register’s address. Second, the register contents can be loaded, from memory, stored to memory, or transferred to or from other Ureg registers in the processor. Third, the register can be accessed as a single-, dual-, or quad-register.¹

Multiprocessor accesses to another processor’s Ureg registers use the memory addresses that appear in the register group tables as part of the address, but must also add the offset for a particular processor. (See the memory map in Figure 2-1 on page 2-2.) The following code examples show multiprocessor accesses of another processor’s Ureg registers.

/* In the following multiprocessor register access example, one processor in a multiprocessor system transfers the contents of its XR1 register to processor P1’s XRO register */

¹ Some universal system registers (for example, sequencer register group 0x1A, DMA control/status register group 0x23, external bus interface register group 0x24, and JTAG register group 0x3D) are not accessible as dual- or quad-registers.
Memory and Registers

J0 = P1_OFFSET_LOC ;;
   /* P1_OFFSET_LOC is 0x1400 0000; the location of processor P1 */
J1 = XR0_LOC ;;
   /* XR0_LOC is 0x001E 0000; the location of register XR0 */
[J0 + J1] = XR1 ;;
   /* transfers register XR1 data to register XR0 on proc. P1 */

For information on memory read and write instruction syntax, see the “IALU Load, Store, and Transfer (Data Addressing) Operations” section in Chapter 7, IALU, in the ADSP-TS201 TigerSHARC Processor Programming Reference.

A processor may not make a read access of its own Ureg registers using a multiprocessor access. Such an read access causes an illegal access exception. A processor may make a multiprocessor write access (such as a multiprocessor broadcast write) to its own Ureg registers. For more information on exceptions, see “Handling Exceptions” on page 6-15.

Load, store, or transfer Ureg register accesses within a processor use the register’s name, not its memory-mapped address. The register names appear in the register group tables. A register load access copies data from the processor’s memory and places the data into the register. A register store access copies data from the register and places the data into the processor’s memory. A register transfer (or move) access copies data from one register to another. The following code examples show single-register load, store, and transfer accesses.

/* In the following single-register load, store, and transfer access examples, data is copied to and from single, 32-bit registers */
XR0 = 0x76543210 ;; /* loads XR0 with immediate 32-bit data */
XR4 = [J31 + 0x43] ;; /* loads XR0 from memory */
[J0 + J4] = YR0 ;; /* stores YR0 to memory */
XR7 = SQSTAT ;; /* transfers SQSTAT contents to XR7 */
Universal (Ureg) Register Space

For information on memory read and write instruction syntax, see the “IALU Load, Store, and Transfer (Data Addressing) Operations” section in Chapter 7, IALU, in the ADSP-TS201 TigerSHARC Processor Programming Reference.

If using legal instruction parallelism, there can be up to two load, store, or transfer accesses on an instruction line—greatly increasing performance of routines that require loading values into many registers. For information on instruction parallelism, see the “Instruction Parallelism Rules” section in Chapter 1, Introduction, in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Unlike the single-register accesses in the previous example that use the registers name to access a single register, dual- and quad-register accesses within a processor use a modified form of the register’s name to access multiple, adjacent registers that are located on dual- or quad-word address boundaries. The modification to the register’s name indicates a range of registers, rather than a single register. For example the dual-register name XR1:0 indicates register R1 and R0 in the X compute block, and the quad-register name J31:28 indicates registers J31, J30, J29, and J28 in the J-IALU. Alignment to a dual- or quad-word address boundary refers to the register number (and address) of the lowest register in the group; dual-register boundaries end in multiples of two (for example, R1:0, R3:2, R27:26); quad-register boundaries end in multiples of four (for example, R3:0, R7:4, R31:28). The following code examples show dual- and quad-register load, store, and transfer accesses.

/* In the following dual- and quad-register load, store, and transfer access examples data is copied to and from dual (64-bit) and quad (128-bit) registers */
YR5:4 = L [J31 + 0x1ff0] ;; /* loads YR5:4 from memory */
XR3:0 = Q [K4 += K5] ;; /* loads XR3:0 from memory */
L [J0 + J2] = YR31:30 ;; /* stores YR31:30 to memory */
Q [K31 + K28] = YR7:4 ;; /* stores Y7:4 to memory */
Memory and Registers

CACMADRO = J1:0 ;; /* transfers J1:0 to CCAIRO:CACMD0 */
DCS0 = YR11:8 ;; /* transfers YR11:8 to DCS0 (DP, DY, DX, DI) */

For information on single-, dual-, and quad-register name instruction syntax, see the “Register File Registers” section in Chapter 2, Compute Block Registers, in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Register groups and register quad relationships can have important influence on instruction parallelism. For information on instruction parallelism, see the “Instruction Parallelism Rules” section in Chapter 1, Introduction, in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Normally, a register load from memory or a register store to memory access occurs between one register and a memory word of the same size—a single-register load/store of a normal (32-bit) word, a dual-register load/store of a long (64-bit) word, or a quad-register load/store of a quad (128-bit) word. When one register is loaded/stored using a memory word of the same size, the operation is called a normal read or write access. To take advantage of single-instruction, multiple-data (SIMD) processing, the processor’s bus architecture also supports types of accesses in which the contents of one memory word can be loaded into two registers (broadcast read access) and supports types of accesses in which multiple registers can be loaded/stored with different data using one memory word (merged read or write access). For information on access types and data placement, see the “Normal, Merged, and Broadcast Memory Accesses” section in Chapter 7, IALU, in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Note that broadcast read accesses only occur between memory and registers within a processor, while multiprocessor broadcast write accesses only occur simultaneously across all processors in a multiprocessor system.
Universal (Ureg) Register Space

The register space is composed of 64 register groups with up to 32 registers in each group. Register groups are defined in the range 0x3F–0 (63–0), where groups 0x1F–0 are accessible by all transfer instructions (load immediate, move register, load and store), and groups 0x3F–0x20 are accessible only by move register instructions and direct accesses of other masters.

The register groups are:

- Compute block register file – groups 0x00–0x09
- IALU registers – groups 0x0C–0x0F
- Debug feature registers – groups 0x0A, 0x1B
- Program sequencer – group 0x1A
- Memory controller – groups 0x1E–0x1F
- DMA registers – groups 0x20 – 0x23
- External port control/status registers – group 0x24
- Link port registers – groups 0x25 – 0x27
- Interrupt controller – groups 0x38, 0x39, and 0x3A
- JTAG registers – group 0x3D
- AutoDMA registers – group 0x3F
- Others – Reserved. These registers must not be accessed by applications since they could cause unexpected behavior by the TigerSHARC processor.

There is a direct relationship between a Ureg register’s memory-mapped address and the register group in which the register resides. (See Figure 2-2.) Because register groups have different access restrictions, it is
always important to know the group to which a register belongs. When in
doubt, use the information in Figure 2-2 to determine the register group
number.

<table>
<thead>
<tr>
<th>Register Group Number, Bits</th>
<th>5</th>
<th>4</th>
<th>3</th>
<th>2</th>
<th>1</th>
<th>0</th>
</tr>
</thead>
<tbody>
<tr>
<td>Register Address, Bits</td>
<td>16</td>
<td>9</td>
<td>8</td>
<td>7</td>
<td>6</td>
<td>5</td>
</tr>
</tbody>
</table>

Figure 2-2. Register Group Number Versus Register Address

It is also important to note from a \textit{Ureg} register’s memory-mapped address
whether the register resides in the processor core (internal \textit{Ureg} registers)
or in the SOC interface (SOC \textit{Ureg} registers). (See Figure 2-1 on
page 2-2.) Because register groups in the SOC interface have different
access restrictions than core registers, it is important to know the where
the register resides.

\textbf{i} Some of the registers do not use all 32 bits. The unused bits are
reserved. When writing to register with reserved bits, these bits
must be written with to zero. When reading a register with reserved
bits, the reserved bits may be of any value.
Universal (Ureg) Register Space

Compute Block Register Groups

Each compute block register file is a 32-word register. There are two such register files, one in each of the compute blocks (X and Y). The register file is aliased in several groups, some of which are applicable for data transfer instructions only. Some groups point to compute block X and some to block Y; those that point to both compute blocks in parallel actually point to the same register. The X and Y registers are memory mapped Universal (Ureg) registers.

The Compute Block register types of accesses are listed with their memory mapped addresses in Table 2-4, Table 2-5, Table 2-6, Table 2-7, and Table 2-8. Note that Ureg register file groups for alternative access are only for load/store instruction—DAB, circular buffer. For more information about alternate access, see the ADSP-TS201 TigerSHARC Processor Programming Reference.
### Table 2-4. Compute Block X Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Address</th>
<th>Default</th>
<th>Name</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>XR0</td>
<td>0x1E 0000</td>
<td>Undefined</td>
<td>XR16</td>
<td>0x1E 0010</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR1</td>
<td>0x1E 0001</td>
<td>Undefined</td>
<td>XR17</td>
<td>0x1E 0011</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR2</td>
<td>0x1E 0002</td>
<td>Undefined</td>
<td>XR18</td>
<td>0x1E 0012</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR3</td>
<td>0x1E 0003</td>
<td>Undefined</td>
<td>XR19</td>
<td>0x1E 0013</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR4</td>
<td>0x1E 0004</td>
<td>Undefined</td>
<td>XR20</td>
<td>0x1E 0014</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR5</td>
<td>0x1E 0005</td>
<td>Undefined</td>
<td>XR21</td>
<td>0x1E 0015</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR6</td>
<td>0x1E 0006</td>
<td>Undefined</td>
<td>XR22</td>
<td>0x1E 0016</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR7</td>
<td>0x1E 0007</td>
<td>Undefined</td>
<td>XR23</td>
<td>0x1E 0017</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR8</td>
<td>0x1E 0008</td>
<td>Undefined</td>
<td>XR24</td>
<td>0x1E 0018</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR9</td>
<td>0x1E 0009</td>
<td>Undefined</td>
<td>XR25</td>
<td>0x1E 0019</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR10</td>
<td>0x1E 000A</td>
<td>Undefined</td>
<td>XR26</td>
<td>0x1E 001A</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR11</td>
<td>0x1E 000B</td>
<td>Undefined</td>
<td>XR27</td>
<td>0x1E 001B</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR12</td>
<td>0x1E 000C</td>
<td>Undefined</td>
<td>XR28</td>
<td>0x1E 001C</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR13</td>
<td>0x1E 000D</td>
<td>Undefined</td>
<td>XR29</td>
<td>0x1E 001D</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR14</td>
<td>0x1E 000E</td>
<td>Undefined</td>
<td>XR30</td>
<td>0x1E 001E</td>
<td>Undefined</td>
</tr>
<tr>
<td>XR15</td>
<td>0x1E 000F</td>
<td>Undefined</td>
<td>XR31</td>
<td>0x1E 001F</td>
<td>Undefined</td>
</tr>
</tbody>
</table>
### Universal (Ureg) Register Space

#### Table 2-5. Compute Block Y Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Address</th>
<th>Default</th>
<th>Name</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>YR0</td>
<td>0x1E 0040</td>
<td>Undefined</td>
<td>YR16</td>
<td>0x1E 0050</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR1</td>
<td>0x1E 0041</td>
<td>Undefined</td>
<td>YR17</td>
<td>0x1E 0051</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR2</td>
<td>0x1E 0042</td>
<td>Undefined</td>
<td>YR18</td>
<td>0x1E 0052</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR3</td>
<td>0x1E 0043</td>
<td>Undefined</td>
<td>YR19</td>
<td>0x1E 0053</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR4</td>
<td>0x1E 0044</td>
<td>Undefined</td>
<td>YR20</td>
<td>0x1E 0054</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR5</td>
<td>0x1E 0045</td>
<td>Undefined</td>
<td>YR21</td>
<td>0x1E 0055</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR6</td>
<td>0x1E 0046</td>
<td>Undefined</td>
<td>YR22</td>
<td>0x1E 0056</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR7</td>
<td>0x1E 0047</td>
<td>Undefined</td>
<td>YR23</td>
<td>0x1E 0057</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR8</td>
<td>0x1E 0048</td>
<td>Undefined</td>
<td>YR24</td>
<td>0x1E 0058</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR9</td>
<td>0x1E 0049</td>
<td>Undefined</td>
<td>YR25</td>
<td>0x1E 0059</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR10</td>
<td>0x1E 004A</td>
<td>Undefined</td>
<td>YR26</td>
<td>0x1E 005A</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR11</td>
<td>0x1E 004B</td>
<td>Undefined</td>
<td>YR27</td>
<td>0x1E 005B</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR12</td>
<td>0x1E 004C</td>
<td>Undefined</td>
<td>YR28</td>
<td>0x1E 005C</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR13</td>
<td>0x1E 004D</td>
<td>Undefined</td>
<td>YR29</td>
<td>0x1E 005D</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR14</td>
<td>0x1E 004E</td>
<td>Undefined</td>
<td>YR30</td>
<td>0x1E 005E</td>
<td>Undefined</td>
</tr>
<tr>
<td>YR15</td>
<td>0x1E 004F</td>
<td>Undefined</td>
<td>YR31</td>
<td>0x1E 005F</td>
<td>Undefined</td>
</tr>
</tbody>
</table>
## Memory and Registers

### Table 2-6. Compute Block XY Merged Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Address</th>
<th>Default</th>
<th>Name</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>XYR0</td>
<td>0x1E 0080</td>
<td>Undefined</td>
<td>XYR16</td>
<td>0x1E 0090</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR1</td>
<td>0x1E 0081</td>
<td>Undefined</td>
<td>XYR17</td>
<td>0x1E 0091</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR2</td>
<td>0x1E 0082</td>
<td>Undefined</td>
<td>XYR18</td>
<td>0x1E 0092</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR3</td>
<td>0x1E 0083</td>
<td>Undefined</td>
<td>XYR19</td>
<td>0x1E 0093</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR4</td>
<td>0x1E 0084</td>
<td>Undefined</td>
<td>XYR20</td>
<td>0x1E 0094</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR5</td>
<td>0x1E 0085</td>
<td>Undefined</td>
<td>XYR21</td>
<td>0x1E 0095</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR6</td>
<td>0x1E 0086</td>
<td>Undefined</td>
<td>XYR22</td>
<td>0x1E 0096</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR7</td>
<td>0x1E 0087</td>
<td>Undefined</td>
<td>XYR23</td>
<td>0x1E 0097</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR8</td>
<td>0x1E 0088</td>
<td>Undefined</td>
<td>XYR24</td>
<td>0x1E 0098</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR9</td>
<td>0x1E 0089</td>
<td>Undefined</td>
<td>XYR25</td>
<td>0x1E 0099</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR10</td>
<td>0x1E 008A</td>
<td>Undefined</td>
<td>XYR26</td>
<td>0x1E 009A</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR11</td>
<td>0x1E 008B</td>
<td>Undefined</td>
<td>XYR27</td>
<td>0x1E 009B</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR12</td>
<td>0x1E 008C</td>
<td>Undefined</td>
<td>XYR28</td>
<td>0x1E 009C</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR13</td>
<td>0x1E 008D</td>
<td>Undefined</td>
<td>XYR29</td>
<td>0x1E 009D</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR14</td>
<td>0x1E 008E</td>
<td>Undefined</td>
<td>XYR30</td>
<td>0x1E 009E</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR15</td>
<td>0x1E 008F</td>
<td>Undefined</td>
<td>XYR31</td>
<td>0x1E 009F</td>
<td>Undefined</td>
</tr>
</tbody>
</table>
## Universal (Ureg) Register Space

Table 2-7. Compute Block YX Merged Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Address</th>
<th>Default</th>
<th>Name</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>YXR0</td>
<td>0x1E 00C0</td>
<td>Undefined</td>
<td>YXR16</td>
<td>0x1E 00D0</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR1</td>
<td>0x1E 00C1</td>
<td>Undefined</td>
<td>YXR17</td>
<td>0x1E 00D1</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR2</td>
<td>0x1E 00C2</td>
<td>Undefined</td>
<td>YXR18</td>
<td>0x1E 00D2</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR3</td>
<td>0x1E 00C3</td>
<td>Undefined</td>
<td>YXR19</td>
<td>0x1E 00D3</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR4</td>
<td>0x1E 00C4</td>
<td>Undefined</td>
<td>YXR20</td>
<td>0x1E 00D4</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR5</td>
<td>0x1E 00C5</td>
<td>Undefined</td>
<td>YXR21</td>
<td>0x1E 00D5</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR6</td>
<td>0x1E 00C6</td>
<td>Undefined</td>
<td>YXR22</td>
<td>0x1E 00D6</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR7</td>
<td>0x1E 00C7</td>
<td>Undefined</td>
<td>YXR23</td>
<td>0x1E 00D7</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR8</td>
<td>0x1E 00C8</td>
<td>Undefined</td>
<td>YXR24</td>
<td>0x1E 00D8</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR9</td>
<td>0x1E 00C9</td>
<td>Undefined</td>
<td>YXR25</td>
<td>0x1E 00D9</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR10</td>
<td>0x1E 00CA</td>
<td>Undefined</td>
<td>YXR26</td>
<td>0x1E 00DA</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR11</td>
<td>0x1E 00CB</td>
<td>Undefined</td>
<td>YXR27</td>
<td>0x1E 00DB</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR12</td>
<td>0x1E 00CC</td>
<td>Undefined</td>
<td>YXR28</td>
<td>0x1E 00DC</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR13</td>
<td>0x1E 00CD</td>
<td>Undefined</td>
<td>YXR29</td>
<td>0x1E 00DD</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR14</td>
<td>0x1E 00CE</td>
<td>Undefined</td>
<td>YXR30</td>
<td>0x1E 00DE</td>
<td>Undefined</td>
</tr>
<tr>
<td>YXR15</td>
<td>0x1E 00CF</td>
<td>Undefined</td>
<td>YXR31</td>
<td>0x1E 00DF</td>
<td>Undefined</td>
</tr>
</tbody>
</table>
Memory and Registers

Table 2-8. Compute Block XY Broadcast Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Address</th>
<th>Default</th>
<th>Name</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>XYR0</td>
<td>0x1E 0100</td>
<td>Undefined</td>
<td>XYR16</td>
<td>0x1E 0110</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR1</td>
<td>0x1E 0101</td>
<td>Undefined</td>
<td>XYR17</td>
<td>0x1E 0111</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR2</td>
<td>0x1E 0102</td>
<td>Undefined</td>
<td>XYR18</td>
<td>0x1E 0112</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR3</td>
<td>0x1E 0103</td>
<td>Undefined</td>
<td>XYR19</td>
<td>0x1E 0113</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR4</td>
<td>0x1E 0104</td>
<td>Undefined</td>
<td>XYR20</td>
<td>0x1E 0114</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR5</td>
<td>0x1E 0105</td>
<td>Undefined</td>
<td>XYR21</td>
<td>0x1E 0115</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR6</td>
<td>0x1E 0106</td>
<td>Undefined</td>
<td>XYR22</td>
<td>0x1E 0116</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR7</td>
<td>0x1E 0107</td>
<td>Undefined</td>
<td>XYR23</td>
<td>0x1E 0117</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR8</td>
<td>0x1E 0108</td>
<td>Undefined</td>
<td>XYR24</td>
<td>0x1E 0118</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR9</td>
<td>0x1E 0109</td>
<td>Undefined</td>
<td>XYR25</td>
<td>0x1E 0119</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR10</td>
<td>0x1E 010A</td>
<td>Undefined</td>
<td>XYR26</td>
<td>0x1E 011A</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR11</td>
<td>0x1E 010B</td>
<td>Undefined</td>
<td>XYR27</td>
<td>0x1E 011B</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR12</td>
<td>0x1E 010C</td>
<td>Undefined</td>
<td>XYR28</td>
<td>0x1E 011C</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR13</td>
<td>0x1E 010D</td>
<td>Undefined</td>
<td>XYR29</td>
<td>0x1E 011D</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR14</td>
<td>0x1E 010E</td>
<td>Undefined</td>
<td>XYR30</td>
<td>0x1E 011E</td>
<td>Undefined</td>
</tr>
<tr>
<td>XYR15</td>
<td>0x1E 010F</td>
<td>Undefined</td>
<td>XYR31</td>
<td>0x1E 011F</td>
<td>Undefined</td>
</tr>
</tbody>
</table>

Unmapped Compute Block Registers

There are several registers in the compute block units that are not Universal registers (\textit{Ureg} registers), in that they are not accessed in the same way that \textit{Ureg} registers are accessed. These registers are accessed by regular compute block instructions that transfer the data between the unmapped registers and the general-purpose compute block \textit{Ureg} registers.
### Compute Block Status (XSTAT/YSTAT) Registers

The `XSTAT` and `YSTAT` registers are 32-bit compute block status registers that record the state of the compute block status flags. Every flag is updated when an instruction belonging to its computation unit is completed. The `X/YSTAT` registers (reset value = 0x0000 0000) appear in Figure 2-3 and Figure 2-4. For more information on using the `X/YSTAT` registers, see the “Compute Block Registers” chapter in the *ADSP-TS201 TigerSHARC Processor Programming Reference*.

### ALU Registers

The Parallel Results registers (`PR0` and `PR1`) are two 32-bit registers used for sideways sum instructions. For more information regarding the `PRx` registers, see “ALU” in the *ADSP-TS201 TigerSHARC Processor Programming Reference*.

---

**Figure 2-3. XSTAT/YSTAT (Upper) Register Bit Descriptions**

<table>
<thead>
<tr>
<th>Bit</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>31</td>
<td>MIS—Multiplier floating-pt. invalid op., sticky</td>
</tr>
<tr>
<td>30</td>
<td>MOS—Multiplier fixed-pt. overflow, sticky</td>
</tr>
<tr>
<td>29</td>
<td>MVS—Multiplier floating-pt. overflow, sticky</td>
</tr>
<tr>
<td>28</td>
<td>MUS—Multiplier floating-pt. underflow, sticky</td>
</tr>
<tr>
<td>27</td>
<td>AIS—ALU floating-pt. invalid op., sticky</td>
</tr>
<tr>
<td>26</td>
<td>AOS—ALU fixed-pt. overflow, sticky</td>
</tr>
<tr>
<td>25</td>
<td>AVS—ALU floating-pt. overflow, sticky</td>
</tr>
<tr>
<td>24</td>
<td>AUS—ALU floating-pt. underflow, sticky</td>
</tr>
<tr>
<td>23</td>
<td>Reserved</td>
</tr>
<tr>
<td>22</td>
<td>IVEN—Exception enable on invalid floating-pt.</td>
</tr>
<tr>
<td>21</td>
<td>OEN—Exception enable on overflow status</td>
</tr>
<tr>
<td>20</td>
<td>UEN—Exception enable on underflow status</td>
</tr>
<tr>
<td>19</td>
<td>Reserved</td>
</tr>
<tr>
<td>18</td>
<td></td>
</tr>
</tbody>
</table>
Memory and Registers

Multiplier Registers

The Multiplier Results registers (MR3–0 and MR4) are used as accumulators for the different types of fixed-point Multiply-Accumulate instructions. For more information regarding the MRx registers, see “Multiplier” in the ADSP-TS201 TigerSHARC Processor Programming Reference. These registers are accessed by different multiplier instructions.

Shifter Registers

The Bit FIFO Overflow register (BFOTMP) is a 64-bit register used in the PUTBITS instruction. For more information regarding the BFOTMP register, see the PUTBITS instruction in the ADSP-TS201 TigerSHARC Processor Programming Reference.
Universal (Ureg) Register Space

Communications Logic Unit (CLU) Registers

The CLU instructions use Trellis (TR31–0) data registers, Trellis history (THR3–0) registers, and/or the Communication Control (CMCTL) register. Each register is 32 bits wide. For more information regarding these registers, see the “CLU” chapter in the ADSP-TS201 TigerSHARC Processor Programming Reference.

IALU Register Groups

Each IALU has two Ureg groups. The first group is the general-purpose register file, which includes 32 normal-word registers each. The second group is the circular buffer register file which is used to specify parameters for circular buffering. The IALU register groups are listed in Table 2-9, Table 2-10, Table 2-11, and Table 2-12.

The circular buffer register files contain these registers:

- 3–0 circular buffer base registers JB3–0 and KB3–0
- 7–4 circular buffer length registers JL3–0 and KL3–0
- 31–8 Reserved
### Memory and Registers

#### Table 2-9. J-IALU Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Address</th>
<th>Default</th>
<th>Name</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>J0</td>
<td>0x1E 0180</td>
<td>Undefined</td>
<td>J16</td>
<td>0x1E 0190</td>
<td>Undefined</td>
</tr>
<tr>
<td>J1</td>
<td>0x1E 0181</td>
<td>Undefined</td>
<td>J17</td>
<td>0x1E 0191</td>
<td>Undefined</td>
</tr>
<tr>
<td>J2</td>
<td>0x1E 0182</td>
<td>Undefined</td>
<td>J18</td>
<td>0x1E 0192</td>
<td>Undefined</td>
</tr>
<tr>
<td>J3</td>
<td>0x1E 0183</td>
<td>Undefined</td>
<td>J19</td>
<td>0x1E 0193</td>
<td>Undefined</td>
</tr>
<tr>
<td>J4</td>
<td>0x1E 0184</td>
<td>Undefined</td>
<td>J20</td>
<td>0x1E 0194</td>
<td>Undefined</td>
</tr>
<tr>
<td>J5</td>
<td>0x1E 0185</td>
<td>Undefined</td>
<td>J21</td>
<td>0x1E 0195</td>
<td>Undefined</td>
</tr>
<tr>
<td>J6</td>
<td>0x1E 0186</td>
<td>Undefined</td>
<td>J22</td>
<td>0x1E 0196</td>
<td>Undefined</td>
</tr>
<tr>
<td>J7</td>
<td>0x1E 0187</td>
<td>Undefined</td>
<td>J23</td>
<td>0x1E 0197</td>
<td>Undefined</td>
</tr>
<tr>
<td>J8</td>
<td>0x1E 0188</td>
<td>Undefined</td>
<td>J24</td>
<td>0x1E 0198</td>
<td>Undefined</td>
</tr>
<tr>
<td>J9</td>
<td>0x1E 0189</td>
<td>Undefined</td>
<td>J25</td>
<td>0x1E 0199</td>
<td>Undefined</td>
</tr>
<tr>
<td>J10</td>
<td>0x1E 018A</td>
<td>Undefined</td>
<td>J26</td>
<td>0x1E 019A</td>
<td>Undefined</td>
</tr>
<tr>
<td>J11</td>
<td>0x1E 018B</td>
<td>Undefined</td>
<td>J27</td>
<td>0x1E 019B</td>
<td>Undefined</td>
</tr>
<tr>
<td>J12</td>
<td>0x1E 018C</td>
<td>Undefined</td>
<td>J28</td>
<td>0x1E 019C</td>
<td>Undefined</td>
</tr>
<tr>
<td>J13</td>
<td>0x1E 018D</td>
<td>Undefined</td>
<td>J29</td>
<td>0x1E 019D</td>
<td>Undefined</td>
</tr>
<tr>
<td>J14</td>
<td>0x1E 018E</td>
<td>Undefined</td>
<td>J30</td>
<td>0x1E 019E</td>
<td>Undefined</td>
</tr>
<tr>
<td>J15</td>
<td>0x1E 018F</td>
<td>Undefined</td>
<td>J31</td>
<td>0x1E 019F</td>
<td>Undefined</td>
</tr>
</tbody>
</table>
## Universal (Ureg) Register Space

### Table 2-10. K-IALU Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Address</th>
<th>Default</th>
<th>Name</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>K0</td>
<td>0x1E 01A0</td>
<td>Undefined</td>
<td>K16</td>
<td>0x1E 01B0</td>
<td>Undefined</td>
</tr>
<tr>
<td>K1</td>
<td>0x1E 01A1</td>
<td>Undefined</td>
<td>K17</td>
<td>0x1E 01B1</td>
<td>Undefined</td>
</tr>
<tr>
<td>K2</td>
<td>0x1E 01A2</td>
<td>Undefined</td>
<td>K18</td>
<td>0x1E 01B2</td>
<td>Undefined</td>
</tr>
<tr>
<td>K3</td>
<td>0x1E 01A3</td>
<td>Undefined</td>
<td>K19</td>
<td>0x1E 01B3</td>
<td>Undefined</td>
</tr>
<tr>
<td>K4</td>
<td>0x1E 01A4</td>
<td>Undefined</td>
<td>K20</td>
<td>0x1E 01B4</td>
<td>Undefined</td>
</tr>
<tr>
<td>K5</td>
<td>0x1E 01A5</td>
<td>Undefined</td>
<td>K21</td>
<td>0x1E 01B5</td>
<td>Undefined</td>
</tr>
<tr>
<td>K6</td>
<td>0x1E 01A6</td>
<td>Undefined</td>
<td>K22</td>
<td>0x1E 01B6</td>
<td>Undefined</td>
</tr>
<tr>
<td>K7</td>
<td>0x1E 01A7</td>
<td>Undefined</td>
<td>K23</td>
<td>0x1E 01B7</td>
<td>Undefined</td>
</tr>
<tr>
<td>K8</td>
<td>0x1E 01A8</td>
<td>Undefined</td>
<td>K24</td>
<td>0x1E 01B8</td>
<td>Undefined</td>
</tr>
<tr>
<td>K9</td>
<td>0x1E 01A9</td>
<td>Undefined</td>
<td>K25</td>
<td>0x1E 01B9</td>
<td>Undefined</td>
</tr>
<tr>
<td>K10</td>
<td>0x1E 01AA</td>
<td>Undefined</td>
<td>K26</td>
<td>0x1E 01BA</td>
<td>Undefined</td>
</tr>
<tr>
<td>K11</td>
<td>0x1E 01AB</td>
<td>Undefined</td>
<td>K27</td>
<td>0x1E 01BB</td>
<td>Undefined</td>
</tr>
<tr>
<td>K12</td>
<td>0x1E 01AC</td>
<td>Undefined</td>
<td>K28</td>
<td>0x1E 01BC</td>
<td>Undefined</td>
</tr>
<tr>
<td>K13</td>
<td>0x1E 01AD</td>
<td>Undefined</td>
<td>K29</td>
<td>0x1E 01BD</td>
<td>Undefined</td>
</tr>
<tr>
<td>K14</td>
<td>0x1E 01AE</td>
<td>Undefined</td>
<td>K30</td>
<td>0x1E 01BE</td>
<td>Undefined</td>
</tr>
<tr>
<td>K15</td>
<td>0x1E 01AF</td>
<td>Undefined</td>
<td>K31</td>
<td>0x1E 01BF</td>
<td>Undefined</td>
</tr>
</tbody>
</table>
Memory and Registers

Table 2-11. J-IALU Circular Buffer Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>JB0</td>
<td>0x1E 01C0</td>
<td>Undefined</td>
</tr>
<tr>
<td>JB1</td>
<td>0x1E 01C1</td>
<td>Undefined</td>
</tr>
<tr>
<td>JB2</td>
<td>0x1E 01C2</td>
<td>Undefined</td>
</tr>
<tr>
<td>JB3</td>
<td>0x1E 01C3</td>
<td>Undefined</td>
</tr>
<tr>
<td>JL0</td>
<td>0x1E 01C4</td>
<td>Undefined</td>
</tr>
<tr>
<td>JL1</td>
<td>0x1E 01C5</td>
<td>Undefined</td>
</tr>
<tr>
<td>JL2</td>
<td>0x1E 01C6</td>
<td>Undefined</td>
</tr>
<tr>
<td>JL3</td>
<td>0x1E 01C7</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>0x1E 01C8–0x1E 01DF</td>
<td>N/A</td>
</tr>
</tbody>
</table>

Table 2-12. K-IALU Circular Buffer Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>KB0</td>
<td>0x1E 01E0</td>
<td>Undefined</td>
</tr>
<tr>
<td>KB1</td>
<td>0x1E 01E1</td>
<td>Undefined</td>
</tr>
<tr>
<td>KB2</td>
<td>0x1E 01E2</td>
<td>Undefined</td>
</tr>
<tr>
<td>KB3</td>
<td>0x1E 01E3</td>
<td>Undefined</td>
</tr>
<tr>
<td>KL0</td>
<td>0x1E 01E4</td>
<td>Undefined</td>
</tr>
<tr>
<td>KL1</td>
<td>0x1E 01E5</td>
<td>Undefined</td>
</tr>
<tr>
<td>KL2</td>
<td>0x1E 01E6</td>
<td>Undefined</td>
</tr>
<tr>
<td>KL3</td>
<td>0x1E 01E7</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>0x1E 01E8–0x1E 01FF</td>
<td>N/A</td>
</tr>
</tbody>
</table>
IALU Status (J31/JSTAT and K31/KSTAT) Registers

The JSTAT (for the J-IALU) and KSTAT (for the K-IALU) status registers records the state of the carry flags and is updated as a result of various IALU instructions. When being written to, the JSTAT register can be referred to as J31. When used as an operand in an IALU arithmetic, logical, or function operation—J31 is treated as zero. The J31/JSTAT (K31/KSTAT) registers (reset value = 0x0000 0000) appear in Figure 2-5. For information on using the J31/JSTAT (K31/KSTAT) registers, see the “IALU” chapter of the ADSP-TS201 TigerSHARC Processor Programming Reference.

![Figure 2-5. JSTAT/KSTAT (Lower Half) Register Bit Descriptions](image)

1 Bit 31–16 (not shown) are reserved (=0).
Memory and Registers

Sequencer Register Groups

This 32-bit register group is dedicated to sequencer registers, interrupt control, and status registers. See Table 2-13. All sequencer registers are accessible by single-word access only. For more information regarding the sequencer registers, see “Program Sequencer” in the *ADSP-TS201 TigerSHARC Processor Programming Reference*.

Table 2-13. Sequencer Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>CJMP</td>
<td>Computed jump address(^1)</td>
<td>0xE 0340</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0xE 0341</td>
<td>N/A</td>
</tr>
<tr>
<td>RETI</td>
<td>Return from interrupt address</td>
<td>0xE 0342</td>
<td>Undefined</td>
</tr>
<tr>
<td>RETIB</td>
<td>RETI alias, for interrupt nesting(^2)</td>
<td>0xE 0343</td>
<td>Undefined</td>
</tr>
<tr>
<td>RETS</td>
<td>Return from subroutine address(^3)</td>
<td>0xE 0344</td>
<td>Undefined</td>
</tr>
<tr>
<td>DBGE</td>
<td>Return from emulation address(^3)</td>
<td>0xE 0345</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0xE 0346–0xE 0347</td>
<td>N/A</td>
</tr>
<tr>
<td>LC0</td>
<td>Loop counter #0</td>
<td>0xE 0348</td>
<td>Undefined</td>
</tr>
<tr>
<td>LC1</td>
<td>Loop counter #1</td>
<td>0xE 0349</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0xE 034A–0xE 034F</td>
<td>N/A</td>
</tr>
<tr>
<td>IVSW</td>
<td>Software exception</td>
<td>0xE 0350</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0xE 0351–0xE 0353</td>
<td>N/A</td>
</tr>
<tr>
<td>FLAGREG</td>
<td>Flag control register</td>
<td>0xE 0354</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>FLAGREGST</td>
<td>Flag register set</td>
<td>0xE 0355</td>
<td>Undefined</td>
</tr>
<tr>
<td>FLAGREGCL</td>
<td>Flag register clear</td>
<td>0xE 0356</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0xE 0357</td>
<td>N/A</td>
</tr>
<tr>
<td>SQCTL</td>
<td>Sequencer control register</td>
<td>0xE 0358</td>
<td>0x0000 0004</td>
</tr>
<tr>
<td>SQCTLST</td>
<td>Sequencer control register set bits</td>
<td>0xE 0359</td>
<td>Undefined</td>
</tr>
</tbody>
</table>
Universal (Ureg) Register Space

Flag Control (FLAGREG) Register

The FLAGREG register is a 32-bit register that controls flag pin direction (input or output) and provides output value controls (1 or 0) for output flag pins. The FLAGREG register (reset value = 0x0000 0000) appears in Figure 2-6. For information on using the FLAGREG register, see the “Program Sequencer” chapter of the ADSP-TS201 TigerSHARC Processor Programming Reference.

Figure 2-6. FLAGREG (Lower) Register Bit Descriptions

Table 2-13. Sequencer Register Group (Cont’d)

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>SQCTLCL</td>
<td>Sequencer control register clear bits</td>
<td>0x1E 035A</td>
<td>Undefined</td>
</tr>
<tr>
<td>SQSTAT</td>
<td>Sequencer status register; read only</td>
<td>0x1E 035B</td>
<td>0x0000 FF04</td>
</tr>
<tr>
<td>SFREG</td>
<td>Static condition flag register</td>
<td>0x1E 035C</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 035D–0x1E 035F</td>
<td>N/A</td>
</tr>
</tbody>
</table>

1 See CJUMP Instruction in the ADSP-TS201 TigerSHARC Processor Programming Reference.
2 See “Interrupts” on page 6-1
3 See “Handling Exceptions” on page 6-15.

15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0
0 0 0 0 0 0 0 0 0 0 0 0 0 0 0

FLAGx_EN – Enables a flag pin as either 1=output or 0=input; bit 0-FLAG0, bit 1-FLAG1, bit 2-FLAG2, bit 3-FLAG3

FLAGx_OUT – Selects output value of flag pins if pin is enabled as an output by its FLAGx_EN bit; bit 4-FLAG0, bit 5-FLAG1, bit 6-FLAG2, bit 7-FLAG3

Reserved

Figure 2-6. FLAGREG (Lower) Register Bit Descriptions

1 Bits 31–16 (not shown) are reserved (=0).
Sequencer Control (SQCTL) Register

The sequencer is controlled and configured by writing to the SQCTL register. The SQCTL register (reset value = 0x0000 0004) appears in Figure 2-7. For information on using the SQCTL register, see the “Program Sequencer” chapter of the ADSP-TS201 TigerSHARC Processor Programming Reference.

Figure 2-7. SQCTL (Lower) Register Bit Descriptions

1. Bits 31–16 (not shown) are reserved (=0).
2. There is a five cycle stall after any write to the SQCTL register.
Static Flags (SFREG) Register

Static flags are used as static copies of conditions. When the programmer wishes to keep a condition value after another instruction changes its value, it can be copied into the SFREG and used later as a condition. All static condition flags are grouped into the SFREG register. The SFREG register (reset value = 0x0000 0000) appears in Figure 2-8. For information on using the SFREG register, see the “Program Sequencer” chapter of the ADSP-TS201 TigerSHARC Processor Programming Reference.

Figure 2-8. SFREG (Lower) Register Bit Descriptions

1 Bits 31–16 (not shown) are reserved (=0).
Sequencer Control Set Bits (SQCTLST) Register

The SQCTLST bit is an alias used to write to SQCTL. When writing to this address, the data written into SQCTL is the OR of the old register value and the new data written into SQCTLST. A ‘1’ in any bit of the written data sets the corresponding bit in SQCTL, while a ‘0’ in written data does not change the bit value.

Sequencer Control Clear Bits (SQCTLCL) Register

The SQCTLCL bit is an alias used to write to SQCTL. When writing to this address, the data written into SQCTL is the AND of the old register value and the new data written into SQCTLCL. This way a ‘0’ in any bit of the written data clears the corresponding bit in SQCTL, while a ‘1’ in written data does not change the bit value.

Sequencer Status (SQSTAT) Register

This is a read-only register that holds information about the current status of the sequencer. The SQSTAT register (reset value = 0x0000 FF04) appears in Figure 2-9 and Figure 2-10. For information on using the SQSTAT register, see the “Program Sequencer” chapter of the ADSP-TS201 TigerSHARC Processor Programming Reference.
Universal (Ureg) Register Space

- **31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0**

- **Reserved**
- **BTBLK** – BTB lock; 1=lock (every new entry written to BTB is locked), 0=unlocked
- **BTBEN** – BTB enable; 1=enable, 0=disable
- **Reserved**
- **BTBMOD** – Indicates BTB mode;
  - 00=BTB disabled
  - 01=BTB enabled
  - 10=reserved
  - 11=BTB enabled, locking on
- **Reserved**
- **EMUL** – Emulation interrupt, status
- **SW** – Exception (software interrupt), status
- **GIE** – Global Interrupt Enable, nest
- **FLGx** – Indicates input value of flag pins if pin is enabled as an input by its FLAGx_EN bit; bit 16-FLAG0, bit 17-FLAG1, bit 18-FLAG2, bit 19-FLAG3

Figure 2-9. SQSTAT (Upper) Register Bit Descriptions
Memory and Registers

Figure 2-10. SQSTAT (Lower) Register Bit Descriptions
Universal (Ureg) Register Space

Cache Register Groups

These 32-bit register groups are dedicated to cache command and status registers. See Table 2-14 and Table 2-15. All cache control and status registers are accessible by single-word access only. For more information on using the cache registers, see “Memory and Buses” in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Table 2-14. Cache Register Group A

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>CACMD0</td>
<td>Cache command block 0</td>
<td>0x1E 03C0</td>
<td>Undefined</td>
</tr>
<tr>
<td>CCAIR0</td>
<td>Cache address/index block 0</td>
<td>0x1E 03C1</td>
<td>Undefined</td>
</tr>
<tr>
<td>CASTAT0</td>
<td>Cache status block 0</td>
<td>0x1E 03C2</td>
<td>0x0000 01C2</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 03C3–0x1E 03C7</td>
<td>N/A</td>
</tr>
<tr>
<td>CACMD2</td>
<td>Cache command block 2</td>
<td>0x1E 03C8</td>
<td>Undefined</td>
</tr>
<tr>
<td>CCAIR2</td>
<td>Cache address/index block 2</td>
<td>0x1E 03C9</td>
<td>Undefined</td>
</tr>
<tr>
<td>CASTAT2</td>
<td>Cache status block 2</td>
<td>0x1E 03CA</td>
<td>0x0000 01C2</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 03CB–0x1E 03CF</td>
<td>N/A</td>
</tr>
<tr>
<td>CACMD4</td>
<td>Cache command block 4</td>
<td>0x1E 03D0</td>
<td>Undefined</td>
</tr>
<tr>
<td>CCAIR4</td>
<td>Cache address/index block 4</td>
<td>0x1E 03D1</td>
<td>Undefined</td>
</tr>
<tr>
<td>CASTAT4</td>
<td>Cache status block 4</td>
<td>0x1E 03D2</td>
<td>0x0000 01C2</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 03D3–0x1E 03D7</td>
<td>N/A</td>
</tr>
<tr>
<td>CACMD6</td>
<td>Cache command block 6</td>
<td>0x1E 03D8</td>
<td>Undefined</td>
</tr>
<tr>
<td>CCAIR6</td>
<td>Cache address/index block 6</td>
<td>0x1E 03D9</td>
<td>Undefined</td>
</tr>
<tr>
<td>CASTAT6</td>
<td>Cache status block 6</td>
<td>0x1E 03DA</td>
<td>0x0000 01C2</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 03DB–0x1E 03DF</td>
<td>N/A</td>
</tr>
</tbody>
</table>
### Table 2-15. Cache Register Group B

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>CACMD8</td>
<td>Cache command block 8</td>
<td>0x1E 03E0</td>
<td>Undefined</td>
</tr>
<tr>
<td>CCAIR8</td>
<td>Cache address/index block 8</td>
<td>0x1E 03E1</td>
<td>Undefined</td>
</tr>
<tr>
<td>CASTAT8</td>
<td>Cache status block 8</td>
<td>0x1E 03E2</td>
<td>0x0000 01C2</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 03E3–0x1E 03E7</td>
<td>N/A</td>
</tr>
<tr>
<td>CACMD10</td>
<td>Cache command block 10</td>
<td>0x1E 03E8</td>
<td>Undefined</td>
</tr>
<tr>
<td>CCAIR10</td>
<td>Cache address/index block 10</td>
<td>0x1E 03E9</td>
<td>Undefined</td>
</tr>
<tr>
<td>CASTAT10</td>
<td>Cache status block 10</td>
<td>0x1E 03EA</td>
<td>0x0000 01C2</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 03EB–0x1E 03FB</td>
<td>N/A</td>
</tr>
<tr>
<td>CACMDB</td>
<td>Cache command broadcast</td>
<td>0x1E 03FC</td>
<td>Undefined</td>
</tr>
<tr>
<td>CCAIRB</td>
<td>Cache address/index broadcast</td>
<td>0x1E 03FD</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 03FE–0x1E 03FF</td>
<td>N/A</td>
</tr>
</tbody>
</table>
Cache/Memory System Command/Status (CACMDx, CCAIRx, and CASTATx) Registers

The embedded DRAM refresh rate and other embedded DRAM interface controls are applied to the interface using memory system control and status registers for each internal memory block. These registers include:

- Cache Status (CASTATx) registers
- Memory System Command (CACMDx) registers
- Cache Address/Index (CCAIRx) registers

The value for \( x \) can be 0, 2, 4, 6, 8, 10 or B (for broadcast to all blocks). For example, CACMD0 is the memory system command register for memory block 0, CACMD2 is the memory system command register for memory block 2, and so on. This numbering applies to the CCAIRx and CASTATx registers as well.

Note that reading a memory block on the cycle following a write to that blocks cache control registers causes a four cycle stall.

The CASTATx register is a read only register that indicates the cache status. This register can be read using the \( \text{Ureg} = \text{Ureg} \) register transfer instruction. For bit definitions of these registers, see Figure 2-11, Figure 2-12, and DEFTS201.H bit definitions file in the ADSP-TS201 TigerSHARC Processor Programming Reference.

The bits in the CACMDx register are set or cleared using memory system commands, register transfer instructions (CACMDx = IALU_DATA_Ureg), or register immediate load instructions (CACMDx = Imm). For bit definitions of the commands for the CACMDx registers, see Table 2-16.

The bits in the CCAIRx register only may be set or cleared using register transfer instructions (CCAIRx = IALU_DATA_Ureg).
Memory and Registers

There are different types of memory system commands. These commands are described in the “Memory and Buses” chapter in the *ADSP-TS201 TigerSHARC Processor Programming Reference*:

- Refresh rate select
- Cache enable
- Cache disable
- Set bus/cacheability (for bus transactions)
- Cache lock start
- Cache lock end
- Cache initialize (from memory)
- Cache copyback (to memory)
- Cache invalidate

Most of these instructions are executed as soon as the Memory System Command register is written (EX2 instruction pipeline stage executes the writes to CACMDx). Some of these Memory System Command register values are executed as a sequence, and the execution is in the background to normal program execution. These commands are cache initialize, cache copyback, and cache invalidate. While the command is executed, other memory accesses (to the cache) are allowed. The accesses of core and system (DMA or external master) are in higher priority than the command execution. An instruction fetch has the same priority as memory system command execution. Therefore, the selection between the two sources – fetch or execute – (if there is a conflict) takes turns.

---

1 Where *IALU_DATA_Ureg* is any IALU data register (J30–0 or K30–0)
Universal (Ureg) Register Space

Reserved

CASTAT_I_CACHING – I-bus (read only)
Caching Status, 0=no caching, 1=caching

CASTAT_S_CACHING – S-bus Caching Status, 00=no caching, 01=cache read,
10=cache write, 11=cache read/write

CASTAT_J_CACHING – J-bus Caching Status, 00=no caching, 01=cache read,
10=cache write, 11=cache read/write

CASTAT_K_CACHING – K-bus Caching Status, 00=no caching, 01=cache read,
10=cache write, 11=cache read/write

CASTAT_STL_ACTIVE – Cache Stall Mode
Active, 1=stall mode active, 0=not stall mode inactive

Reserved

CASTAT_COM_ABRTD – command aborted;
1 – indicates an aborted write attempt to a cache register during memory system command without STALL option

CASTAT_COM_ACTIVE – command active;
1 – indicates an ongoing memory or cache instruction

Figure 2-11. CASTATx (Upper) Register Bit Descriptions
Memory and Registers

Figure 2-12. CASTATx (Lower) Register Bit Descriptions

Table 2-16. CACMDx Register Bit Definitions

<table>
<thead>
<tr>
<th>Command Mnemonic</th>
<th>Opcode Bits</th>
<th>31–26</th>
<th>25–24</th>
<th>23–15</th>
<th>14</th>
<th>13–0</th>
</tr>
</thead>
<tbody>
<tr>
<td>CACMD_REFRESH</td>
<td>010100</td>
<td>00</td>
<td>000000000</td>
<td>0</td>
<td></td>
<td></td>
</tr>
<tr>
<td>CACMD_EN</td>
<td>000000</td>
<td>00</td>
<td>000000000</td>
<td>0</td>
<td>00 0000 0000 0000</td>
<td></td>
</tr>
<tr>
<td>CACMD_DIS</td>
<td>000001</td>
<td>00</td>
<td>000000000</td>
<td>0</td>
<td>00 0000 0000 0000</td>
<td></td>
</tr>
<tr>
<td>CACMD_SLOCK</td>
<td>000010</td>
<td>00</td>
<td>000000000</td>
<td>0</td>
<td>00 0000 0000 0000</td>
<td></td>
</tr>
<tr>
<td>CACMD_ELOCK</td>
<td>000011</td>
<td>00</td>
<td>000000000</td>
<td>0</td>
<td>00 0000 0000 0000</td>
<td></td>
</tr>
<tr>
<td>CACHE_INIT</td>
<td>010000</td>
<td>00</td>
<td>Length</td>
<td>Stall_Mode</td>
<td>00 0000 0000 0000</td>
<td></td>
</tr>
<tr>
<td>CACHE_INIT_LOCK</td>
<td>010001</td>
<td>00</td>
<td>Length</td>
<td>Stall_Mode</td>
<td>00 0000 0000 0000</td>
<td></td>
</tr>
<tr>
<td>CACMD_CB</td>
<td>000100</td>
<td>00</td>
<td>Length</td>
<td>Stall_Mode</td>
<td>00 0000 0000 0000</td>
<td></td>
</tr>
<tr>
<td>CACMD_INV</td>
<td>000101</td>
<td>00</td>
<td>Length</td>
<td>Stall_Mode</td>
<td>00 0000 0000 0000</td>
<td></td>
</tr>
<tr>
<td>CACMD_SET_BUS</td>
<td>000111</td>
<td>00</td>
<td>Caching(^1)</td>
<td>0</td>
<td>00 0000 0000 0000</td>
<td></td>
</tr>
</tbody>
</table>

\(^1\) Where caching sets bus caching ability for each bus; I-bus (bit 21) 0=no caching, 1=read caching; S-bus (bits 20–19), J-bus (bits 18–17), and K-bus (bits 16–) 00=no caching, 01 read caching, 10=write caching, and 11=read/write caching
For the cache initialize, cache copyback, and cache invalidate commands, the CCAIRx register must be preloaded to hold either a start address or a start index depending on the command being executed in the corresponding CACMDx register.

In these cases, the bits in CCAIRx hold the following:

- When used as a start address, CCAIRx holds a 14-bit address, using bits 16–3 (bits 2–0 are ignored)
- When used as a start index, CCAIRx holds a 7-bit index, using bits 16–10 (bits 9–0 are ignored)

While a long execution command is in progress, the CACMDx and CCAIRx registers may not be changed. For more information, see the Stall_Mode option for cache initialize (from memory), cache copyback (to memory), and cache invalidate in the “Memory and Buses” chapter in the ADSP-TS201 TigerSHARC Processor Programming Reference.
Interrupt Register Groups

These 32-bit register groups are dedicated to interrupt vector, interrupt control, and status registers. See Table 2-17, Table 2-18, and Table 2-19. All interrupt control registers are accessible by single-word access only. For more information regarding the interrupt registers, see “Program Sequencer” in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Interrupt Vector Table Register Groups

These 32-bit register groups are dedicated to interrupt vectors and control—see Table 2-17, Table 2-18, Table 2-19, and “Interrupt Vector Table” on page 6-17.

Table 2-17. Interrupt Vector Table Register Group A

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>IVKERNEL</td>
<td>VDK kernel interrupt</td>
<td>0x1F 0300</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0301</td>
<td>N/A</td>
</tr>
<tr>
<td>IVTIMER0LP</td>
<td>Timer #0 low priority</td>
<td>0x1F 0302</td>
<td>Undefined</td>
</tr>
<tr>
<td>IVTIMER1LP</td>
<td>Timer #1 low priority</td>
<td>0x1F 0303</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0304–0x1F 0305</td>
<td>N/A</td>
</tr>
<tr>
<td>IVLINK0</td>
<td>Link #0 Reg</td>
<td>0x1F 0306</td>
<td>Undefined</td>
</tr>
<tr>
<td>IVLINK1</td>
<td>Link #1 Reg</td>
<td>0x1F 0307</td>
<td>Undefined</td>
</tr>
<tr>
<td>IVLINK2</td>
<td>Link #2 Reg</td>
<td>0x1F 0308</td>
<td>Undefined</td>
</tr>
<tr>
<td>IVLINK3</td>
<td>Link #3 Reg</td>
<td>0x1F 0309</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 030A–0x1F 030D</td>
<td>N/A</td>
</tr>
<tr>
<td>IVDMA0</td>
<td>DMA #0 Reg</td>
<td>0x1F 030E</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>IVDMA1</td>
<td>DMA #1 Reg</td>
<td>0x1F 030F</td>
<td>0x0000 0000</td>
</tr>
</tbody>
</table>
### Universal (Ureg) Register Space

#### Table 2-17. Interrupt Vector Table Register Group A (Cont'd)

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>IVDMA2</td>
<td>DMA #2 Reg</td>
<td>0x1F 0310</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>IVDMA3</td>
<td>DMA #3 Reg</td>
<td>0x1F 0311</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0312–0x1F 0315</td>
<td>N/A</td>
</tr>
<tr>
<td>IVDMA4</td>
<td>DMA #4 Reg</td>
<td>0x1F 0316</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>IVDMA5</td>
<td>DMA #5 Reg</td>
<td>0x1F 0317</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>IVDMA6</td>
<td>DMA #6 Reg</td>
<td>0x1F 0318</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>IVDMA7</td>
<td>DMA #7 Reg</td>
<td>0x1F 0319</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 031A–0x1F 031C</td>
<td>N/A</td>
</tr>
<tr>
<td>IVDMA8</td>
<td>DMA #8 Reg</td>
<td>0x1F 031D</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>IVDMA9</td>
<td>DMA #9 Reg</td>
<td>0x1F 031E</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>IVDMA10</td>
<td>DMA #10 Reg</td>
<td>0x1F 031F</td>
<td>0x0000 0000</td>
</tr>
</tbody>
</table>

#### Table 2-18. Interrupt Vector Table Register Group B

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>IVDMA11</td>
<td>DMA #11 Reg</td>
<td>0x1F 0320</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0321–0x1F 0324</td>
<td>N/A</td>
</tr>
<tr>
<td>IVDMA12</td>
<td>DMA #12 Reg</td>
<td>0x1F 0325</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>IVDMA13</td>
<td>DMA #13 Reg</td>
<td>0x1F 0326</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0327–0x1F 0328</td>
<td>N/A</td>
</tr>
<tr>
<td>IVIRQ0</td>
<td>IRQ0 Reg pin</td>
<td>0x1F 0329</td>
<td>0x3000 0000</td>
</tr>
<tr>
<td>IVIRQ1</td>
<td>IRQ1 Reg pin</td>
<td>0x1F 032A</td>
<td>0x3800 0000</td>
</tr>
<tr>
<td>IVIRQ2</td>
<td>IRQ2 Reg pin</td>
<td>0x1F 032B</td>
<td>0x8000 0000</td>
</tr>
<tr>
<td>IVIRQ3</td>
<td>IRQ3 Reg pin</td>
<td>0x1F 032C</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 032D–0x1F 032F</td>
<td>N/A</td>
</tr>
</tbody>
</table>
Memory and Registers

Table 2-18. Interrupt Vector Table Register Group B (Cont’d)

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>VIRPT</td>
<td>VIRPT (vector Reg)</td>
<td>0x1F 0330</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0331</td>
<td>N/A</td>
</tr>
<tr>
<td>IVBUSLK</td>
<td>Bus lock vector</td>
<td>0x1F 0332</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0333</td>
<td>N/A</td>
</tr>
<tr>
<td>IVTIMER0HP</td>
<td>Timer 0 high priority</td>
<td>0x1F 0334</td>
<td>Undefined</td>
</tr>
<tr>
<td>IVTIMER1HP</td>
<td>Timer 1 high priority</td>
<td>0x1F 0335</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0336–0x1F 0338</td>
<td>N/A</td>
</tr>
<tr>
<td>IVHW</td>
<td>Hardware error</td>
<td>0x1F 0339</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 033A–0x1F 033F</td>
<td>N/A</td>
</tr>
</tbody>
</table>

Table 2-19. Interrupt Control Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>ILATL</td>
<td>ILAT, low half</td>
<td>0x1F 0340</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>ILATH</td>
<td>ILAT high half</td>
<td>0x1F 0341</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>ILATSTL</td>
<td>ILAT low half, set</td>
<td>0x1F 0342</td>
<td>Undefined</td>
</tr>
<tr>
<td>ILATSTH</td>
<td>ILAT high half, set</td>
<td>0x1F 0343</td>
<td>Undefined</td>
</tr>
<tr>
<td>ILATCLL</td>
<td>ILAT low half, clear</td>
<td>0x1F 0344</td>
<td>Undefined</td>
</tr>
<tr>
<td>ILATCLH</td>
<td>ILAT high half, clear</td>
<td>0x1F 0345</td>
<td>Undefined</td>
</tr>
<tr>
<td>PMASKL</td>
<td>PMASK low half</td>
<td>0x1F 0346</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>PMASKH</td>
<td>PMASK high half</td>
<td>0x1F 0347</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>IMASKL</td>
<td>IMASK low half</td>
<td>0x1F 0348</td>
<td>0xE3C3 C000</td>
</tr>
<tr>
<td>IMASKH</td>
<td>IMASK high half</td>
<td>0x1F 0349</td>
<td>See Description</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 034A–0x1F 034D</td>
<td>N/A</td>
</tr>
<tr>
<td>INTCTL</td>
<td>Interrupt control</td>
<td>0x1F 034E</td>
<td>See Definition</td>
</tr>
</tbody>
</table>
### Universal (Ureg) Register Space

Table 2-19. Interrupt Control Register Group (Cont’d)

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 034F</td>
<td>N/A</td>
</tr>
<tr>
<td>TIMER0L</td>
<td>Timer 0 run val. low half</td>
<td>0x1F 0350</td>
<td>Undefined</td>
</tr>
<tr>
<td>TIMER0H</td>
<td>Timer 0 run val. high half</td>
<td>0x1F 0351</td>
<td>Undefined</td>
</tr>
<tr>
<td>TIMER1L</td>
<td>Timer 1 run val. low half</td>
<td>0x1F 0352</td>
<td>Undefined</td>
</tr>
<tr>
<td>TIMER1H</td>
<td>Timer 1 run val. high half</td>
<td>0x1F 0353</td>
<td>Undefined</td>
</tr>
<tr>
<td>TMRIN0L</td>
<td>Timer 0 init. val. low half</td>
<td>0x1F 0354</td>
<td>Undefined</td>
</tr>
<tr>
<td>TMRIN0H</td>
<td>Timer 0 init. val. high half</td>
<td>0x1F 0355</td>
<td>Undefined</td>
</tr>
<tr>
<td>TMRIN1L</td>
<td>Timer 1 init. val. low half</td>
<td>0x1F 0356</td>
<td>Undefined</td>
</tr>
<tr>
<td>TMRIN1H</td>
<td>Timer 1 init. val. high half</td>
<td>0x1F 0357</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0358–0x1F 035F</td>
<td>N/A</td>
</tr>
</tbody>
</table>
Interrupt Control (INTCTL) Register

The INTCTL register is a 32-bit register that controls interrupt pin sensitivity (level- or edge-sensitive interrupts) and provides run/stop control for the timers. The initial value of IMASKH after reset differs depending for normal reset versus reset with IRQEN (interrupt enable) strap pin active. (See Figure 2-13.) For more information, see “INTCTL Register” on page 6-5 and the “Program Sequencer” chapter of the ADSP-TS201 TigerSHARC Processor Programming Reference.

Figure 2-13. INCTL (Lower) Register Bit Descriptions

1 Bits 31–16 (not shown) are reserved (=0).
2 If the IRQEN strap pin =1 at reset, the reset value for INTCTL is 0x0000 000F.
   If the IRQEN strap pin =0 at reset, the reset value for INTCTL is 0x0000 0000.
Universal (Ureg) Register Space

Interrupt Latch (ILATL/ILATH) Registers

The ILAT register is a single 64-bit register accessed as two 32-bit registers ILATH and ILATL.

- ILATL—used to read the ILATL register—read only register; reset value 0x0
- ILATH—used to read the ILATH register—read only register; reset value 0x0

For bit definitions of these registers, see Figure 2-14, Figure 2-15, Figure 2-16, and Figure 2-17. For more information on using these registers with interrupt processing, see “ILAT Register” on page 6-7.

Each bit is dedicated to an interrupt. When an interrupt occurs, the corresponding bit is set. The interrupt bits order is set according to the interrupt priority—bit 0 having lowest priority.

The ILAT register can be written through the set (ILATSTL or ILATSTH) and clear (ILATCLL or ILATCLH) addresses.

- ILATSTL low half, set—used to write to ILATL—write only register
- ILATSTH high half, set—used to write to ILATH—write only register
- ILATCLL low half, clear—used to clear ILATL—write only register
- ILATCLH high half, clear—used to clear ILATH—write only register

Writing to the set addresses updates the ILAT register to the OR of the old and written values. As a result, every set bit in the written data is set the corresponding bit in the ILAT register. Writing to the clear address will AND the data written with the old data of the ILAT register. In this case, a zero bit in the input data clears the corresponding bit in ILAT, while a set bit keeps it unchanged. The consequences and restrictions of these accesses are detailed in “ILAT Register” on page 6-7.
Interrupt Pointer Mask (PMASKL/PMASKH) Registers

The PMASK register is a single 64-bit register that is accessed as two 32-bit registers, PMASKL and PMASKH.

- PMASKL low half—read only; reset value 0x0
- PMASKH high half—read only; reset value 0x0

For bit definitions of these registers, see Figure 2-14, Figure 2-15, Figure 2-16, and Figure 2-17. For more information on using these registers with interrupt processing, see “PMASK Register” on page 6-6.

Interrupt Mask (IMASKL/IMASKH) Registers

The IMASK register is a single 64-bit register accessed as two 32-bit registers, IMASKL and IMASKH. The initial value of IMASKH after reset differs depending for normal reset versus reset with IRQEN strap pin enable active.

- IMASKL low half—reset value 0xE3C3C000
- IMASKH high half—reset value 0x00010061 or 0x00011E61 (according to IRQEN strap pin state at reset)

For bit definitions of these registers, see Figure 2-14, Figure 2-15, Figure 2-16, and Figure 2-17. For more information on using these registers with interrupt processing, see “IMASK Register” on page 6-5.
Universal (Ureg) Register Space

Figure 2-14. IMASKH, ILATH, PMASKH (Upper) Register Bit Descriptions

Figure 2-15. IMASKL, ILATL, PMASKL (Upper) Register Bit Descriptions
Memory and Registers

Figure 2-16. IMASKH, ILATH, PMASKH (Lower) Register Bit Descriptions

Figure 2-17. IMASKL, ILATL, PMASKL (Lower) Register Bit Descriptions
Universal (Ureg) Register Space

Timer Interrupt (TIMERxH/L) Registers

The TIMERx registers are single 64-bit registers accessed as two 32-bit registers, TIMERxL and TIMERxH. The TIMERx registers contain the running value for the timers.

- TIMER0L Timer 0 running value low half—read only
- TIMER0H Timer 0 running value high half—read only
- TIMER1L Timer 1 running value low half—read only
- TIMER1H Timer 1 running value high half—read only

For more information on using these registers with interrupt processing, see “Timer Expired Interrupts” on page 6-19. For more information on using these registers with timer operations, see “Timers” on page 4-1.

Timer Initial Value (TMRINxH/L) Registers

The TMRINx registers are single 64-bit registers accessed as two 32-bit registers, TMRINxL and TMRINxH. The TMRINx registers contain the initial value for the timers.

- TMRIN0L Timer 0 initial value low half
- TMRIN0H Timer 0 initial value high half
- TMRIN1L Timer 1 initial value low half
- TMRIN1H Timer 1 initial value high half

For more information on using these registers with interrupt processing, see “Timer Expired Interrupts” on page 6-19. For more information on using these registers with timer operations, see “Timers” on page 4-1.
Memory and Registers

DMA Control and Status Register Group

The DMA register groups are described in Table 2-21 on page 2-56 to Table 2-24 on page 2-60. The DMA TCB registers are only accessible via quad-word accesses.

Table 2-20. DMA Control Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>DCNT</td>
<td>DMA control</td>
<td>0x1F 0060</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0061–0x1F 0063</td>
<td>N/A</td>
</tr>
<tr>
<td>DCNTST</td>
<td>DMA control, set bits</td>
<td>0x1F 0064</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0065–0x1F 0067</td>
<td>N/A</td>
</tr>
<tr>
<td>DCNTCL</td>
<td>DMA control, clr bits</td>
<td>0x1F 0068</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0069–0x1F 006B</td>
<td>N/A</td>
</tr>
<tr>
<td>DSTAT</td>
<td>DMA status</td>
<td>0x1F 006C</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 006D–0x1F 006F</td>
<td>N/A</td>
</tr>
<tr>
<td>DSTATCL</td>
<td>DMA status, clear bits</td>
<td>0x1F 0070</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0071–0x1F 007F</td>
<td>N/A</td>
</tr>
</tbody>
</table>

DMA Control (DCNT) Register

The DMA control register shown in Table 2-20 on page 2-51 has three addresses.
Universal (Ureg) Register Space

The addresses are:

- **DCNT** regular address – read and write; reset value = 0x0000 0000.

- **DCNTST** set bits – write. When writing to this address, the register’s old value is OR’ed with the data written into it. Therefore to set a bit, write a “1” logic value to the desired bit location. To leave the bit value intact, write a logic “0”.

- **DCNTCL** clear bits – write. When writing to this address, the register’s old value is AND’ed with the data written into it. Therefore, to clear a specific bit, write a logic “0” to the desired bit location. To retain the bit’s original value, write a logic “1”.

See “Direct Memory Access” on page 7-1 for complete DMA register definitions.
DMA Status (DSTAT) Register

The DMA status register DSTAT is read only, and has two addresses. When reading the status register from the clear bits address DSTAT, all the error codes in it are cleared, and changed to idle state DSTATCL. Note that the DMA control registers (DCNT, DCNTST, DCNTCL) can be accessed as normal, long, or quad-words. Also, note that the DMA status registers (DSTAT, DSTATCL) can be accessed as long or quad-words.
Universal (Ureg) Register Space

Figure 2-20. DSTATH (Upper) Register Bit Descriptions

Figure 2-21. DSTATL (Upper) Register Bit Descriptions
Memory and Registers

Figure 2-22. DSTATH (Lower) Register Bit Descriptions

Figure 2-23. DSTATL (Lower) Register Bit Descriptions

1 CHx=100 status occurs for initialization of an active TCB with an active value. Clear this status by reading from the DSTATC register.
2 CHx=101 status could be as a result of setting the incorrect type field for an AutoDMA channel. Clear this status by reading from the DSTATL register.
3 CHx=111 status occurs if a source TCB address field is in the range 0x1C000000–0x1FFFFFFF
External Port DMA TCB Register Group

Note that DMA registers can be accessed only as quad-words.

Table 2-21. External Port DMA TCB Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>DCS0:DI</td>
<td>DMA chan. 0 source TCB, index</td>
<td>0x1F 0000</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DCS0:DX</td>
<td>DMA chan. 0 source TCB, x-dim</td>
<td>0x1F 0001</td>
<td>0x0100 0004</td>
</tr>
<tr>
<td>DCS0:DY</td>
<td>DMA chan. 0 source TCB, y-dim</td>
<td>0x1F 0002</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DCS0:DP</td>
<td>DMA chan. 0 source TCB, prog.</td>
<td>0x1F 0003</td>
<td>0xD300 0000</td>
</tr>
<tr>
<td>DCD0:DI</td>
<td>DMA chan. 0 destin. TCB, index</td>
<td>0x1F 0004</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DCD0:DX</td>
<td>DMA chan. 0 destin. TCB, x-dim</td>
<td>0x1F 0005</td>
<td>0x0100 0004</td>
</tr>
<tr>
<td>DCD0:DY</td>
<td>DMA chan. 0 destin. TCB, y-dim</td>
<td>0x1F 0006</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DCD0:DP</td>
<td>DMA chan. 0 destin. TCB, prog.</td>
<td>0x1F 0007</td>
<td>0x5300 0000</td>
</tr>
<tr>
<td>DCS1:DI</td>
<td>DMA chan. 1 source TCB, index</td>
<td>0x1F 0008</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCS1:DX</td>
<td>DMA chan. 1 source TCB, x-dim</td>
<td>0x1F 0009</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCS1:DY</td>
<td>DMA chan. 1 source TCB, y-dim</td>
<td>0x1F 000A</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCS1:DP</td>
<td>DMA chan. 1 source TCB, prog.</td>
<td>0x1F 000B</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCD1:DI</td>
<td>DMA chan. 1 destin. TCB, index</td>
<td>0x1F 000C</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCD1:DX</td>
<td>DMA chan. 1 destin. TCB, x-dim</td>
<td>0x1F 000D</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCD1:DY</td>
<td>DMA chan. 1 destin. TCB, y-dim</td>
<td>0x1F 000E</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCD1:DP</td>
<td>DMA chan. 1 destin. TCB, prog.</td>
<td>0x1F 000F</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCS2:DI</td>
<td>DMA chan. 2 source TCB, index</td>
<td>0x1F 0010</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCS2:DX</td>
<td>DMA chan. 2 source TCB, x-dim</td>
<td>0x1F 0011</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCS2:DY</td>
<td>DMA chan. 2 source TCB, y-dim</td>
<td>0x1F 0012</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCS2:DP</td>
<td>DMA chan. 2 source TCB, prog.</td>
<td>0x1F 0013</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCD2:DI</td>
<td>DMA chan. 2 destin. TCB, index</td>
<td>0x1F 0014</td>
<td>Undefined</td>
</tr>
</tbody>
</table>
Memory and Registers

Table 2-21. External Port DMA TCB Register Group (Cont’d)

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>DCD2:DX</td>
<td>DMA chan. 2 destin. TCB, x-dim</td>
<td>0x1F 0015</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCD2:DY</td>
<td>DMA chan. 2 destin. TCB, y-dim</td>
<td>0x1F 0016</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCD2:DP</td>
<td>DMA chan. 2 destin. TCB, prog.</td>
<td>0x1F 0017</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCS3:DI</td>
<td>DMA chan. 3 source TCB, index</td>
<td>0x1F 0018</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCS3:DX</td>
<td>DMA chan. 3 source TCB, x-dim</td>
<td>0x1F 0019</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCS3:DY</td>
<td>DMA chan. 3 source TCB, y-dim</td>
<td>0x1F 001A</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCS3:DP</td>
<td>DMA chan. 3 source TCB, prog.</td>
<td>0x1F 001B</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCD3:DI</td>
<td>DMA chan. 3 destin. TCB, index</td>
<td>0x1F 001C</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCD3:DX</td>
<td>DMA chan. 3 destin. TCB, x-dim</td>
<td>0x1F 001D</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCD3:DY</td>
<td>DMA chan. 3 destin. TCB, y-dim</td>
<td>0x1F 001E</td>
<td>Undefined</td>
</tr>
<tr>
<td>DCD3:DP</td>
<td>DMA chan. 3 destin. TCB, prog.</td>
<td>0x1F 001F</td>
<td>Undefined</td>
</tr>
</tbody>
</table>

1 This default value only applies when EPROM boot mode select is active. Otherwise, the TY field is zero.

Link Port Transmit DMA TCB Register Group

Note that DMA registers can be accessed only as quad-words.

Table 2-22. Link Tx DMA TCB Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>DC4:DI</td>
<td>DMA channel 4 TCB, index</td>
<td>0x1F 0020</td>
<td>Undefined</td>
</tr>
<tr>
<td>DC4:DX</td>
<td>DMA channel 4 TCB, x-dim</td>
<td>0x1F 0021</td>
<td>Undefined</td>
</tr>
<tr>
<td>DC4:DY</td>
<td>DMA channel 4 TCB, y-dim</td>
<td>0x1F 0022</td>
<td>Undefined</td>
</tr>
<tr>
<td>DC4:DP</td>
<td>DMA channel 4 TCB, prog.</td>
<td>0x1F 0023</td>
<td>Undefined</td>
</tr>
<tr>
<td>DC5:DI</td>
<td>DMA channel 5 TCB, index</td>
<td>0x1F 0024</td>
<td>Undefined</td>
</tr>
<tr>
<td>DC5:DX</td>
<td>DMA channel 5 TCB, x-dim</td>
<td>0x1F 0025</td>
<td>Undefined</td>
</tr>
</tbody>
</table>
Universal (Ureg) Register Space

Table 2-22. Link Tx DMA TCB Register Group  (Cont’d)

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>DC5:DI</td>
<td>DMA channel 5 TCB, index</td>
<td>0x1F 0040</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC5:DX</td>
<td>DMA channel 5 TCB, x-dim</td>
<td>0x1F 0041</td>
<td>0x0100 0004</td>
</tr>
<tr>
<td>DC5:DY</td>
<td>DMA channel 5 TCB, y-dim</td>
<td>0x1F 0042</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC5:DP</td>
<td>DMA channel 5 TCB, prog.</td>
<td>0x1F 0043</td>
<td>0x5780 0000</td>
</tr>
<tr>
<td>DC6:DI</td>
<td>DMA channel 6 TCB, index</td>
<td>0x1F 0044</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC6:DX</td>
<td>DMA channel 6 TCB, x-dim</td>
<td>0x1F 0045</td>
<td>0x0100 0004</td>
</tr>
<tr>
<td>DC6:DY</td>
<td>DMA channel 6 TCB, y-dim</td>
<td>0x1F 0026</td>
<td>Undefined</td>
</tr>
<tr>
<td>DC6:DP</td>
<td>DMA channel 6 TCB, prog.</td>
<td>0x1F 0027</td>
<td>Undefined</td>
</tr>
<tr>
<td>DC7:DI</td>
<td>DMA channel 7 TCB, index</td>
<td>0x1F 0028</td>
<td>Undefined</td>
</tr>
<tr>
<td>DC7:DX</td>
<td>DMA channel 7 TCB, x-dim</td>
<td>0x1F 0029</td>
<td>Undefined</td>
</tr>
<tr>
<td>DC7:DY</td>
<td>DMA channel 7 TCB, y-dim</td>
<td>0x1F 002A</td>
<td>Undefined</td>
</tr>
<tr>
<td>DC7:DP</td>
<td>DMA channel 7 TCB, prog.</td>
<td>0x1F 002B</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0030–0x1F 003F</td>
<td>N/A</td>
</tr>
</tbody>
</table>

Link Port Receive DMA TCB Register Group

Link Input and IFIFO DMA TCBs are DMA channels 8, 9, 10, 11, 12, and 13. Note that DMA registers can be accessed only as quad-words.

Table 2-23. Link Rx DMA/AUTODMA TCB Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>DC8:DI</td>
<td>DMA channel 8 TCB, index</td>
<td>0x1F 0040</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC8:DX</td>
<td>DMA channel 8 TCB, x-dim</td>
<td>0x1F 0041</td>
<td>0x0100 0004</td>
</tr>
<tr>
<td>DC8:DY</td>
<td>DMA channel 8 TCB, y-dim</td>
<td>0x1F 0042</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC8:DP</td>
<td>DMA channel 8 TCB, prog.</td>
<td>0x1F 0043</td>
<td>0x5780 0000</td>
</tr>
<tr>
<td>DC9:DI</td>
<td>DMA channel 9 TCB, index</td>
<td>0x1F 0044</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC9:DX</td>
<td>DMA channel 9 TCB, x-dim</td>
<td>0x1F 0045</td>
<td>0x0100 0004</td>
</tr>
</tbody>
</table>
### Memory and Registers

Table 2-23. Link Rx DMA/AUTODMA TCB Register Group  (Cont’d)

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>DC9:DY</td>
<td>DMA channel 9 TCB, y-dim</td>
<td>0x1F 0046</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC9:DP</td>
<td>DMA channel 9 TCB, prog.</td>
<td>0x1F 0047</td>
<td>0x5780 0000</td>
</tr>
<tr>
<td>DC10:DI</td>
<td>DMA channel 10 TCB, index</td>
<td>0x1F 0048</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC10:DX</td>
<td>DMA channel 10 TCB, x-dim</td>
<td>0x1F 0049</td>
<td>0x0100 0004</td>
</tr>
<tr>
<td>DC10:DY</td>
<td>DMA channel 10 TCB, y-dim</td>
<td>0x1F 004A</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC10:DP</td>
<td>DMA channel 10 TCB, prog.</td>
<td>0x1F 004B</td>
<td>0x5780 0000</td>
</tr>
<tr>
<td>DC11:DI</td>
<td>DMA channel 11 TCB, index</td>
<td>0x1F 004C</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC11:DX</td>
<td>DMA channel 11 TCB, x-dim</td>
<td>0x1F 004D</td>
<td>0x0100 0004</td>
</tr>
<tr>
<td>DC11:DY</td>
<td>DMA channel 11 TCB, y-dim</td>
<td>0x1F 004E</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC11:DP</td>
<td>DMA channel 11 TCB, prog.</td>
<td>0x1F 004F</td>
<td>0x5780 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0050– 0x1F 0057</td>
<td>N/A</td>
</tr>
<tr>
<td>DC12:DI</td>
<td>DMA channel 12 TCB, index</td>
<td>0x1F 0058</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC12:DX</td>
<td>DMA channel 12 TCB, x-dim</td>
<td>0x1F 0059</td>
<td>0x0100 0001</td>
</tr>
<tr>
<td>DC12:DY</td>
<td>DMA channel 12 TCB, y-dim</td>
<td>0x1F 005A</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC12:DP</td>
<td>DMA channel 12 TCB, prog.</td>
<td>0x1F 005B</td>
<td>0x5380 0000</td>
</tr>
<tr>
<td>DC13:DI</td>
<td>DMA channel 13 TCB, index</td>
<td>0x1F 005C</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC13:DX</td>
<td>DMA channel 13 TCB, x-dim</td>
<td>0x1F 005D</td>
<td>0x0100 0001</td>
</tr>
<tr>
<td>DC13:DY</td>
<td>DMA channel 13 TCB, y-dim</td>
<td>0x1F 005E</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>DC13:DP</td>
<td>DMA channel 13 TCB, prog.</td>
<td>0x1F 005F</td>
<td>0x5380 0000</td>
</tr>
</tbody>
</table>
Universal (Ureg) Register Space

AutoDMA Register Group

The AutoDMA registers are used to implement a slave mode DMA (see “Direct Memory Access” on page 7-1). These registers can only be accessed through multiprocessing memory.

Table 2-24. AUTO DMA Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>AUTODMA0:0</td>
<td>AutoDMA reg 0, data bits 31–0</td>
<td>0x1F 03E0</td>
<td>N/A</td>
</tr>
<tr>
<td>AUTODMA0:1</td>
<td>AutoDMA reg 0, data bits 63–32</td>
<td>0x1F 03E1</td>
<td>N/A</td>
</tr>
<tr>
<td>AUTODMA0:2</td>
<td>AutoDMA reg 0, data bits 95–64</td>
<td>0x1F 03E2</td>
<td>N/A</td>
</tr>
<tr>
<td>AUTODMA0:3</td>
<td>AutoDMA reg 0, data bits 127–96</td>
<td>0x1F 03E3</td>
<td>N/A</td>
</tr>
<tr>
<td>AUTODMA1:0</td>
<td>AutoDMA reg 1, data bits 31–0</td>
<td>0x1F 03E4</td>
<td>N/A</td>
</tr>
<tr>
<td>AUTODMA1:1</td>
<td>AutoDMA reg 1, data bits 63–32</td>
<td>0x1F 03E5</td>
<td>N/A</td>
</tr>
<tr>
<td>AUTODMA1:2</td>
<td>AutoDMA reg 1, data bits 95–64</td>
<td>0x1F 03E6</td>
<td>N/A</td>
</tr>
<tr>
<td>AUTODMA1:3</td>
<td>AutoDMA reg 1, data bits 127–96</td>
<td>0x1F 03E7</td>
<td>N/A</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 03E8–0x1F 03FF</td>
<td>N/A</td>
</tr>
</tbody>
</table>

The AutoDMA channel 0 data registers (listed in appear in Table 2-24) contain 32-, 64-, or 128-bit data. When writing to this register, DMA channel 12 transfers the written data into the internal memory address programmed in the DMA. This register cannot be read, and can only be written through the multiprocessing address space. If the DMA is not initialized, the data is lost.

The AutoDMA channel 1 data registers (listed in appear in Table 2-24) contain 32-, 64-, or 128-bit data. When writing to this register, DMA channel 13 transfers the written data into the internal memory address programmed in the DMA. This register cannot be read, and can only be written through the multiprocessing address space. If the DMA is not initialized, the data is lost.
DMA Index (DI), X-Dimension (DX), Y-Dimension (DY), and Pointer (DP) TCB Registers

Each of the four external memory DMA channels is controlled by a transfer control block (TCB) quad-word register pair. In addition, each of the four link ports to or from memory DMA channels is controlled by a TCB quad-word register. Finally, each of the two AutoDMA registers-to-memory DMA channels is controlled by a TCB quad-word register. The registers that make up the TCBs are:

DMA index (DIx) register

This is the 32-bit Index register for the DMA. It can point to the address of external, internal memory, or link ports. (See Figure 2-24 and Figure 2-27.)

DMA X dimension (DXx) register

If a two-dimensional (2D) DMA is disabled, this register contains the 16-bit modify (LSBs) and 16-bit count (MSBs) values for the DMA. If a two-dimensional (2D) DMA is enabled, it contains the 16-bit modify and 16-bit count X dimension values. (See Figure 2-25 and Figure 2-28.) For some transfer types (TY bits in DPx register equal 011 or 111), the DXx holds a 10-bit count (MSBs) and 22-bit modify (LSBs).

DMA Y dimension (DYx) register

This register is used together with the DX register when a two-dimensional DMA is enabled. It contains the 16-bit modify and the 16-bit Y count dimension values. (See Figure 2-26 and Figure 2-29.) For some transfer types (TY bits in DPx register equal 011 or 111), the DYx holds a 10-bit count (MSBs) and 22-bit modify (LSBs).

For more information on using the TCB registers for DMA (and more details on the bits in the DIx, DXx, DYx, and DPx registers), see “Setting Up DMA Transfers” on page 7-15.
Universal (Ureg) Register Space

**Figure 2-24. DI (DMA Index, Upper) Register Bit Descriptions**

**Figure 2-25. DX (DMA X Dimension, Upper) Register Bit Descriptions**

**Figure 2-26. DY (DMA Y Dimension, Upper) Register Bit Descriptions**
Memory and Registers

DMA chaining pointer (DPx) register

This register is split into two fields, where the first is dedicated to DMA control and the second is dedicated to chaining. (See Figure 2-30.)
Universal (Ureg) Register Space

TY – Specifies the device type;
  000 – DMA disabled
  001 – I/O link port DMA
  010 – Internal memory
  011 – Internal memory (larger stride)
  100 – External memory
  101 – External I/O device (Flyby)
  110 – Boot EPROM
  111 – External memory (larger stride)

PR – Determines priority;
  1=high, 0=normal

2DDMA – Two-dimensional DMA;
  1=2 dimensional, 0=1 dimensional

LEN – Operand length;
  00 – Reserved
  01 – Normal (32-bit) word
  10 – Long (64-bit) word
  11 – Quad (128-bit) word

INT – Interrupt enable;
  1=enabled, 0=disabled

DRQ – DMA request enable;
  1=enabled, 0=disabled

CHEN – Chaining enabled
  1=enabled, 0=disabled

CHTG – Chaining destination channel;
  000 – Link Receive Channel 0
  001 – Link Receive Channel 1
  010 – Link Receive Channel 2
  011 – Link Receive Channel 3
  100 – Link Transmit Channel 0
  101 – Link Transmit Channel 1
  110 – Link Transmit Channel 2
  111 – Link Transmit Channel 3

CHPT – Chain pointer address

1 The CHPT field includes bits 18–0 (bits 15–0 not shown).

Figure 2-30. DPx (Upper) Register Bit Descriptions
Memory and Registers

Link Port Register Groups

The Link Port Buffer registers are only accessible as quad words. There are two Link register groups described in Table 2-25 and Table 2-26. Link Control and Status registers can be accessed only as single words. See “Link Ports” on page 9-1 for complete Link register definitions.

Table 2-25. Link Port Receive/Transmit Buffer Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>LBUFTX0:0</td>
<td>Link # 0 Tx data</td>
<td>0x1F 04A0</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX0:1</td>
<td>Link # 0 Tx data</td>
<td>0x1F 04A1</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX0:2</td>
<td>Link # 0 Tx data</td>
<td>0x1F 04A2</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX0:3</td>
<td>Link # 0 Tx data</td>
<td>0x1F 04A3</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX0:0</td>
<td>Link # 0 Rx data; Read only</td>
<td>0x1F 04A4</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX0:1</td>
<td>Link # 0 Rx data; Read only</td>
<td>0x1F 04A5</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX0:2</td>
<td>Link # 0 Rx data; Read only</td>
<td>0x1F 04A6</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX0:3</td>
<td>Link # 0 Rx data; Read only</td>
<td>0x1F 04A7</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX1:0</td>
<td>Link # 1 Tx data</td>
<td>0x1F 04A8</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX1:1</td>
<td>Link # 1 Tx data</td>
<td>0x1F 04A9</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX1:2</td>
<td>Link # 1 Tx data</td>
<td>0x1F 04AA</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX1:3</td>
<td>Link # 1 Tx data</td>
<td>0x1F 04AB</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX1:0</td>
<td>Link # 1 Rx data; Read only</td>
<td>0x1F 04AC</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX1:1</td>
<td>Link # 1 Rx data; Read only</td>
<td>0x1F 04AD</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX1:2</td>
<td>Link # 1 Rx data; Read only</td>
<td>0x1F 04AE</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX1:3</td>
<td>Link # 1 Rx data; Read only</td>
<td>0x1F 04AF</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX2:0</td>
<td>Link # 2 Tx data</td>
<td>0x1F 04B0</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX2:1</td>
<td>Link # 2 Tx data</td>
<td>0x1F 04B1</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX2:2</td>
<td>Link # 2 Tx data</td>
<td>0x1F 04B2</td>
<td>Undefined</td>
</tr>
</tbody>
</table>
Table 2-25. Link Port Receive/Transmit Buffer Register Group (Cont’d)

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>LBUFTX2:3</td>
<td>Link # 2 Tx data</td>
<td>0x1F 04B3</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX2:0</td>
<td>Link # 2 Rx data; Read only</td>
<td>0x1F 04B4</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX2:1</td>
<td>Link # 2 Rx data; Read only</td>
<td>0x1F 04B5</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX2:2</td>
<td>Link # 2 Rx data; Read only</td>
<td>0x1F 04B6</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX2:3</td>
<td>Link # 2 Rx data; Read only</td>
<td>0x1F 04B7</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX3:0</td>
<td>Link # 3 Tx data</td>
<td>0x1F 04B8</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX3:1</td>
<td>Link # 3 Tx data</td>
<td>0x1F 04B9</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX3:2</td>
<td>Link # 3 Tx data</td>
<td>0x1F 04BA</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFTX3:3</td>
<td>Link # 3 Tx data</td>
<td>0x1F 04BB</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX3:0</td>
<td>Link # 3 Rx data; Read only</td>
<td>0x1F 04BC</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX3:1</td>
<td>Link # 3 Rx data; Read only</td>
<td>0x1F 04BD</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX3:2</td>
<td>Link # 3 Rx data; Read only</td>
<td>0x1F 04BE</td>
<td>Undefined</td>
</tr>
<tr>
<td>LBUFRX3:3</td>
<td>Link # 3 Rx data; Read only</td>
<td>0x1F 04BF</td>
<td>Undefined</td>
</tr>
</tbody>
</table>

Table 2-26. Link Control and Status Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>LRCTL0</td>
<td>Link # 0 Rx control</td>
<td>0x1F 00E0</td>
<td>0x0000 0001 (or 0x0000 0011)</td>
</tr>
<tr>
<td>LRCTL1</td>
<td>Link # 1 Rx control</td>
<td>0x1F 00E1</td>
<td>0x0000 0001 (or 0x0000 0011)</td>
</tr>
<tr>
<td>LRCTL2</td>
<td>Link # 2 Rx control</td>
<td>0x1F 00E2</td>
<td>0x0000 0001 (or 0x0000 0011)</td>
</tr>
<tr>
<td>LRCTL3</td>
<td>Link # 3 Rx control</td>
<td>0x1F 00E3</td>
<td>0x0000 0001 (or 0x0000 0011)</td>
</tr>
<tr>
<td>LTCTL0</td>
<td>Link # 0 Tx control</td>
<td>0x1F 00E4</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>LTCTL1</td>
<td>Link # 1 Tx control</td>
<td>0x1F 00E5</td>
<td>0x0000 0000</td>
</tr>
</tbody>
</table>
Table 2-26. Link Control and Status Register Group (Cont’d)

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>LTCTL2</td>
<td>Link # 2 Tx control</td>
<td>0x1F 00E6</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>LTCTL3</td>
<td>Link # 3 Tx control</td>
<td>0x1F 00E7</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 00E6–0x1F 00EF</td>
<td>N/A</td>
</tr>
<tr>
<td>LRSTAT0</td>
<td>Link # 0 Rx status</td>
<td>0x1F 00F0</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>LRSTAT1</td>
<td>Link # 1 Rx status</td>
<td>0x1F 00F1</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>LRSTAT2</td>
<td>Link # 2 Rx status</td>
<td>0x1F 00F2</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>LRSTAT3</td>
<td>Link # 3 Rx status</td>
<td>0x1F 00F3</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>LTSTAT0</td>
<td>Link # 0 Tx status</td>
<td>0x1F 00F4</td>
<td>0x0000 0002</td>
</tr>
<tr>
<td>LTSTAT1</td>
<td>Link # 1 Tx status</td>
<td>0x1F 00F5</td>
<td>0x0000 0002</td>
</tr>
<tr>
<td>LTSTAT2</td>
<td>Link # 2 Tx status</td>
<td>0x1F 00F6</td>
<td>0x0000 0002</td>
</tr>
<tr>
<td>LTSTAT3</td>
<td>Link # 3 Tx status</td>
<td>0x1F 00F7</td>
<td>0x0000 0002</td>
</tr>
<tr>
<td>LRSTATC0</td>
<td>Link # 0 Rx status clear</td>
<td>0x1F 00F8</td>
<td>Undefined</td>
</tr>
<tr>
<td>LRSTATC1</td>
<td>Link # 1 Rx status clear</td>
<td>0x1F 00F9</td>
<td>Undefined</td>
</tr>
<tr>
<td>LRSTATC2</td>
<td>Link # 2 Rx status clear</td>
<td>0x1F 00FA</td>
<td>Undefined</td>
</tr>
<tr>
<td>LRSTATC3</td>
<td>Link # 3 Rx status clear</td>
<td>0x1F 00FB</td>
<td>Undefined</td>
</tr>
<tr>
<td>LTSTATC0</td>
<td>Link # 0 Tx status clear</td>
<td>0x1F 00FC</td>
<td>Undefined</td>
</tr>
<tr>
<td>LTSTATC1</td>
<td>Link # 1 Tx status clear</td>
<td>0x1F 00FD</td>
<td>Undefined</td>
</tr>
<tr>
<td>LTSTATC2</td>
<td>Link # 2 Tx status clear</td>
<td>0x1F 00FE</td>
<td>Undefined</td>
</tr>
<tr>
<td>LTSTATC3</td>
<td>Link # 3 Tx status clear</td>
<td>0x1F 00FF</td>
<td>Undefined</td>
</tr>
</tbody>
</table>
Universal (Ureg) Register Space

Link Receive Control (LRCTLx) Registers

The LRCTLx registers setup receive features for link ports. The LRCTLx registers' reset value is 0x0000 0001, when the LINK_DWIDTH strap pin =0 at reset (RDSIZE=0). When the LINK_DWIDTH strap pin =1 at reset, the LRCTLx registers' reset value is 0x0000 0011 (RDSIZE=1).

![LRCTLx Register Diagram]

Figure 2-31. LRCTLx Low (Lower) Register Bit Descriptions

1 The reset value of the RDSIZE bit depends on the value of the LINK_DWIDTH strap pin at reset. If LINK_DWIDTH=0 at reset, RDSIZE=0 (1 bit data).

2 The upper 16 bits (31–16) of LRCTLx are reserved (=0)
Link Transmit Control (LTCTLx) Registers

The LTCTLx registers setup transmit features for link ports. The LTCTLx registers’ reset value is 0x0000 0001.

- **TEN** Transmit Enable
  - 1 = enable, 0 = disable
- **TVERE** Verification Enable
  - 1 = enable, 0 = disable
- **TTOE** Transmit Time Out check IRQ Enable
  - 1 = enable, 0 = disable
- **TBCMPE** Transmit Block Completion
  - 1 = enable LxBCMPO output,
  - 0 = disable LxBCMPO output
- **TDSIZE** Data Size
  - 1 = 4 bit data, 0 = 1 bit data
- **SPD** Transfer Speed.
  - Sets LxCLKOUT divisor from CCLK:
    - 000: divide CCLK by 1
    - 001: divide CCLK by 1.5
    - 010: divide CCLK by 2
    - 100: divide CCLK by 4
    - 011: (reserved)
    - 101: (reserved)
    - 110: (reserved)
    - 111: (reserved)
- **Reserved (DLLV3–0)**
- **Reserved**

Figure 2-32. LTCTLx Low (Lower) Register Bit Descriptions

1 The upper 16 bits (31–16) of LTCTLx are reserved (=0)
Universal (Ureg) Register Space

Link Receive Status (LRSTATx) Registers

The LRSTATx registers indicate receive status for link ports. The LRSTATx registers’ reset value is 0x0000 0000.

Figure 2-33. LRSTATx Low (Lower) Register Bit Descriptions

1 The upper 16 bits (31–16) of LRSTATx are reserved (=0)
Memory and Registers

Link Transmit Status (LTSTATx) Registers

The LTSTATx registers indicate transmit status for link ports. The LTSTATx registers’ reset value is 0x0000 0002.

Figure 2-34. LTSTATx Low (Lower) Register Bit Descriptions

1 TVACANT=1 indicates that the buffer is vacant, and the transmit DLL is locked. After enabling the transmitter (LTEN=1 in LTCTLx), the transmit DLL requires up to 800 SOCCCLK cycles to stabilize.
2 TEMPTY should be tested for buffer transmitted before disabling the channel.
3 The upper 16 bits (31–16) of LTSTATx are reserved (=0)
### External Bus Interface Register Groups

The external port registers appear in Table 2-27.

#### Table 2-27. Bus Interface Unit Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>SYSCON</td>
<td>System configuration register</td>
<td>0x1F 0080</td>
<td>0XX00 9067</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0081–0x1F 0082</td>
<td>N/A</td>
</tr>
<tr>
<td>BUSLOCK</td>
<td>System control register</td>
<td>0x1F 0083</td>
<td>See Definition</td>
</tr>
<tr>
<td>SDRCON</td>
<td>SDRAM configuration register (see “SDRAM Programming” on page 8-67)</td>
<td>0x1F 0084</td>
<td>See Definition</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0085</td>
<td>N/A</td>
</tr>
<tr>
<td>SYSTAT</td>
<td>System status register; Read only</td>
<td>0x1F 0086</td>
<td>See Definition</td>
</tr>
<tr>
<td>SYSTATCL</td>
<td>System status register; destructive address (clear error); Read only</td>
<td>0x1F 0087</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 0088–0x1F 008B</td>
<td>N/A</td>
</tr>
<tr>
<td>BMAX</td>
<td>Maximum cycle count for bus fairness (see “Bus Fairness (BMAX) Register” on page 8-44)</td>
<td>0x1F 008C</td>
<td>See Definition</td>
</tr>
<tr>
<td>BMAXC</td>
<td>Current count on BMAX; Read only</td>
<td>0x1F 008D</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1F 008E–0x1F 009F</td>
<td>N/A</td>
</tr>
</tbody>
</table>
System Configuration (SYSCON) Register

The SYSCON register defines bus control configuration. The SYSCON register’s reset value = 0xXX00 9067 (where XX—bits 22–31—are undefined). The processor’s mode (User or Supervisor) and the reset state of the SYS_REG_WE strap pin control write access to the SYSCON register. In User mode with the SYS_REG_WE strap pin disabled at reset, the SYSCON register can be written only once after hardware reset and cannot be changed during system operation—subsequent writes to SYSCON are ignored.

To write to the SYSCON register (and SDRCON register) more than once after reset in User mode, enable the SYS_REG_WE strap pin during reset. In Supervisor mode, SYSCON and SDRCON have no write restrictions.

The bit descriptions for this register are shown in Figure 2-35 and Figure 2-36. For more information on using this register, see “SYSCON Register Programming” on page 8-13. Note that bits 22-31 are undefined.
### Universal (Ureg) Register Space

<table>
<thead>
<tr>
<th>Bit 31-16</th>
<th>Description</th>
<th>Width</th>
</tr>
</thead>
<tbody>
<tr>
<td></td>
<td>Reserved</td>
<td>0</td>
</tr>
<tr>
<td>31-30</td>
<td>HSTWIDTH – Host Bus Width</td>
<td>64 bits wide</td>
</tr>
<tr>
<td></td>
<td>1=64 bits wide</td>
<td></td>
</tr>
<tr>
<td></td>
<td>0=32 bits wide</td>
<td></td>
</tr>
<tr>
<td>29-28</td>
<td>MPWIDTH – Multiprocessing Bus Width</td>
<td>64 bits wide</td>
</tr>
<tr>
<td></td>
<td>1=64 bits wide</td>
<td></td>
</tr>
<tr>
<td></td>
<td>0=32 bits wide</td>
<td></td>
</tr>
<tr>
<td>27-26</td>
<td>MEMWIDTH – Memory Bus Width</td>
<td>64 bits wide</td>
</tr>
<tr>
<td></td>
<td>1=64 bits wide</td>
<td></td>
</tr>
<tr>
<td></td>
<td>0=32 bits wide</td>
<td></td>
</tr>
<tr>
<td>25-24</td>
<td>HOSTSLOW – Host Slow Protocol Bit</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td>(See BNK0SLOW bit definition.)</td>
<td></td>
</tr>
<tr>
<td>23-22</td>
<td>HOSTPIPE – Host Pipe Depth Bits</td>
<td>0</td>
</tr>
<tr>
<td></td>
<td>(See BNK0PIPE bits definition.)</td>
<td></td>
</tr>
<tr>
<td>21-18</td>
<td></td>
<td></td>
</tr>
<tr>
<td>17-16</td>
<td></td>
<td></td>
</tr>
</tbody>
</table>

Reserved Bit 15 continued in Figure 2-36

---

Figure 2-35. SYSCON (Upper) Register Bit Descriptions
Figure 2-36. SYSCON (Lower) Register Bit Descriptions

1 If using slow protocol, BNKx/HOSTPIPE bit is cleared.
Universal (Ureg) Register Space

Bus Lock (BUSLOCK) System Control Register

The BUSLOCK register defines the bus lock status. The BUSLOCK register’s reset value is 0xXX00 0000 (where XX is undefined). Note that bits 22-31 are undefined.

![BUSLOCK (Lower) Register Bit Descriptions](image)

Figure 2-37. BUSLOCK (Lower) Register Bit Descriptions

1. The upper 16 bits (31–16) of BUSLOCK are reserved (=0)

SDRAM Configuration (SDRCON) Register

The SDRCON register defines SDRAM configuration. The SDRCON register’s reset value is 0xXX00 0000 (where XX is undefined). Note that this reset value indicates that at reset SDRAM is disabled. The processor’s mode (User or Supervisor) and the reset state of the SYS_REG_WE strap pin control write access to the SDRCON register. In User mode with the SYS_REG_WE strap pin disabled at reset, the SDRCON register can be written only once after hardware reset and cannot be changed during system operation—subsequent writes to SDRCON are ignored.

To write to the SDRCON register (and SYSCON register) more than once after reset in User mode, enable the SYS_REG_WE strap pin during reset. In Supervisor mode, SYSCON and SDRCON have no write restrictions.

The bit descriptions for this register are shown in Figure 2-38 on page 2-77. Note that bits 22–31 are undefined.
Memory and Registers

System Status (SYSTAT) Register

The SYSTAT register is a read-only register that indicates the system/bus status. The SYSTAT register’s reset value is 0xXX00 SS0S (where XX is undefined, and SS is strap pin).

Figure 2-38. SDRCON (Lower) Register Bit Descriptions

1 Bits 31–16 (not shown) are reserved (=0).
Programs can read the SYSTAT register using the name SYSTAT or SYSTATCL (SYSTAT clear). The register name (address) determines whether the error bits (19–16) are cleared after the read. Note that bits 22-31 are undefined. The bit descriptions for this register are shown in Figure 2-39 on page 2-78 and Figure 2-40 on page 2-79.

- **SYSTAT** – no change in register contents
- **SYSTATCL** – error Bits19–16 are cleared after the read

**Figure 2-39. SYSTAT/SYSTATCL (Upper) Register Bit Descriptions**
Memory and Registers

Figure 2-40. SYSTAT/SYSTATCL (Lower) Register Bit Descriptions
Universal (Ureg) Register Space

Bus Master Maximum (BMAX) Register

The BMAX register is set with the maximum number of internal clock cycles (CCLK) for which the TigerSHARC processor is allowed to retain master-ship on the external bus. When reading this address, the returned value is the value written to the BMAX register, not the current cycle count. When the BMAX initialization count (bits 15–0) is 0xFFFF, the BMAX counter is disabled. (See Figure 2-41 on page 2-80.) See “Bus Fairness (BMAX) Register” on page 8-44 for more information.

Figure 2-41. BMAX (Lower) Register Bit Descriptions

1 The upper 16 bits (31–16) of BMAX are reserved (=0)

Bus Master Maximum Current Count (BMAXC) Register

This address is read-only and returns the BMAX counter value when read. The BMAXC register’s reset value = 0xXX00 FFFF (where XX is undefined). Note that bits 22-31 are undefined. (See Figure 2-42 on page 2-80.)

Figure 2-42. BMAXC (Lower) Register Bit Descriptions

1 The upper 16 bits (31–16) of BMAXC are reserved (=0)
### JTAG Test and Emulation Register Groups

#### Table 2-28. JTAG Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>EMUCTL</td>
<td>Emulation Control</td>
<td>0x1F 03A0</td>
<td>0x0000 0020</td>
</tr>
<tr>
<td>EMUSTAT</td>
<td>Emulation Status</td>
<td>0x1F 03A1</td>
<td>0x0000 0020</td>
</tr>
<tr>
<td>EMUDAT</td>
<td>Emulation Data</td>
<td>0x1F 03A2</td>
<td>Undefined</td>
</tr>
<tr>
<td>IDCODE</td>
<td>ID Code</td>
<td>0x1F 03A3</td>
<td>0xX27A F0CB</td>
</tr>
<tr>
<td>EMUIR:0</td>
<td>Emulation Instruction 0</td>
<td>0x1F 03A4</td>
<td>0x33C0 0000</td>
</tr>
<tr>
<td>EMUIR:1</td>
<td>Emulation Instruction 1</td>
<td>0x1F 03A5</td>
<td>0x33C0 0000</td>
</tr>
<tr>
<td>EMUIR:2</td>
<td>Emulation Instruction 2</td>
<td>0x1F 03A6</td>
<td>0x33C0 0000</td>
</tr>
<tr>
<td>EMUIR:3</td>
<td>Emulation Instruction 3</td>
<td>0x1F 03A7</td>
<td>0xB3C0 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>Reserved</td>
<td>0x1F 03A8</td>
<td>N/A</td>
</tr>
<tr>
<td>OSPID</td>
<td>OSPID</td>
<td>0x1F 03A9</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>Reserved</td>
<td>0x1F 03AA–0x1F 03BF</td>
<td>N/A</td>
</tr>
</tbody>
</table>
Emulation Control (EMUCTL) Register

The EMUCTL register sets up emulation features. The EMUCTL register’s reset value is 0x0000 0020. Most bits in the EMUCTL register are used by the emulator and should not be modified by programs. The only bits in the EMUCTL register that are intended for program usage are the SWRST (software reset) bit and the BOOTDIS (boot disable) bit. (See Figure 2-43 on page 2-82.)

### Bit Descriptions

- **EMEN** – Emulation Enable
  - 1=enable, 0=disable
- **TEMEM** – TMS Emulation Exception
  - 1=exception, 0=no exception
- **EMUOE** – Emulation Output Enable
  - 1=enable, 0=disable
- **SPFDIS** – SPF Disable
  - 1=disable, 0=enable
- **SWRST** – Software Reset
  - 1=reset request, 0=no reset request
- **BOOTDIS** – Boot Disable
  - 1=disable, 0=enable
- **Reserved**

Figure 2-43. EMUCTL (Lower) Register Bit Descriptions

1. The BOOTDISL bit is cleared on hardware reset, but not cleared on software reset
2. The upper 16 bits (31–16) of EMUCTL are reserved (=0)
Memory and Registers

**Emulation Status (EMUSTAT) Register**

The `EMUSTAT` register indicates emulation status. The `EMUCTL` register’s reset value is 0x0000 0002. (See Figure 2-44 on page 2-83.)

![EMUSTAT Register Bit Descriptions](image)

<table>
<thead>
<tr>
<th>Bit</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>15-0</td>
<td>Reserved (=0)</td>
</tr>
<tr>
<td>14</td>
<td>EMUMOD – Emulation Mode</td>
</tr>
<tr>
<td></td>
<td>1=emulation, 0=not emulation</td>
</tr>
<tr>
<td>13</td>
<td>IRFREE – Emulation Instruction Register Free</td>
</tr>
<tr>
<td></td>
<td>1=EMUIR ready, 0=EMUIR not ready</td>
</tr>
<tr>
<td>12</td>
<td>INRESET – Emulation Output Enable</td>
</tr>
<tr>
<td></td>
<td>1=reset active, 0=no reset</td>
</tr>
</tbody>
</table>

Figure 2-44. EMUSTAT (Lower) Register Bit Descriptions¹

¹ The upper 16 bits (31–16) of EMUSTAT are reserved (=0)

**Silicon Version and ID Code (IDCODE) Register**

The `IDCODE` register indicates processor model ID and silicon version. The `EMUCTL` register’s reset value is 0xV27A F0CB (where V is version). (See Figure 2-45 on page 2-83.)

![IDCODE Register Bit Descriptions](image)

<table>
<thead>
<tr>
<th>Bit</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>31-0</td>
<td>Reserved (=0)</td>
</tr>
<tr>
<td>29</td>
<td>ID31–28 – Silicon Version</td>
</tr>
<tr>
<td>16</td>
<td>ID27–0 – IDCODE (0x27A F0CB)¹</td>
</tr>
</tbody>
</table>

Figure 2-45. IDCODE (Upper) Register Bit Descriptions

¹ The ID27–0 field includes bits 27–0 (bits 15–0 not shown).
Universal (Ureg) Register Space

Debug Register Groups

The debug groups are described in Table 2-29 on page 2-84. The debug registers can only be accessed as single-word registers. In emulation mode, the debug registers can be accessed only by move, register to register, or immediate data load instructions. These registers cannot be loaded from or stored to memory directly. There is a five cycle stall after any write to the debug group registers.

Table 2-29. Debug Register Group

<table>
<thead>
<tr>
<th>Register</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>WP0CTL</td>
<td>Watchpoint 0 control</td>
<td>0x1E 0360</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>WP1CTL</td>
<td>Watchpoint 1 control</td>
<td>0x1E 0361</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>WP2CTL</td>
<td>Watchpoint 2 control</td>
<td>0x1E 0362</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 0363</td>
<td>N/A</td>
</tr>
<tr>
<td>WP0STAT</td>
<td>Watchpoint 0 status; (RO)</td>
<td>0x1E 0364</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>WP1STAT</td>
<td>Watchpoint 1 status; (RO)</td>
<td>0x1E 0365</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>WP2STAT</td>
<td>Watchpoint 2 status; (RO)</td>
<td>0x1E 0366</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 0367</td>
<td>N/A</td>
</tr>
<tr>
<td>WP0L</td>
<td>Watchpoint 0 low address</td>
<td>0x1E 0368</td>
<td>Undefined</td>
</tr>
<tr>
<td>WP0H</td>
<td>Watchpoint 0 high address</td>
<td>0x1E 0369</td>
<td>Undefined</td>
</tr>
<tr>
<td>WP1L</td>
<td>Watchpoint 1 low address</td>
<td>0x1E 036A</td>
<td>Undefined</td>
</tr>
<tr>
<td>WP1H</td>
<td>Watchpoint 1 high address</td>
<td>0x1E 036B</td>
<td>Undefined</td>
</tr>
<tr>
<td>WP2L</td>
<td>Watchpoint 2 low address</td>
<td>0x1E 036C</td>
<td>Undefined</td>
</tr>
<tr>
<td>WP2H</td>
<td>Watchpoint 2 high address</td>
<td>0x1E 036D</td>
<td>Undefined</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 036E–0x1E 036F</td>
<td>N/A</td>
</tr>
<tr>
<td>CCNT0</td>
<td>Cycle counter low</td>
<td>0x1E 0370</td>
<td>Undefined</td>
</tr>
<tr>
<td>CCNT1</td>
<td>Cycle counter high</td>
<td>0x1E 0371</td>
<td>Undefined</td>
</tr>
<tr>
<td>PRFM</td>
<td>Performance monitor mask</td>
<td>0x1E 0372</td>
<td>0x0000 0000</td>
</tr>
</tbody>
</table>
### Table 2-29. Debug Register Group (Cont'd)

<table>
<thead>
<tr>
<th>Register</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>PRFCNT</td>
<td>Perform. monitor counter</td>
<td>0x1E 0373</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCBMASK</td>
<td>Trace buffer mask (RO)</td>
<td>0x1E 0374</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCBPTR</td>
<td>Trace buffer pointer (RO)</td>
<td>0x1E 0375</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 0376–0x1E 037B</td>
<td>N/A</td>
</tr>
<tr>
<td>TRCBVAL</td>
<td>Trace buffer valid (RO)</td>
<td>0x1E 037C</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>Reserved</td>
<td>N/A</td>
<td>0x1E 037D–0x1E 037F</td>
<td>N/A</td>
</tr>
<tr>
<td>TRCB31–0</td>
<td>Trace buffer (RO)</td>
<td>0x1E 0140–0x1E015F</td>
<td>0x0000 0000</td>
</tr>
</tbody>
</table>
Watchpoint Control (WPxCTL) Registers

Each of the three watchpoints has a control register (WP0CTL, WP1CTL, and WP2CTL) used to define its operation. The WPxCTL registers’ reset values are 0x0000 0000. The bit descriptions for these registers are shown in Figure 2-46, Figure 2-47, and Figure 2-48.

![Figure 2-46. WP0CTL (Lower) Register Bit Descriptions](image)

1 Bits 31–16 (not shown) are the watchpoint counter initialization (CNTINIT15–0) value.
Figure 2-47. WP1CTL (Lower) Register Bit Descriptions

1 Bits 31–16 (not shown) are the watchpoint counter initialization (CNTINIT15–0) value.
Universal (Ureg) Register Space

Figure 2-48. WP2CTL (Lower) Register Bit Descriptions

1 Bits 31–16 (not shown) are the watchpoint counter initialization (CNTINIT15–0) value.
Memory and Registers

Watchpoint Status (WPxSTAT) Registers

Each of the three watchpoints has a status register (WP0STAT, WP1STAT, and WP2STAT) used to indicate its operation. The WPxSTAT registers’ reset values are 0x0000 0000. The bit descriptions for this register are shown in Figure 2-49 on page 2-89.

![Figure 2-49. WPxSTAT (Upper) Register Bit Descriptions](image)

- **Reserved**: 31 30 29 28 27 26 25 24 23 22 21 20 19 18 17 16
- **EX – Execution State**: 00 = Watchpoint disable, 01 = Watchpoint search active, 11 = Watchpoint count expired

Watchpoint Address Pointer (WPxL/WPxH) Registers

The watchpoint address pointers are 32-bit pointers defining the address, or address range, of the watchpoints. Their value after reset is undefined.

Performance Monitor Mask (PRFM) Register

The performance monitor mask (PRFM) register identifies which events are monitored then counted by the performance monitor counter (PRFCNT). The PRFM register’s reset value is 0x0000 0000. Bits 19–16 and 12–0 indicate which events are monitored, and bit 31 indicates if the events in the same cycle are summed or ORed. The bit descriptions for this register are shown in Figure 2-50 on page 2-90 and Figure 2-51 on page 2-91.
Universal (Ureg) Register Space

SUMEN — Sum Events Count Enable
  1 = enable sum
  0 = disable sum

Reserved

CCYCLE — User Clock Cycles
  1 = user clock cycle (instruction executed)
  0 = not user clock cycle

ISL — Instruction Lines
  1 = instruction line aborted
  0 = instruction line executed

BTBPR — BTB Prediction
  1 = BTB prediction TRUE
  0 = BTB prediction FALSE

SCYCLE — Stall Cycles
  1 = stall cycle (instruction stalled)
  0 = not stall cycle

Figure 2-50. PRFM (Upper) Register Bit Descriptions

1 When more than one monitored event occurs at the same time, SUMEN selects whether the count in PRFCNT is a sum of the counts for all of the events (enabled) or an OR of the counts (disabled).
Memory and Registers

Figure 2-51. PRFM (Lower) Register Bit Descriptions

1. If multiple bits are set in the NGR field, the value of the SUMEN bit determines whether the performance count is a sum of the cycles or an OR of the cycles when multiple events occur on the same instruction line. For example, NGR=0110 causes the performance monitor to monitor for both J- and K-bus non granted requests.

2. If multiple bits are set in the GR field, the value of the SUMEN bit determines whether the performance count is a sum of the cycles or an OR of the cycles when multiple events occur on the same instruction line. For example, GR=0110 causes the performance monitor to monitor for both J- and K-bus granted requests.

3. If multiple bits are set in the MODULE field, the value of the SUMEN bit determines whether the performance count is a sum of the cycles or an OR of the cycles when multiple events occur on the same instruction line. For example, MODULE=01100 causes the performance monitor to monitor for both X and Y compute block instruction execution.
Universal (Ureg) Register Space

Performance Monitor Counter (PRFCNT) Register

The purpose of the performance counter is to register the cycle where monitored conditions occur. The monitored condition is identified by the performance monitor mask register ($PRFM$). The register is cleared after reset.

Cycle Counter (CCNTx) Registers

The cycle counter is 64 bits long. The two $Ureg$ registers that make up the cycle counter are $CCNT0$ and $CCNT1$. When JTAG is reset, $CCNT$ is cleared. These two registers cannot be accessed with a long-word access; they can only be accessed in two single accesses. These registers can be written in emulator or supervisor modes only, although they are readable in all modes.

To keep the cycle counter consistent between the two read accesses, an access to cycle counter must read first the low part ($CCNT0$). When the low part is read, the high part ($CCNT1$) is copied into a shadow register, so the high part does not change between the two reads. Another read of the low part again copies the high part to the shadow register, which is always the source of the high part read.
Trace Buffers and Pointer (TRCBx/TRCBPTR) Registers

Each time the TigerSHARC processor executes a non-sequential fetch, the PC of the non-sequentially fetched instruction is written into one of the trace buffer registers. The first write is into trace buffer #0; the second to #1, and so on in a circular manner. The trace buffer pointer (3 bits) identifies the last written trace buffer. For more information, see “Instruction Address Trace Buffer (TBUF)” on page 10-8.

Table 2-30. Trace Buffer Register Group

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>TRCB0</td>
<td>Trace buffer 0; Read only</td>
<td>0x1E 0140</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB1</td>
<td>Trace buffer 1; Read only</td>
<td>0x1E 0141</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB2</td>
<td>Trace buffer 2; Read only</td>
<td>0x1E 0142</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB3</td>
<td>Trace buffer 3; Read only</td>
<td>0x1E 0143</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB4</td>
<td>Trace buffer 4; Read only</td>
<td>0x1E 0144</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB5</td>
<td>Trace buffer 5; Read only</td>
<td>0x1E 0145</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB6</td>
<td>Trace buffer 6; Read only</td>
<td>0x1E 0146</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB7</td>
<td>Trace buffer 7; Read only</td>
<td>0x1E 0147</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB8</td>
<td>Trace buffer 8; Read only</td>
<td>0x1E 0148</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB9</td>
<td>Trace buffer 9; Read only</td>
<td>0x1E 0149</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB10</td>
<td>Trace buffer 10; Read only</td>
<td>0x1E 014A</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB11</td>
<td>Trace buffer 11; Read only</td>
<td>0x1E 014B</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB12</td>
<td>Trace buffer 12; Read only</td>
<td>0x1E 014C</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB13</td>
<td>Trace buffer 13; Read only</td>
<td>0x1E 014D</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB14</td>
<td>Trace buffer 14; Read only</td>
<td>0x1E 014E</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB15</td>
<td>Trace buffer 15; Read only</td>
<td>0x1E 014F</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB16</td>
<td>Trace buffer 16; Read only</td>
<td>0x1E 0150</td>
<td>0x0000 0000</td>
</tr>
</tbody>
</table>
### Universal (Ureg) Register Space

Table 2-30. Trace Buffer Register Group (Cont’d)

<table>
<thead>
<tr>
<th>Name</th>
<th>Description</th>
<th>Address</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>TRCB17</td>
<td>Trace buffer 17; Read only</td>
<td>0x1E 0151</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB18</td>
<td>Trace buffer 18; Read only</td>
<td>0x1E 0152</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB19</td>
<td>Trace buffer 19; Read only</td>
<td>0x1E 0153</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB20</td>
<td>Trace buffer 20; Read only</td>
<td>0x1E 0154</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB21</td>
<td>Trace buffer 21; Read only</td>
<td>0x1E 0155</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB22</td>
<td>Trace buffer 22; Read only</td>
<td>0x1E 0156</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB23</td>
<td>Trace buffer 23; Read only</td>
<td>0x1E 0157</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB24</td>
<td>Trace buffer 24; Read only</td>
<td>0x1E 0158</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB25</td>
<td>Trace buffer 25; Read only</td>
<td>0x1E 0159</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB26</td>
<td>Trace buffer 26; Read only</td>
<td>0x1E 015A</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB27</td>
<td>Trace buffer 27; Read only</td>
<td>0x1E 015B</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB28</td>
<td>Trace buffer 28; Read only</td>
<td>0x1E 015C</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB29</td>
<td>Trace buffer 29; Read only</td>
<td>0x1E 015D</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB30</td>
<td>Trace buffer 30; Read only</td>
<td>0x1E 015E</td>
<td>0x0000 0000</td>
</tr>
<tr>
<td>TRCB31</td>
<td>Trace buffer 31; Read only</td>
<td>0x1E 015F</td>
<td>0x0000 0000</td>
</tr>
</tbody>
</table>
3 SOC INTERFACE

The system-on-chip (SOC) interface connects the processor core’s buses (J-, K-, I-, and S-bus) to the I/O peripherals’ SOC bus. The SOC bus is 128-bits wide, and its clock (SOCCLK) operates at one half the processor core clock (CCLK) rate. The peripherals that can be masters or slaves on the SOC bus are:

- SOC interface
- External port (external bus interface unit, EBIU)
- DMA unit

The peripherals that can be only slaves on the SOC bus are:

- Link ports
- Interrupt controller
- Timers
- Flags
- JTAG port

Figure 3-1 shows a block diagram for these buses, bus masters, and bus slaves.
Figure 3-1. SOC Interface and SOC Bus Connections
SOC Interface

The SOC bus has a fixed priority scheme to arbitrate among the masters that request to perform a transaction on the SOC bus. From highest to lowest the SOC bus request priority is:

1. External port – EBIU – high priority (highest SOC bus priority)
   
   An external port (EBIU) SOC bus request is high priority when:
   
   - The EBIU IFIFO has more than two waiting transactions
   - A write transaction has been waiting unserved for 10 CCLK cycles
   - The EBIU IFIFO returns DMA data (from a high-priority DMA)

2. DMA high priority

   A DMA request for the SOC bus is high priority when the TCB of either source or destination is set to high priority.

3. SOC interface

4. External port – EBIU – low priority

5. DMA low priority (lowest SOC bus priority)

When the SOC bus transaction has arbitrated for bus mastership and reaches the SOC interface, the transaction must compete for access to the target memory block. The processor uses a priority order when arbitrating between two (or more) internal buses making simultaneous accesses to the same internal memory block. For more information about the priority order for internal bus arbitration, see “Processor Microarchitecture” on page 8-5.

Transaction between the extended core and masters or slaves on the SOC bus go through the SOC interface. The SOC interface consists of FIFOs and buffers which control the data flow. Looking at Figure 3-2, it is possi-
ble to examine the buffer structure of the SOC interface and follow the data flow for transactions. Because transactions across the SOC bus are influenced by the SOC interface FIFOs and buffers, it is important when assessing system performance to understand something of how these FIFOs and buffers operate.

The SOC interface output FIFO (OFIFO) interfaces with the processor core J- and K-buses. The transactions through the SOC OFIFO for the J- and K-buses are:

- Writes from the processor core to SOC registers or to external address space
- Read requests by the processor core from the SOC registers or from external address space (split transactions)

The SOC OFIFO also interfaces with the processor core I-bus. The transactions through the SOC OFIFO for the I-bus are Instruction fetches by the sequencer from external addresses or from emulator instruction (EMUIR) register.

Figure 3-2. SOC Interface Data Flow
The SOC interface input FIFO (IFIFO) interfaces with the processor core K- and S-buses. The transactions through the SOC IFIFO are:

- Writes to memory initiated by SOC masters (S-bus)
- Reads from memory initiated by SOC masters (S-bus)
- Writes to register initiated by SOC masters (K-bus)
- Reads from registers initiated by SOC masters (K-bus)
- Reads from SOC registers or external address space initiated by the processor core (K-bus)

The SOC interface output buffer (OBUF) interfaces with the processor core K- and S-buses. The transactions through the SOC OBUF return data from processor core registers (K-bus) or internal memory (S-bus) as a result of a read initiated by SOC masters.

For more information on SOC interface OFIFO, IFIFO, and OBUF operations, see “SOC Interface Operations” on page 3-5.

**SOC Interface Operations**

As shown in Figure 3-2, the SOC interface connects the SOC bus to the processor core internal buses (J-, K-, I-, and S-bus) and uses FIFOs and buffers to manage the data flow. This section provides more detail on the SOC OFIFO, SOC IFIFO, and SOC OBUF operations.

**SOC OFIFO Transactions**

The SOC OFIFO processes all processor core (internal bus) transactions that target the SOC bus or target external locations through the SOC bus.
SOC Interface Operations

SOC IFIFO transactions that target internal memory use the S-bus. The S-bus must arbitrate for access to the targeted memory block. The S-bus transactions can be high or low priority for this arbitration. A transaction from SOC interface is high priority in the following cases:

- Read transaction
- DMA transaction marked as high priority (TCB was high)
- When there are at least two entries in the SOC interface IFIFO
- When a single transaction in the IFIFO has waited there for 16 CCLK cycles

The SOC IFIFO appears in Figure 3-2 with the other SOC interface FIFOs and buffers.

When a transaction is processed by the SOC OFIFO, the SOC OFIFO requests the SOC bus and executes the transaction on the SOC bus. For write transactions, this step completes the SOC OFIFO processing of the transaction. Read transactions require more steps to return data to the processor core. The additional steps for read transactions are that the data is returned to the processor core through the SOC IFIFO.

Many processor core read transactions are split transactions—a read from external space by the core that is translated to two transactions.

In the first transaction, the read request is made to the SOC address space (for example, a processor core read from an SOC Ureg register) or the read request is made to external address space (an access that goes through the SOC interface and then through the EBIU).

In the second transaction, the requested data is returned. When the processor core reads from an SOC Ureg register, the SOC interface generates a transaction on K-bus to return the data (write it) to the core register. When the processor core reads from an external location, the external port requests a transaction on the SOC bus which produces a transaction on K-bus (generated by the SOC interface IFIFO).
SOC Interface

There are some cases in which SOC OFIFO operation can cause delays or stalls in the processor core instruction pipeline. These cases are a data dependency delay occurring on a read through the SOC OFIFO and a stall occurring while the SOC OFIFO is full.

From the instruction pipeline view, when the read request (first transaction) enters to the SOC interface OFIFO, the instruction is complete. The cycle count indicates no stalls related to delays for the second transaction in the split, although the destination register hasn’t been updated yet. If a dependency is generated on the destination register (waiting for the update from the second transaction), the cycle count does indicate the real latency of the data being written to the destination register.

Another instruction pipeline related issue occurs when the SOC OFIFO becomes full. The SOC interface can submit a series of transactions from the processor core to the SOC OFIFO, loading the FIFO. While the SOC OFIFO is full, further instructions targeting the SOC OFIFO are stalled, stalling the processor core.

SOC IFIFO/OBUF Transactions

The SOC IFIFO processes two types of transactions that target the processor core:

- Processor core read transactions – The SOC IFIFO processes the return of read data, resulting from a processor core read transaction executed by the SOC OFIFO.

- SOC bus master read or write transactions – The SOC IFIFO and OBUF process transactions executed on the SOC bus that target internal memory or target processor core ureg registers.
SOC Interface Programming

When a processor core read (returning read data) or an SOC master write transaction is processed by the SOC IFIFO, the SOC IFIFO requests an internal bus and executes the transaction. These transactions use the internal S-bus when the transaction targets internal memory and use the internal K-bus when the transaction targets a processor core register.

For SOC master read transactions, the SOC IFIFO requests an internal bus and executes the transaction, but the read data returns through the SOC OBUF. These read transactions use the internal S-bus, unless the transaction targets a register (in that case, the K-bus is used).

SOC Interface Programming

The SOC interface lets programmers develop I/O access routines without considering the number of cycles required for any transaction or worrying that a transaction can be indefinitely held off in the interface. The arbitration schemes that prioritize processor core bus, SOC bus, external port, and DMA transactions provide for servicing all combinations of access requests and interface loading.

The programming issues that developers face in working with the SOC interface relate to understanding the issues that affect SOC interface performance and identifying techniques that work well with the SOC interface architecture.

Because processor core accesses through the SOC interface require arbitration and synchronization across multiple clock domains and (depending on the access) may be held off by external devices, programming techniques that cause the processor core to wait for the result of an external access are poor SOC interface programming practices.
SOC Interface

Some SOC interface programming practices to avoid include:

- *Do not* fetch instruction from external locations, *instead* fetch instructions from internal memory.
- *Do not* use data from external locations directly in instructions, *instead* use DMA to load the data into internal memory (then use the data).
- *Do not* target the same internal memory block from the processor core and from the SOC interface (for instance, a DMA) at the same time if possible, *instead* use the DMA complete interrupt to identify when the data is ready (then use the data).
- *Do not* stall the processor core through over filling the SOC OFIFO with more than eight processor core read or writes to the SOC interface at a time during time critical code, *instead* load sets of SOC Ureg registers during startup code and do other SOC Ureg access in small groups (less than eight) keeping these accesses outside of time critical code.

Some SOC interface programming practices to apply include:

- *Do* use DMA to move as much data as possible through the SOC interface. *Also*, use the DMA complete interrupt to identify when data is ready.
- *Do* target different memory blocks with the SOC interface and processor core as much as possible. This technique takes best advantage of the SOC interface architecture and memory interface architecture.
SOC Interface Programming

- Do account for the time it may take for transactions to cross clock domains, FIFOs, and external hold offs when estimating system performance for handling external events such as external interrupts, flags, DMA requests, host accesses, and others.

- Do prioritize transactions according to their relative importance or their speed critical nature (for example, use the DMA high priority level as needed).

It is good to think about these SOC interface programming issues in terms of an I/O programming operation. Suppose a programmer who is learning how to develop a TigerSHARC system wants to experiment with SOC interface programming by writing an interrupt service routine for an external interrupt ($IRQx$) that sets a flag output. This project gets external input and provides external output.

The programmer could write the interrupt service routine and use an emulator to experiment with it on an ADSP-TS201 EZ-Kit. The most important experiment that the programmer can run is to measure the response time for the interrupt service routine execution while the system is only executing the routine. To make a reasonable assessment of the response time, the programmer should:

- Task the system with processor core operations and I/O peripheral operations that are similar to those expected on the production system.

- Identify the worst and best case scenarios for SOC interface contention.

- Run a series of experiments to identify the range of system response times and see how they vary with the factors that influence SOC interface performance.

Note that the ADSP-TS201 EZ-Kit comes with a set of example programs that may be modified to perform these types of I/O peripheral performance experiments.
4 TIMERS

The TigerSHARC processor has two general-purpose 64-bit timers—
timer 0 and timer 1. The timers are free-running counters that are loaded
with an initial value, decrement the value to zero (expiring), give an indi-
cation when expiring, automatically reload the value, and continue
running. The expiration indicator is normally an interrupt, but also can be
an external output pin for timer 0.

Each timer contains two long (64-bit) registers. One register is the initial
value (TMRINxH/L) and the other is the running value (TIMERxH/L). The
TMRINxH/L register is read/write and the TIMERxH/L is read-only. While the
timer is running, the value in TIMERxH/L is updated at some point during
every system-on-chip bus clock (SOCCLK) cycle.

As shown in Figure 4-1, the timer is a bus slave on the SOC bus. Because
a read of the TIMERxH/L register by the processor core must cross clock
domains, the point in time of the read (and therefore the value) may differ
from one read to another. Consequently, results based on reading the
TIMERxH/L register may vary. For more information on clock domains, see
“SOC Interface” on page 3-1.

Note that the timer registers (TMRINxH/L and TIMERxH/L) can be accessed
only by a single-word access. Each access is to the high or low half of the
long registers; the 32-bit registers are TMRINOH, TMRINOL, TMRINIH, TMRINIL,
TIMEROH, TIMEROH, TIMERL, and TIMERIL.

The timer run (TMRxRN) bits in the interrupt control (INTCTL) register start
(enable=1) or stop (disable=0) the timer count for each timer.
Timer Operations

The timers are initialized to idle status after reset – the $TMRxRN$ bits are cleared in the $INTCTL$ register. To start and run a timer, the application sets the corresponding $TMRxRN$ bit. When this bit is set, the value in
Timers

TMRINxH/L is copied to the TIMERxH/L register, and the timer starts counting down, one count every internal SOC interface clock (SOCCLK = CCLK/2) cycle.

Programs can stop the timer by clearing the TMRxRN bit and resume the timer count from that point by setting the TMRxRN bit. Note that this means that the TIMERxH/L register load is not affected by the operation of the TMRxRN bits. Only a write to a TMRINxH/L register loads the corresponding TIMERxH/L register with a new initial value.

Whenever the timer count reaches zero, the timer issues the two timer expire interrupts (high and low priority), reinitializes the counter to the initial value, and starts running again. If the timer is active (TMRxRN bit is set), writing a value to the TMRINxH/L register has no effect. Writing a different value to TMRINxH/L while the timer is operating changes the initial value after the next expiration of the timer (when TMRINxH/L value is copied again to the timer).

When an interrupt bit for a timer (INT_TIMER0LP, INT_TIMER1LP, INT_TIMER0HP, or INT_TIMER1HP) in the ILATx register is set or cleared by writing to the ILATSTx or ILATCLx registers, both the low and high priority interrupts for that timer are set or cleared.
Timer Programming

Use the following procedure to control the timers:

1. (if the timer is currently running) Disable the timer by clearing its TMRxRN bit in the INTCTL register.

2. (if an interrupt is to be output) Setup the timer high and/or low priority interrupt.
   a. Assign the interrupt vector address(es) for the routine(s) by loading the interrupt vector register(s) (IVTIMERxH/LP).
   b. Unmask (enable) the timer interrupt(s) by setting (=1) the INT_TIMERxH/LP bit(s) in the IMASK register.
   c. Unmask (enable) global interrupts by setting the SQCTL_GIE bit in the SQCTL register.
      If the only output is the TMR0E pin, programs do not need to set up the timer interrupt.

3. Load the initial value in the TMRINxH/L registers.

4. Run (enable) the timer by setting its timer run TMRxRN bit in the INTCTL register.

Timer Programming

The example in Listing 4-1 provides a programming example for timer setup and usage. This assembly example sets up the timers to produce periodic interrupts that drive flag outputs. It is useful to observe the operation of this code on an EZ-Kit Lite demonstration system, noting the operation of timers, interrupts, and flags.
Listing 4-1. Periodic Timer Interrupt Generating Flag Output

/* This TigerSHARC assembly program sets up a periodic timer interrupt to generate a periodic clock signal using a flag output. On the ADSP-TS201 EZ-Kit, this program lights up LEDs for each flag output, indicating the clock phase. */

#include <defts201.h>
define PHASE_0 0
define PHASE_1 1
define PHASE_2 2
define PHASE_3 3
define LENGTH 4
#define QTR_PHASE_CLK 0x20

.section data1;
.var clock_phase[LENGTH] = PHASE_0, PHASE_1, PHASE_2, PHASE_3;
.var current_clock_phase = PHASE_0;

.section program;
_main:
.align_code 4;
xr0 = FLAGREG_FLAG0_EN | FLAGREG_FLAG1_EN |
FLAGREG_FLAG2_EN | FLAGREG_FLAG3_EN ;;
/* enable all flag pins as outputs */
FLAGREG = xr0;
xr0 = FLAGREG_FLAG1_OUT ;;
/* set flag1 high to start... */
FLAGREGST = xr0 ;;
j0 = j31 + clock_phase ;;
/* setup circular buffers to read in clock phase */
j10 = LENGTH ;;
jb0 = j31 + clock_phase ;;
j10 = j31 + _Timer_0_HPISR ;;
/* assign high-priority timer interrupt vector address */
IVTIMER0HP = j10;;
xr0 = 0;;
IMASKL = xr0;;
xr0 = 0x00100000;;
IMASKH = xr0;;
/* enable timer 0 high-priority interrupt */
SOCTLST = SOCTL_GIE;;
/* enable global interrupts */
xr0 = 0;;
TMRINOH = xr0;;
/* assign a value to the upper 32 bits of the 64-bit timer
countdown-register */
xr0 = QTR_PHASE_CLK;;
TMRINOL = xr0;;
/* assign a value to the lower 32 bits of the 64-bit timer
countdown-register */
xr0 = 0x10;;
/* start the timer now */
INTCTL = xr0;;

_Up_And_Down:
.align_code 4:
    idle; nop; nop; nop;;
    jump _Up_And_Down; nop; nop;;

_Timer_0_HPISR:
.align_code 4:
    xr1 = 1;;
    xr2 = 2;;
    xr3 = 3;;
    xr0 = cb [j0 += 1];;
    xr0 = pass r0;;
Timers

.align_code 4:
    if xaeq, jump _Set_Clock_High_Phase_0: nop; nop; nop;
    /* if at clock phase 0, set the clock high */

xr4 = r0 - r1; nop; nop;
if xaeq, jump _Set_Clock_High_Phase_1: nop; nop; nop;
    /* if at clock phase 1, set the clock high */

xr4 = r0 - r2; nop; nop;
if xaeq, jump _Set_Clock_Low_Phase_2; nop; nop; nop;
    /* if at clock phase 2, set the clock low */

xr4 = r0 - r3; nop; nop;
if xaeq, jump _Set_Clock_Low_Phase_3; nop; nop; nop;
    /* if at clock phase 3, set the clock low */

_Set_Clock_High_Phase_0:
.align_code 4;
    xr0 = FLAGREG_FLAG1_OUT; nop; nop; nop;
    FLAGREGST = xr0; nop; nop; nop;
    rti (abs); nop; nop; nop;

_Set_Clock_High_Phase_1:
.align_code 4;
    xr0 = FLAGREG_FLAG1_OUT; nop; nop; nop;
    FLAGREGST = xr0; nop; nop; nop;
    rti (abs); nop; nop; nop;

_Set_Clock_Low_Phase_2:
.align_code 4;
    xr0 = ~(FLAGREG_FLAG1_OUT); nop; nop; nop;
    FLAGREGCL = xr0; nop; nop; nop;
    rti (abs); nop; nop; nop;

 туристские
Timer Expired (TMR0E) Pin and Signal Timing

```assembly
_Set_Clock_Low_Phase_3:
.align_code 4:
  xr0 = ~(FLAGREG_FLAG1_OUT); nop; nop; nop::
  FLAGREGCL = xr0; nop; nop; nop::
  rti (abs); nop; nop; nop::

_main.end:
```

Timer Expired (TMR0E) Pin and Signal Timing

The timer 0 expired (TMR0E) pin indicates that timer 0 has expired. When this timer expires, the processor issues a high pulse on the TMR0E pin for four SCLK cycles. It is important to note (as shown in Figure 4-1 on page 4-2) that the timer is a bus slave on the SOC bus. Because processor core accesses to control the timer must cross clock domains, the response of the timer may be affected by SOC interface loading. For more information on clock domains, see “SOC Interface” on page 3-1. For more information on TMR0E pin timing, see the ADSP-TS201 TigerSHARC Embedded Processor Data Sheet.
5 FLAGS

The TigerSHARC processor has four general-purpose input or output flag (FLAG3-0) pins. Each pin may be configured as an input or output. As outputs, programs may use the flag pins to signal events or conditions to any external device such as a host processor. As inputs, programs may read the flag pin input state in the FLAGREG register or use the pin input state as a condition in conditional instructions. After powerup reset, the FLAG3-0 pins are inputs with internal pull-up resistors that hold them at logic high value.

Flag Operations

To setup and use input and output flag pins, use the following procedure:

1. Use the FLAGx_EN bits in the FLAGREG register to configure each of the FLAG3-0 pins as an input (FLAGx_EN=0) or output (FLAGx_EN=1).

2. (for output flag pins) Use the FLAGx_OUT bits in the FLAGREG register to select the output value for each of the FLAG3-0 pins.

3. (for input flag pins) Read the FLGx bits in the SQSTAT register to view the input value for each of the FLAG3-0 pins, or use the input flag conditions (FLAGx_IN) in conditional instructions.

As shown in Figure 5-1, the flag unit is a bus slave on the SOC bus. It is important to note that input and output flag operations must cross a number of clock domains when changing output values or responding to
Flag Operations

input. To guarantee that flag I/O is not lost, use a handshake protocol for flag I/O in the system. For more information on clock domains, see “SOC Interface” on page 3-1.

Figure 5-1. Flags and SOC Bus Connections
Flag Programming

The examples in Listing 5-1, Listing 5-2, Listing 5-3, and Listing 5-4 provide a programming example for flag setup and usage. It is useful to observe the operation of this code on an EZ-Kit Lite demonstration system, noting the operation of timers, interrupts, and flags.

Listing 5-1. Loop Generating Flag Output
(EZ-Kit Blink Example in Assembly)

/* This TigerSHARC assembly program toggles flag pin outputs to blink connected LEDs on the ADSP-TS201 EZ-Kit. */

/* Macro definitions and include statements */
#include <defTS201.h>

/* Program Code for internal memory block 0 */
.section program;

/* Flag setup */

flagregst = FLAGREG_FLAG2_EN | FLAGREG_FLAG3_EN |
FLAGREG_FLAG2_OUT | FLAGREG_FLAG3_OUT ;;
/* FLAG2 and FLAG3 output enable */

flagregcl = ~(FLAGREG_FLAG2_OUT):;

_done:

lc0 = 0x989680;; /* this count sets a quick toggle speed */
/* lc0 = 0x5F5E10;; */ /* count for very quick toggle speed */
/* lc0 = 0x2FAF080;; */ /* count for slow toggle speed */
.align_code 4;
Flag Programming

_loop1:
    if nlc0e, jump _loop1 (NP); nop;nop;nop;;

xR0 = FLAGREG;;
    /* flag toggle to blink EZ-Kit LEDs */
xR1= FLAGREG_FLAG2_OUT | FLAGREG_FLAG3_OUT;;
    /* toggle flags */
xR0 = RO XOR R1;;
    FLAGREG = xR0;;
.align_code 4:
    jump _done:nop;nop;nop;;
Listing 5-2. Flag Input Generating Flag Output
(EZ-Kit Push Buttons to LEDs Example)

/* This TigerSHARC assembly program uses flag input (pushbuttons) to toggle flag pin outputs to blink connected LEDs on the ADSP-TS201 EZ-Kit. */

// Macro Definitions & Includes
#include <defTS201.h>

.section program:

_start:
xR0 = 0x00000000;;
FLAGREG = xR0;;
    /* clear all flag settings */

flagregst = FLAGREG_FLAG2_EN | FLAGREG_FLAG3_EN;;
    /* enable FLAG2, FLAG3 as outputs */

j0 = _IRQ0_ISR;;
    /* set IVIRQ0 interrupt vector */
IVIRQ0 = j0;;
xR0 = INT_IRQ0;;
    /* unmask IRQ0 interrupt */
IMASKH = xR0;;
    /* Write contents back to IMASKH register */
xR0 = SQCTL;;
    /* Read contents of SQCTL register */
xr0 = bset r0 by SQCTL_GIE_P;;
    /* Unmask and global interrupts enable */
Flag Programming

SOCTL = xR0;;
    /* Write contents back to IMASKH register */

_testflag0:
    xR0 = SQSTAT;;
    /* check FLAG0 status */
bittest r0 by 16;;

    if xSEQ, jump _turnoff_flag2 (NP);;

_turnon_flag2:
    flagregst = FLAGREG_FLAG2_OUT;;
    /* turn on LED/FLAG2 */
jump _testflag1 (NP);;

_turnoff_flag2:
    flagregcl = ~(FLAGREG_FLAG2_OUT);;
    /* turn off LED/FLAG2 */

_testflag1:
    xR0 = SQSTAT;;
    /* check FLAG1 status */
bittest r0 by 17;;

    if xSEQ, jump _turnoff_flag3 (NP);;

_turnon_flag3:
    flagregst = FLAGREG_FLAG3_OUT;;
    /* turn on LED/FLAG3 */
jump _delayloop (NP);;

_turnoff_flag3:
    flagregcl = ~(FLAGREG_FLAG3_OUT);;
    /* turn off LED/FLAG3 */
Flags

_delayloop:
    /* delay loop */
    LC0 = 0x400000;
.align_code 4;
wait0:
    if NLC0E, jump wait0 (NP);:nop::nop::nop::

jump _testflag0 (NP);

_IRQ0_ISR:
    /* IRQ0 ISR */
    /* An RTI cannot be the first instruction of an ISR */
    xr0 = FLAGREG;; /* check FLAG2/3 status */
    xr0 = btgl r0 by FLAGREG_FLAG2_OUT_P; /* toggle FLAG2 */
    xr0 = btgl r0 by FLAGREG_FLAG3_OUT_P; /* toggle FLAG3 */
    FLAGREG = xr0;; /* write it back */
    LC1 = 0x400000;; /* delay loop */
.align_code 4;
wait1:
    if NLC1E, jump wait1 (NP):nop::nop::nop::
    xr0 = FLAGREG;; /* check FLAG2/3 status */
    xr0 = btgl r0 by FLAGREG_FLAG2_OUT_P; /* toggle FLAG2 */
    xr0 = btgl r0 by FLAGREG_FLAG3_OUT_P; /* toggle FLAG3 */
    FLAGREG = xr0;; /* write it back */
    LC1 = 0x400000;; /* delay loop */
.align_code 4;
wait2:
    if NLC1E, jump wait2 (NP):nop::nop::nop::
    rt1 (NP) (ABS);:
Listing 5-3. Loop Generating Flag Output (EZ-Kit Flag Test Example)

/* This C program toggles flag pin outputs to blink connected
LEDs on the ADSP-TS201 EZ-Kit. */
#include "flag.h"
#include <sysreg.h>
#include <builtins.h>

volatile int i, j;

void main()
{
    InitFlagAsOutput(2); /* Assign FLAG2 as an output */
    InitFlagAsOutput(3); /* Assign FLAG3 as an output */
    FlagHigh(2); /* Set FLAG2 high */
    FlagLow(3); /* Set FLAG3 low */
    while(1){ /* Setup infinite loop here... */
        for(i=0; i<8000000; i++);
        /* Add an arbitrary delay here */
        FlagLow(2); /* Set FLAG2 low */
        FlagHigh(3); /* Set FLAG3 high */
        for(j=0; j<8000000; j++);
        /* Add an arbitrary delay here */
        FlagHigh(2); /* Set FLAG2 high */
        FlagLow(3); /* Set FLAG3 low */
    } /* End infinite WHILE(1) loop... */
} /* End of main()... */
Listing 5-4. Loop Generating Flag Output
(EZ-Kit Blink Example in C)

/* This C program sets up a periodic timer interrupt to generate flag output. On the ADSP-TS201 EZ-Kit, this program lights up LEDs for each flag output. */

#include <defts201.h>
#include <sysreg.h>
#include <signal.h>

#define SET_INTCTL(x) asm("INTCTL=%0::":"y"(x))
#define SET_FLAG(x) asm("FLAGREGST=%0::":"y"(x))
#define CLR_FLAG(x) asm("FLAGREGCL=%0::":"y"(x))

#define TIMER1H_PERIOD 25000000/2

volatile int irq0_count = 0;
volatile int irq1_count = 0;
volatile int blink_speed = 0;
volatile int timer_count = 0;

void irq1_isr(int signal){
    irq1_count++;
    // while(1);
    if (irq1_count&1)
        SET_FLAG(FLAGREG_FLAG1_OUT);
    else
        CLR_FLAG(~FLAGREG_FLAG1_OUT);
} /* end irq1_isr */

void irq0_isr(int signal){
    irq0_count++;
    blink_speed--;
}
Flag (FLAG3-0) Pin and Signal Timing

```c
if (blink_speed<0)
    blink_speed = 2;
} /* end irq0_isr */

void timer1h_isr(int signal){
    timer_count++;
    if (((timer_count)>>blink_speed)&1)
        SET_FLAG(FLAGREG_FLAG0_OUT);
    else
        CLR_FLAG(~FLAGREG_FLAG0_OUT);
} /* end timer1h_isr */

void main(void){
    __builtin_sysreg_write(__IMASKH, 0);
    __builtin_sysreg_write(__IMASKL, 0);
    __builtin_sysreg_write(__TMRIN1H, 0);
    __builtin_sysreg_write(__TMRIN1L, TIMER1H_PERIOD);
    SET_FLAG(FLAGREG_FLAG0_EN|FLAGREG_FLAG1_EN);
    interrupt(SIGIRQ0, irq0_isr);
    interrupt(SIGIRQ1, irq1_isr);
    interrupt(SIGTIMER1HP, timer1h_isr);
    SET_INTCTL(INTCTL_TMR1RN);
    while(1);
} /* end main */
```

Flag (FLAG3-0) Pin and Signal Timing

The flag input and/or output (FLAG3-0) pins let the processor signal external devices or receive input from them. It is important to note (as shown in Figure 5-1 on page 5-2) that the flag unit is a bus slave on the SOC bus. Because processor core accesses to control the flag unit must cross clock domains, the response of the flags may be affected by SOC interface load-
Flags

For more information on clock domains, see “SOC Interface” on page 3-1. For more information on FLAG3–0 pin timing, see the ADSP-TS201 TigerSHARC Embedded Processor Data Sheet.
Flag (FLAG3-0) Pin and Signal Timing
6 INTERRUPTS

The TigerSHARC processor supports a variety of interrupts—external or internal event indicators. Interrupts can serve any of the following purposes in a system.

- Synchronization between core and non-core operations
- Error detection
- Debug feature operations
- Controls introduced by an application

Interrupts on the TigerSHARC processor are classified as either software interrupts or hardware interrupts.

Most interrupts in the TigerSHARC processor are dedicated. Four interrupt pins and two interrupt registers support general-purpose interrupts. All interrupt sources other than the interrupt pins and interrupt register are caused by events or dedicated hardware. For each interrupt there is a vector register in the Interrupt Vector Table (IVT) (in register groups 0x38 and 0x39) and a bit number in the interrupt latch, mask, and priority mask registers.

For performance considerations, when a hardware interrupt occurs, it does not break the pipeline. The first instruction of the interrupt routine is inserted into the instruction flow after it has occurred. From this point, the instructions that are already in the pipeline complete their execution.
Software interrupts are caused by an instruction that causes an internal event. Other instructions left in the instruction pipeline are not completed (flushed) after the instruction that has caused the software interrupt.

Hardware interrupts are caused by input to the external interrupt (IRQ3–0) pins and timer interrupts. Instructions left in the instruction pipeline are not completed after the cycle that the external interrupt is recognized. Hardware error interrupts are caused by:

- DMA error
- Inactive AutoDMA access
- Self multiprocessor space read
- Link port error
- Broadcast read from external memory

For more information on the instruction pipeline and interaction with interrupts, see Chapter 8, “Program Sequencer” in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Interrupts are either level-sensitive or edge-sensitive—the hardware interrupts (IRQ3–0) can be configured as either level- or edge-sensitive according to the programming in the INTCTL register. The differences between the two types of interrupts are:

- Edge-sensitive interrupts are latched when they occur and remain latched until serviced or reset by an instruction. If it is not cleared during servicing, no new event is identified and the continued request is ignored.

- Level-sensitive interrupt requests must be sustained until serviced, otherwise they are not seen. If the request continues after being serviced, it is considered a new interrupt.
A level-sensitive interrupt is normally driven as a result of some accessible register state and cleared either by reading the register or by writing an inactive value into it.

An edge-sensitive interrupt is event triggered (for example, by expiration of a timer).

The edge- or level-sensitivity for each interrupt is listed in “Interrupt Vector Table” on page 6-17.

Each of the interrupts implemented in the TigerSHARC processor has an interrupt vector register, which is the address of the interrupt routine that services the appropriate interrupt. The whole register file (31 entries with another 32 reserved entries) is referred to as the Interrupt Vector Table (IVT). The registers are 32 bits each, in order to enable internal and external memory interrupt routines.

For boot procedures, some of the interrupt vector registers are initialized at reset to specific addresses. In boot by EPROM and boot by link operations, a DMA is initialized to load the boot data into memory address 0x0. When the boot DMA is executed, it issues a DMA interrupt. The DMA interrupt vector is also initialized to address 0x0.

To issue an interrupt, an interrupt request pin, IRQ3-0, is asserted. Each of the IRQ3-0 pins may be individually set to either edge-triggered or level-sensitive. See “Interrupt Control (INTCTL) Register” on page 2-45.

Software interrupts that are caused by a specific instruction occur immediately after the instruction that caused them. Therefore the pipeline is flushed after a software interrupt, and the instructions that were in the pipeline are not executed.

As shown in Figure 6-1, the interrupt unit is a bus slave on the SOC bus. It is important to note that interrupt register accesses must cross a number of clock domains when reading or writing register data. Corresponding response for software interrupts may vary with SOC bus loading. Because the interrupt vector input from the interrupt unit has a direct connection
to the program sequencer, response to latched interrupts is not affected by SOC bus loading. For more information on clock domains, see “SOC Interface” on page 3-1.

Figure 6-1. Interrupts and SOC Bus Connections
Interrupt Operations

Interrupt operations on the processor include interrupt setup, interrupt (or exception) handling, and returning from the interrupt. The interrupt controller includes the following interrupt setup and control registers:

- Interrupt control (INTCTL) register
- Interrupt mask (IMASK) registers – IMASKH and IMASKL
- Priority mask (PMASK) registers – PMASKH and PMASKL
- Interrupt latch (ILAT) registers – ILATH and ILATL

The IMASK, PMASK, and ILAT registers have identical bit definitions; one bit for each interrupt source. The definitions for these registers are available in Figure 2-14 on page 2-48, Figure 2-15 on page 2-48, Figure 2-16 on page 2-49, Figure 2-17 on page 2-49, and Figure 2-13 on page 2-45. Additional programming information on these registers is available in Chapter 8, “Program Sequencer” in the ADSP-TS201 TigerSHARC Processor Programming Reference.

INTCTL Register

The INTCTL register is a 32-bit register that controls interrupt pin sensitivity (level- or edge-sensitive interrupts). This register also provides run/stop control for the timers.

IMASK Register

The IMASK register is a single 64-bit register accessed as two 32-bit registers, IMASKL and IMASKH. Mask bits are individually dedicated to different interrupts. When an interrupt bit is set in the ILAT register, the corresponding interrupt is only accepted if the corresponding bit is also set in the IMASK register. Consequently, the order of the interrupt bits is identi-
Interrupt Operations

cal to those found in the ILAT register. Bit2 in the SQCTL register is a global interrupt enable (GIE) bit. When cleared, no interrupts, except for exception and emulation, are enabled.

PMASK Register

The PMASK register is a single 64-bit register that is accessed as two 32-bit registers, PMASKL and PMASKH. This register is identical to the IMASK register in bit assignment.

These registers help the hardware track nested interrupts by masking all interrupts with a lower or equal priority than the interrupt that is currently being executed. The PMASK is a 64-bit register, of which 32 bits are implemented and 32 are reserved. When the TigerSHARC processor begins to service an interrupt, its bit in PMASK is set. Each set bit indicates this ISR is active or nested at some level. The most significant set bit in PMASK indicates which interrupt is currently served. The current interrupt bit is cleared once the service is completed, either by RTI instruction or by RDS instruction (see “if cond, RTI (ABS)” and “if cond, RDS” in the ADSP-TS201 TigerSHARC Processor Programming Reference).

When the PMASK is nonzero, all the interrupts with a lower or equal priority to the most significant set bit in PMASK are disabled by it. For logic equations, the PMASK_R is defined as the mask created by PMASK; that is, all the bits above the most significant set bit are set and other bits are clear. A PMASK_R is only a definition and not a register.

The RETI register is updated only in the Execute2 pipe stage of the first instruction of the interrupt service routine. The first instruction cannot be an instruction that uses its value, such as RTI or RETI.
Interrupts

During normal operations the PMASK is zero, and PMASK_R is 0xFF…F.

1. Assume that a link #0 interrupt occurred. PMASK = 0x00…040, and PMASK_R = 0xFF…F80. The interrupts of lower priority than link #0—namely, timer low priority interrupts and link #0 interrupt—are disabled by PMASK.

2. Following this, assume a DMA #2 interrupt occurred. PMASK = 0x00…08040, and PMASK_R is 0xFF…F0000. The timer low priority, link, and DMA #0, #1 and #2 interrupts are disabled by PMASK. Returning from interrupt (which is DMA #2 interrupt) clears the most significant set bit in PMASK which is bit #15. The PMASK register is now 0x00…040, and PMASK_R is 0xFF…F80. The timer low priority interrupts and link #0 interrupt are disabled by PMASK, but the other link interrupts and DMA interrupts are enabled again.

ILAT Register

The ILAT register is a single 64-bit register accessed as two 32-bit registers, ILATL and ILATH. Each bit is dedicated to an interrupt—when an interrupt occurs and is latched, the corresponding bit is set. The order of interrupt bits is the interrupt priority—Bit0 is the lowest priority. See “Interrupt Mask (IMASKL/IMASKH) Registers” on page 2-47 for bit assignments.

An application may emulate an interrupt by writing to the ILAT register set address (ILATSTL or ILATSTH). Writing to these addresses updates the ILAT register to the OR of the old value and the new written value. As a result, every set bit in the written data sets the corresponding bit in the ILAT register. Setting an interrupt bit causes the TigerSHARC processor to assume the corresponding interrupt has occurred.

Interrupt bits can also be cleared by writing to the clear addresses (ILATCLL or ILATCLH). Such a write ANDs the data written with the old data of the ILAT register. In this case, a zero bit in the input data clears the corresponding bit in ILAT, while a set bit keeps it unchanged. This way, an
Interrupt Operations

An application can clear a pending interrupt before it is executed. Such an operation is very sensitive, and it should be executed under these conditions:

- Non-implemented (reserved) bits may not be set. When writing to either set or clear address, the reserved bits must be zero.

- Exception and emulation exception bits may not be set through ILATST. In order to cause an exception, use a TRAP instruction; in order to cause an emulation exception, use an EMUTRAP instruction.

- Interrupts that are level-triggered should not be changed—link interrupts, hardware error interrupts, or IRQ bits if programmed to be level-triggered.

- Writes to ILATL or ILATH should not be attempted.

- An interrupt should be cleared while it is masked, otherwise it may be served before it is cleared.

- Like all other sequencer registers, the set and clear addresses are single-word writes only.

Interrupt Service Routines

When an interrupt occurs, the corresponding bit in the ILAT register is set. The ILAT bits are ANDed with the enable bits in the IMASK and with PMASK_R. If the global interrupt enable (GIE) bit in the SQCTL register is set and the result of the ILAT and IMASK and PMASK_R is not zero, the highest priority interrupt that is enabled is served.
Interrupts

All the level-sensitive interrupts (IRQ3–0 pin interrupts, link port interrupt, and hardware error interrupt) may be deasserted only in one of the following states:

- During the interrupt service routine of this specific interrupt
- While the specific interrupt is disabled
- While all hardware interrupts are disabled

The following algorithm describes the interrupt service routine (ISR).

1. The interrupt vector in Interrupt Vector Table (IVT) is used as the next fetch address and the ISR’s first instruction is pushed into the pipeline.

   If the interrupt is a hardware interrupt, normal execution continues until the first instruction of the interrupt routine reaches pipeline stage EX2. Otherwise, for software exceptions that are not hardware interrupts, all the normal flow instructions that are in the pipeline are aborted.

   While an interrupt is in the pipeline, all the hardware interrupts are disabled, although this is not reflected in the mask registers (PMASK or IMASK).

   If the exception is disabled, the ILAT bit for the exception is not set and the exception is ignored. The reason is that the instruction line that caused the exception is lost.

   In an ISR, the first instruction must not be RDS or RTI.

2. Instructions in the pipeline are executed.

3. Before the first instruction in the interrupt routine reaches pipeline stage EX2, the global interrupt enable (GIE) is rechecked. If the bit was cleared since the interrupt was generated by an instruction
Interrupt Operations

already in the pipeline, then all the instructions in the pipeline are aborted and the TigerSHARC processor begins fetching from the normal flow (as if there were no interrupt).

If instructions are ordered in a specific way, it is possible to enter an ISR immediately after its bit is cleared in the IMASK register. While in the ISR, IMASK bit = 0, which does not affect operation in any way.

4. The return PC (which points to the instruction that would have been executed had the interrupt not occurred) is saved in the appropriate register—RETI for a hardware interrupt, RETS for a software interrupt, or DBGE for an emulation debug.

5. The TigerSHARC processor mode is changed to emulation mode if there was an emulation exception; otherwise it changes to supervisor mode.

6. When the first instruction in the interrupt routine reaches pipeline stage EX2 and if the interrupt is edge-triggered, the corresponding interrupt bit in the ILAT register is reset, and the same bit in the PMASK register is set. The interrupt bit in PMASK is set in any case, not just when the interrupt is level-sensitive.

A software interrupt occurs only when the instruction that caused it reaches pipeline stage EX2. When a software interrupt that was enabled occurs, all the instructions in the pipeline after the instruction that caused the exception are aborted, including hardware interrupt routine instructions. If a software interrupt occurs during the time that the hardware interrupt is in the pipeline, the hardware interrupt is aborted with other instructions in the pipeline and the software interrupt procedure begins. The hardware interrupt bit is not reset, however, in order to save the (hardware) interrupt.
Interrupts

The hardware ISR steps, illustrated and numbered in Figure 6-2, are as follows.

1. The processor determines the state after hardware interrupt ($GIE \ AND \ IMASKN \ AND \ PMASK\_RN$).

2. The processor (after determining that masks are set):
   a. Inserts the ISR fetch address
   b. Stores the PC in $RETI$ when the instruction reaches pipeline stage EX2

   Storing PC in $RETI$ happens only after the first instruction of the interrupt routine reaches EX2 with $ILAT$ clear and $PMASK$ bits set, if step 4 (below) conditions are met.
   c. Executes instructions in pipeline, unless it encounters an exception or emulation condition

   Note that exceptions and emulation conditions are not affected by $GIE$ set.

3. The processor checks whether GIE is set when the first instruction in the ISR reaches pipeline stage EX2.
   a. If NO, the processor flushes the pipeline, aborts the interrupt, and waits for next instruction.
   b. If YES, the processor clears $ILAT\_N$ if interrupt is edge triggered

   Note that exceptions and emulation conditions are not affected by $GIE$ set.
Interrupt Operations

Figure 6-2. Hardware Interrupts
4. If nesting, for hardware interrupt, the ISR should start by storing the processor context and store the contents of \texttt{RETIB}.

Note that instructions for storing the processor context and \texttt{RETIB} must be specified in the ISR.

5. The ISR executes, servicing the interrupt.

6. If the nesting was enabled, the ISR should end by restoring the \texttt{RETIB} contents, restoring the processor context and executing an \texttt{RTI} instruction.

Note that the instructions for restoring \texttt{RETIB} and the processor context must be specified in the ISR.

7. After the ISR completes, the processor clears \texttt{PMASKN} and jumps to \texttt{RETI} address.

Handling Interrupts

There are two methods of interrupt handling: one that uses nesting and one that does not. If the interrupt handling does not use nesting, other interrupts are disabled throughout the ISR or handling routine. If the interrupt handling routine does enable nesting, some special programming is required. The special programming required for nested interrupt handling routines is described in this section.

The first instructions in the interrupt routine must be dedicated to preserving the current status of the machine. All the registers used by the interrupt routine must be pushed onto a stack. The TigerSHARC processor does not require a dedicated stack pointer since any one of the IALU registers can be used as a stack pointer. The application can store and load registers with the post-modify option, and it can work with two stacks in two memory blocks and store eight words every cycle.
Interrupt Operations

While preserving the current machine status in a stack, the appropriate global interrupt enable bit (hardware or software) is still clear, so there is no danger of corrupting the return address PC (RETI). If, however, the interrupt was a hardware interrupt, software or emulation exceptions can still be processed, since the return address is saved in a different register (RETS or DBGE). This also applies to emulation exceptions when a software interrupt is executing its initial cycles.

Once the relevant registers are saved in memory, the return register should be saved. This operation is performed automatically by saving the return address alias RETIB. The RETIB register is an aliased name for RETI. If you use RETIB, you access RETI. Note that interrupt nesting is enabled by saving off RETIB to memory or to another register.

If the system does not need to support nested hardware interrupts, there is no need to preserve the machine state in a stack. The return can be executed from the RETI register, and the hardware interrupt’s global disable bit should be left as is. However, software interrupts or debug interrupts may be nested. In these cases, the machine state should be retained before software/debug service is performed.

Returning From an Interrupt

The return from interrupt is performed using the RTI instruction (see “if cond, RTI (ABS)” in the ADSP-TS201 TigerSHARC Processor Programming Reference). The return address should be put in the RETIB register (recommended at about eight cycles beforehand to enable BTB usage). In non-nested hardware interrupt routines, the return address is already in the RETI register. If the non-nested exception is taken, the return address should be copied from the RETS or DBGE register (according to exception type) to the RETI register before the return. The use of the RETIB alias disables all interrupts until the RTI instruction is executed—this is to protect the RETI register from being destroyed by another interrupt occurrence.
Interruptions

The state of the machine must be saved in memory at the beginning of the interrupt routine. The full state should be restored in the original registers before the return. See “Handling Interrupts” on page 6-13.

The RTI instruction clears the highest set bit in PMASK register. The instruction then jumps to the address pointed by the value in the RETI register and re-enables the interrupts.

Interrupt routine restrictions:

- An interrupt routine may not refer to the RETI or RETS registers on the first instruction line.
- An interrupt routine may not use the RTI, RETI, or RDS instructions on the first instruction line.

Handling Exceptions

Exceptions are software interrupts—interrupts that are caused by code being executed. Figure 6-3 illustrates what transpires when a software interrupt is introduced. If the exception is enabled:

- SQCTL[21] is set in case of a regular exception, or SQCTL[22] is set in case of an emulation exception.
- The sequencer starts fetching from the address pointed to by the IVSW register in case of an exception, or from the EMUIR register in case of an emulation exception.
- PC is stored in RETS in case of an exception, or in DBGE register in case of an emulation exception.
- Instructions are flushed from the pipeline.

The return is executed by transferring the return address from RETS or DBGE (or from where it has been stored) to RETI without using RETIB. Then the RTI instruction should be executed.
Interrupt Operations

If an exception occurs when it is not enabled, the exception is cleared and not serviced.

![Diagram Image]

Figure 6-3. Software Exception (Interrupt)

The exceptions are treated differently from other interrupts:

- The exception is tied to the instruction that caused it. The exception process begins when the instruction that caused the exception reaches pipeline stage EX2. When this occurs, all instructions in the pipeline are aborted. If the instruction that caused the excep-
tion was speculative and it was aborted, the exception does not take place. Unlike an abort, a predicated instruction that is \textit{not} executed due to the condition can still cause the exception.

- The return address is saved in a different register to enable nesting between the hardware interrupt and the exception. The registers are \texttt{RETS} for exception or \texttt{DBGE} for emulation trap.

- If the exception is disabled when it is occurring, it is not latched. It is not latched because it is irrelevant to do so without the return address pointing to the cause (\texttt{RETS} or \texttt{DBGE} registers). In these cases the exception is lost.

- In order to protect the old \texttt{RETI} value before returning you must execute the following instruction lines, then jump to \texttt{RETS}:

\[
[j31+\text{temporary address}] = \texttt{RETI};
\texttt{RETI} = \texttt{RETS};
\texttt{RTI}; \texttt{RETI} = [j31+\text{temporary address}];
\]

\section*{Interrupt Vector Table}

The following subsections describe the interrupt types, noting the corresponding interrupt bit in the \texttt{ILAT} and \texttt{IMASK} interrupt registers.

\begin{table}[h]
\centering
\begin{tabular}{|c|c|p{7cm}|c|}
\hline
Priority & Interrupt & Description & Sensitivity \\
\hline
63 (highest) & DBG & Debug exception. See “Emulation Exception Interrupt” on page 6-24. & edge \\
\hline
62 & SW & Software exception. See “Software Exception Interrupts” on page 6-23. & edge \\
\hline
61–58 & Reserved & Reserved & \\
\hline
57 & HWERR & Hardware error. See “Hardware Error Interrupt” on page 6-22. & level \\
\hline
\end{tabular}
\caption{ADSP-TS201 Interrupt Vector Table}
\end{table}
## Interrupt Operations

Table 6-1. ADSP-TS201 Interrupt Vector Table (Cont'd)

<table>
<thead>
<tr>
<th>Priority</th>
<th>Interrupt</th>
<th>Description</th>
<th>Sensitivity</th>
</tr>
</thead>
<tbody>
<tr>
<td>56–54</td>
<td>Reserved</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>53</td>
<td>TIMER1H</td>
<td>Timer 1 high priority. See “Timer Expired Interrupts” on page 6-19.</td>
<td>edge</td>
</tr>
<tr>
<td>52</td>
<td>TIMER0H</td>
<td>Timer 0 high priority</td>
<td>edge</td>
</tr>
<tr>
<td>51</td>
<td>Reserved</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>50</td>
<td>BUSLOCK</td>
<td>Buslock. See “Bus Lock Interrupt Register” on page 6-21.</td>
<td>edge</td>
</tr>
<tr>
<td>49</td>
<td>Reserved</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>48</td>
<td>VIRPT</td>
<td>Vector Interrupt. See “Vector Interrupt” on page 6-21.</td>
<td>edge</td>
</tr>
<tr>
<td>47–45</td>
<td>Reserved</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>44</td>
<td>IRQ3</td>
<td>IRQ3 pin. See “External (IRQ3–0) Input Interrupts” on page 6-20.</td>
<td>edge or level</td>
</tr>
<tr>
<td>43</td>
<td>IRQ2</td>
<td>IRQ2 pin</td>
<td>edge or level</td>
</tr>
<tr>
<td>42</td>
<td>IRQ1</td>
<td>IRQ1 pin</td>
<td>edge or level</td>
</tr>
<tr>
<td>41</td>
<td>IRQ0</td>
<td>IRQ0 pin</td>
<td>edge or level</td>
</tr>
<tr>
<td>40–39</td>
<td>Reserved</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>38</td>
<td>DMA13</td>
<td>DMA channel 13 complete. See “DMA Complete Interrupts” on page 6-20.</td>
<td>edge</td>
</tr>
<tr>
<td>37</td>
<td>DMA12</td>
<td>DMA channel 12 complete.</td>
<td>edge</td>
</tr>
<tr>
<td>36–33</td>
<td>Reserved</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>32</td>
<td>DMA11</td>
<td>DMA channel 11 complete</td>
<td>edge</td>
</tr>
<tr>
<td>31</td>
<td>DMA10</td>
<td>DMA channel 10 complete</td>
<td>edge</td>
</tr>
<tr>
<td>30</td>
<td>DMA9</td>
<td>DMA channel 9 complete</td>
<td>edge</td>
</tr>
<tr>
<td>29</td>
<td>DMA8</td>
<td>DMA channel 8 complete</td>
<td>edge</td>
</tr>
<tr>
<td>28–26</td>
<td>Reserved</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>25</td>
<td>DMA7</td>
<td>DMA channel 7 complete</td>
<td>edge</td>
</tr>
<tr>
<td>24</td>
<td>DMA6</td>
<td>DMA channel 6 complete</td>
<td>edge</td>
</tr>
<tr>
<td>23</td>
<td>DMA5</td>
<td>DMA channel 5 complete</td>
<td>edge</td>
</tr>
<tr>
<td>22</td>
<td>DMA4</td>
<td>DMA channel 4 complete</td>
<td>edge</td>
</tr>
</tbody>
</table>
Interrupts

Table 6-1. ADSP-TS201 Interrupt Vector Table (Cont’d)

<table>
<thead>
<tr>
<th>Priority</th>
<th>Interrupt</th>
<th>Description</th>
<th>Sensitivity</th>
</tr>
</thead>
<tbody>
<tr>
<td>21–18</td>
<td>Reserved</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>17</td>
<td>DMA3</td>
<td>DMA channel 3 complete</td>
<td>edge</td>
</tr>
<tr>
<td>16</td>
<td>DMA2</td>
<td>DMA channel 2 complete</td>
<td>edge</td>
</tr>
<tr>
<td>15</td>
<td>DMA1</td>
<td>DMA channel 1 complete</td>
<td>edge</td>
</tr>
<tr>
<td>14</td>
<td>DMA0</td>
<td>DMA channel 0 complete</td>
<td>edge</td>
</tr>
<tr>
<td>13–10</td>
<td>Reserved</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>9</td>
<td>LINK3</td>
<td>Link port 3 request. See “Link Port Service Request Interrupts” on page 6-20.</td>
<td>level</td>
</tr>
<tr>
<td>8</td>
<td>LINK2</td>
<td>Link port 2 request</td>
<td>level</td>
</tr>
<tr>
<td>7</td>
<td>LINK1</td>
<td>Link port 1 request</td>
<td>level</td>
</tr>
<tr>
<td>6</td>
<td>LINK0</td>
<td>Link port 0 request</td>
<td>level</td>
</tr>
<tr>
<td>5–4</td>
<td>Reserved</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>3</td>
<td>TIMER1L</td>
<td>Timer 1 low priority. See “Timer Expired Interrupts” on page 6-19.</td>
<td>edge</td>
</tr>
<tr>
<td>2</td>
<td>TIMER0L</td>
<td>Timer 0 low priority</td>
<td>edge</td>
</tr>
<tr>
<td>1</td>
<td>Reserved</td>
<td>Reserved</td>
<td></td>
</tr>
<tr>
<td>0 (lowest)</td>
<td>KERNEL</td>
<td>Kernel</td>
<td>edge</td>
</tr>
</tbody>
</table>

Timer Expired Interrupts

There are four timer interrupts: two for each timer, one as high priority and one as low priority. The purpose of having the two priorities per timer is to enable the user to choose which priority is needed. The timer interrupt vector registers are IVTIMER0HP, IVTIMER1HP, IVTIMER0LP, and IVTIMER1LP.

When the timer interrupt occurs, both high and low priority bits are set in ILAT (for example, for Timer 0 both Bit2 and Bit52). This is accomplished by enabling the appropriate interrupt. When an interrupt bit for a timer (INT_TIMER0LP, INT_TIMER1LP, INT_TIMER0HP, or INT_TIMER1HP) in
Interrupt Operations

The ILATx register is set or cleared by writing to the ILATSTx or ILATCLx registers, both the low and high priority interrupts for that timer are set or cleared.

After reset, the timer interrupts are disabled, and vectors are not initialized.

Link Port Service Request Interrupts

There are four link port channels that normally work with their dedicated DMA channels. The link port service request vector registers are IVLINK0, IVLINK1, IVLINK2, and IVLINK3. When a data element enters the link port receive buffer and its corresponding DMA channel is not initialized, the link port issues a service request interrupt.

After reset, the link interrupts are disabled, and vectors are not initialized.

DMA Complete Interrupts

There are 14 DMA channels. Each DMA channel can generate a DMA complete interrupt. The vector registers are IVDMA0, IVDMA1, IVDMA2, IVDMA3, IVDMA4, IVDMA5, IVDMA6, IVDMA7, IVDMA8, IVDMA9, IVDMA10, IVDMA11, IVDMA12, and IVDMA13. If the interrupt bit in the DMA’s transfer control block (TCB) is set, the DMA channel generates an interrupt when the channel completes the DMA block transfer.

After reset, the DMA interrupts are enabled and vectors are initialized to zero for boot purposes.

External (IRQ3–0) Input Interrupts

There are four general-purpose interrupt pins that can be programmed to be edge- or level-sensitive. The external interrupt (IRQ3–0) pin vector registers are IVIRQ0, IVIRQ1, IVIRQ2, and IVIRQ3. When one of these interrupt pins is activated, an interrupt is issued (if enabled).
Interrupts

The interrupt type for the interrupt pins is edge- or level-triggered according to INTCTL programming. See “Interrupt Control (INTCTL) Register” on page 2-45.

After reset, the IRQ interrupts are disabled, unless IRQEN strap option (on pin BM) is used to allow for “no boot, run from memory” mode. For more information on boot options, see “Processor Booting Methods” on page 11-2. Vectors are initialized for boot purposes.

Vector Interrupt

The vector interrupt is a general-purpose interrupt for another master’s use. Another master, either the host or a TigerSHARC processor, can write an address to this register. The write causes a vector interrupt to occur. The value that is written into the register is the address of the interrupt routine. The vector interrupt register is VIRPT.

After reset, the vector interrupt is enabled but the vector is not initialized. This is one of the events that can initiate booting by another master.

Bus Lock Interrupt Register

The bus lock interrupt is issued when the bus lock bit in the SQCTL register is set and the TigerSHARC processor becomes the bus master. The bus lock vector register is IVBUSLK. This interrupt is used to indicate that the TigerSHARC processor has locked the external bus.

After reset, the bus lock interrupt is disabled and vectors are not initialized.
Interrupt Operations

Hardware Error Interrupt

The hardware error interrupt indicates that one of the following errors has occurred:

- **Error in DMA** – one of the CHxx fields in DSTAT register is 100 (TCB initialization error). Refer to “DMA Status Register (DSTAT/DSTATC)” on page 7-23.

- **Data written to the AutoDMA while the corresponding AutoDMA channel is not initialized.** Active while Bit17 of SYSTAT is active. Refer to the register description for “System Status (SYSTAT) Register” on page 2-77.

- **Broadcast read from external memory.** Active while Bit16 of SYSTAT is active. Refer to the register description for “System Status (SYSTAT) Register” on page 2-77.

- **Access to SDRAM when it is not enabled.** Active while Bit18 of SYSTAT is active. Refer to the register description for “System Status (SYSTAT) Register” on page 2-77.

- **Self multiprocessing read** – active when Bit19 of SYSTAT is active. Refer to the register description for “System Status (SYSTAT) Register” on page 2-77.

- **Link error** – in one of the LSTATx registers in the fields RER or TER indicate an error. Refer to “Link Port Error Detection Mechanisms” on page 9-19. For additional information refer to “Link Port Status Registers” on page 9-26.

The hardware error interrupt vector register is IVHW.

After reset, the hardware error interrupt is disabled and vectors are not initialized.
Software Exception Interrupts

Software exceptions are interrupts stemming from code execution. If an exception occurs when the interrupt is disabled, the exception is lost. The software interrupt vector register is IVSW. Software exceptions include:

- Floating-point invalid, if the IVEN bit in the XSTAT/YSTAT register is set. These exceptions are defined by the IEEE standard.
- Underflow, if the UEN bit in the XSTAT/YSTAT register is set
- Overflow, if the OEN bit in the XSTAT/YSTAT register is set
- Debug exceptions that cause supervisor exceptions:
  - TRAP instruction
  - Watchpoint match, when programmed to cause an exception
- Illegal operations:
  - Undefined instruction (not on all cases)
  - Illegal combination of instructions
- External access to an illegal space – multiprocessing broadcast read access by either load/store instruction or by fetch
- Illegal misalignment of load – not DAB accessible due to odd address for long-word; or address that is not quad-aligned for quad-word
- Protection violation – access to a supervisor register in user mode
- Two instructions in the same line with one destination
- More than three results in a single compute instruction line (for example, add and subtract, multiply, shift)
Interrupt Operations

- Two 32-bit immediate instructions in the same line
- More than one branch in the same aligned quad-word with prediction bit set

After reset, the software exception is disabled and vectors are not initialized.

For more information on instruction execution and processor core status reported by the software exception, see the *ADSP-TS201 TigerSHARC Processor Programming Reference*.

Emulation Exception Interrupt

The emulation debug exception causes the TigerSHARC processor to go into emulation mode. The emulation exception does not have an interrupt vector register. After an emulation exception, the TigerSHARC processor fetches from \texttt{EMUIR} register.

In this mode, instructions are read from the JTAG interface and executed in a single step. Emulation mode can be caused by:

- An \texttt{EMUTRAP} instruction
- A watchpoint match (when programmed to cause an emulation trap)
- A JTAG emulation activation

During and after reset, the emulation trap is enabled.
Interrupt Programming

The example code in Listing 6-1 initializes the two timers and sets up their interrupts. The lower priority interrupt allows nesting, the higher one does not need to allow nesting. The example code in Listing 6-2 shows software exception interrupt handling.

Listing 6-1. Example Code for Nested Interrupts Using Two Timers

```
/******************************************************************************
/*                                                                         */
/* Example of nested interrupts using two timers                          */
/*                                                                         */
/******************************************************************************
#include "defts201.h"

.section data1a;
.align 4;
.var register_store[4]; /* Internal memory used to store registers within ISR */
.align 4;
.var register_store_2[4]; /* Internal memory used to store registers within ISR */

.section program;
_Setup_Timer_Interrupt_Vectors:
    j0 = j31 + _TIMER0HP_ISR;;
    IVTIMER0HP = j0;; /* Set Timer0HP Interrupt Vector */
    j0 = j31 + _TIMER1HP_ISR;;
    IVTIMER1HP = j0;; /* Set Timer1HP Interrupt Vector */

_Unmask_Timer_Interruports:
```
Interrupt Programming

/* Unmasks Timer0 and Timer1 HP Interrupts without */
/* altering previously masked or unmasked interrupts */
xr0 = IMASKH;;
xr1 = INT_TIMER0H | INT_TIMER1H;;
xr0 = r0 or r1;;
IMASKH = xr0;;
_Enable_Hardware_Interrupts:
/* Enable the Hardware Interrupts Enable bit in SQCTL */
SQCTLST = SQCTL_GIE;;

_Setup_Timer_Registers:
/* Load values into Timer0 and Timer1 registers */
xr0 = 0x0;;
xr1 = 0xf;;
xr2 = 0xa;;
TMRIN0H = xr0;;
TMRIN1H = xr0;;
TMRIN0L = xr1;;
TMRIN1L = xr2;;

_Enable_flag_out:
/* Enable FLAG2 and FLAG2 as outputs */
FLAGREGST = FLAGREG_FLAG2_EN | FLAGREG_FLAG3_EN;;

_Enable_Timers:
/* Enable the timers without altering any other */
/* settings in the INTCTL register */
xr0 = INTCTL;;
xr1 = INTCTL_TMRORN | INTCTL_TMR1RN;;
xr0 = r0 or r1;;
INTCTL = xr0;;

Done:
/* Endless loop */
Interrupts

_nop;nop;nop;nop;;
_jump _Done;nop;nop;nop;;

_TIMER0HP_ISR:
/* Interrupt service routine for Timer0HP Interrupt */
/* This interrupt is the lower of the two being used. */
    _j0 = j31 + register_store_2;
    q[[_j0]+4] = xr3:0; /* Store registers used in ISR to memory */
xr0 = RETIB; /* Clears SQSTAT[20] enabling servicing of higher priority interrupts */

/**********************************
/* Add main function of ISR here */
/**********************************
    xr1 = FLAGREG;;
xr2 = btgl r1 by FLAGREG_FLAG2_OUT_P;;
FLAGREG = xr2;; /* Toggle flag2 LED on EZ-KIT */

RETIB = xr0;; /* Sets SQSTAT[20] no longer enabling servicing of higher priority interrupts */
    _j0 = j31 + register_store_2;
    xr3:0 = q[[_j0]+4]; /* Restore registers previously saved to memory */
    RTI(ABS);;;

_TIMER1HP_ISR:
/* Interrupt service routine for Timer1HP Interrupt. This interrupt is the highest of the two being used. Therefore there is no need to enable nesting. */
    _j0 = j31 + register_store_2;
    q[[_j0]+4] = xr3:0; /* Store registers used in ISR to memory */

/**********************************
/* Add main function of ISR here */
/**********************************
Interrupt Programming

/**********************************
xr1 = FLAGREG;;
xr2 = btgl r1 by FLAGREG_FLAG3_OUT_P;;
FLAGREG = xr2;; /* Toggle flag3 LED on EZ-KIT */

j0 = j31 + register_store_2;;
xr3:0 = q[j0+=4];; /* Restore registers previously saved to memory */
    RTI(ABS);;
Listing 6-2. Example Code for Software Exception Interrupt Handling

/*********************/
/*               */
/* Example of Software Exception Interrupt handling */
/*               */
/*********************/
#include "defts201.h"

.section data1a;
.align 4;
.var register_store[4]: /* Internal memory used to store registers within ISR */
.align 4;
.var register_store_2[5]: /* Internal memory used to store registers within ISR */

.section program;
_Setup_Timer_Interrupt_Vectors:
    j0 = j31 + _TIMER0HP_ISR;;
    IVTImer0H = j0;; /* Set Timer0HP Interrupt Vector */
    j0 = j31 + _SW_ISR;;
    IVDW = j0;; /* Set Timer0HP Interrupt Vector */

_Unmask_Timer_Interrupts:
/* Unmasks Timer0 Interrupt without altering previously masked or unmasked interrupts */
    xr0 = IMASKH;;
    xrl = INT_TIMER0H;;
    xro = r0 or r1;;
    IMASKH = xr0;;
Interrupt Programming

_Enable_HW_SW_Interrupts:
/* Enable Hardware Interrupts and Software exceptions in SQCTL */
    SQCTLST = SQCTL_GIE | SQCTL_SW;;

_Setup_Timer_Registers:
/* Load values into Timer0 and Timer1 registers */
    xr0 = 0x0;;
    xr1 = 0xffff;;
    TMRINOH = xr0;;
    TMRINOL = xr1;;

_Enable_Flag_Out:
/* Enable FLAG2 and FLAG3 as outputs */
    FLAGREGST = FLAGREG_FLAG2_EN | FLAGREG_FLAG3_EN;;

_Enable_Timer:
/* Enable the timer without altering any other settings in the
   INTCCTL register */
    xr0 = INTCCTL;;
    xr1 = INTCCTL_TMR0RN;;
    xr0 = r0 or r1;;
    INTCCTL = xr0;;

.Done:
/* Endless loop */
    nop;nop;nop;nop;;
    jump _Done:nop;nop;nop;;

_TIMER0HP_ISR:
/* Interrupt service routine for Timer0HP Interrupt */
    j0 = j31 + register_store;;
    q[j0+=4] = xr3:0;; /* Store registers used in ISR to memory */
interrupts

/** Add main function of ISR here */
/** Add main function of ISR here */

j0 = j0 + 1;;
q[j0 += 4] = xr3:0;; /* Forces a software exception due to unaligned quad access */
xr1 = flagreg;;
xr2 = btgl r1 by FLAGREG_FLAG2_OUT_P;;
flagreg = xr2;; /* Toggle Flag2 LED */

j0 = j31 + register_store;;
xr3:0 = q[j0 += 4];; /* Restore registers previously saved to memory */
RTI(ABS);;

_SW_ISR:
/* Interrupt service routine for Software Exceptions. This interrupt is higher than all Hardware interrupts. There is no need to enable nesting as RETS is used to store return address and not RETI. */

j0 = j31 + register_store_2;;
q[j0 += 4] = xr3:0;; /* Store registers used in ISR to memory */
[j0 += 1] = RETI;; /* Stores RETI value for last interrupt */
RETI = RETS;; /* Replace RETI with RETS to return to point where SW exception occurred and not HW interrupt */

/** Add main function of ISR here */
/** Add main function of ISR here */

xr1 = flagreg;;
xr2 = btgl r1 by FLAGREG_FLAG3_OUT_P;;
flagreg = xr2;; /* Toggle Flag3 LED */

j0 = j31 + register_store_2;;
Interrupt (IRQ3-0) Pin and Signal Timing

xr3:0 = q[j0++4]; /* Restore registers previously saved to memory */
RTI(ABS); RETI = [j31 +j0]; /* Return from interrupt and restore RETI */

Interrupt (IRQ3-0) Pin and Signal Timing

The external interrupt (IRQ3–0) pins let the processor signal receive interrupt input from external devices. It is important to note (as shown in Figure 6-1 on page 6-4) that the interrupt controller is a bus slave on the SOC bus. Because the interrupt vector input from the interrupt unit has a direct connection to the program sequencer, response to latched interrupts is not affected by SOC bus loading. For more information on clock domains, see “SOC Interface” on page 3-1. For more information on IRQ3–0 pin timing, see the ADSP-TS201 TigerSHARC Embedded Processor Data Sheet.
7 DIRECT MEMORY ACCESS

Direct Memory Access (DMA) is a mechanism for transferring data without executing instructions in the processor core. The TigerSHARC processor on-chip DMA controller relieves the processor core of the burden of moving data between internal memory and an external device/memory, or between link ports and internal or external memory. The fully integrated DMA controller allows the TigerSHARC core processor, or an external device, to specify data transfer operations and return to normal processing while the DMA controller carries out the data transfers in the background.

As shown in Figure 7-1, the DMA unit can be a bus master or a bus slave on the SOC bus. It is important to note that DMA register accesses must cross a number of clock domains when reading or writing register data. Corresponding response for controlling DMA transfers may vary with SOC bus loading. For more information on clock domains, see “SOC Interface” on page 3-1.

The TigerSHARC processor DMA competes with other internal bus masters for internal memory access. For more information, see “Processor Microarchitecture” on page 8-5. This conflict is minimized because large internal memory bus bandwidth is available.

The TigerSHARC processor DMA includes 14 DMA channels of which four are dedicated to the transfer of data to and from external memory devices (including other TigerSHARC processors in the cluster), eight to
Figure 7-1. DMA and SOC Bus Connections
Direct Memory Access

the link ports and two to the AutoDMA registers. These DMA channels allow for the following transfer types (the ←→ symbol indicates “to and from”)

- Internal memory ←→ External memory or memory-mapped peripherals
- Internal memory ←→ Internal memory of another TigerSHARC processor via the cluster bus
- Internal memory ←→ Host processor
- Internal memory ←→ Link port I/O
- External memory or memory-mapped peripherals ←→ Link port I/O
- External memory ←→ External peripherals
- Link port I/O → Link port I/O
- Cluster bus master → Slave internal memory via AutoDMA

Internal-to-internal memory transfers are not directly supported. Transfers may be executed through the multiprocessing space, although this loads the cluster bus.

The AutoDMA registers (see “AutoDMA Register Control” on page 7-32) are accessed by a cluster bus master or by the core itself via the multiprocessing address space. They cannot be accessed via the internal address space.

In chained DMA operations, a DMA transfer can be programmed to auto-initialize another DMA operation to follow the current one. This technique is primarily used for the same DMA channel, but in some cases, it may be used for a different channel.
The TigerSHARC processor also has four external DMA request pins (DMAR3–0) that allow for external I/O devices to request DMA services. As a response to the DMA request, the TigerSHARC processor performs DMA transfers according to DMA channel initialization.

In order for a DMA channel to be initialized, the program must write to the DMA channel’s Transfer Control Block (TCB). Figure 7-2 shows a block diagram of the DMA controller.

![Figure 7-2. DMA Block Diagram](image)

The transfer control block is a quad-word (128-bit) register that contains the vital information required to perform a DMA. The form of the TCB register is shown in Figure 7-3 on page 7-5.

In the case of a transmitter TCB, the four words contain the address of the source data (D1), the number of words to be transferred (DX/DY), the address increment (DX/DY), and the control bits (DP).
Direct Memory Access

In the case of a receiver TCB these four words contain destination address (DI), the number of words to be received (DX/DY), the address increment (DX/DY), and the control bits (DP).

The DMA index (DI) register is the 32-bit index register for the DMA. This contains the source or destination of the data to be transmitted or received and can point to internal, external memory, or the link ports.

The DMA X modify/count (DX) register contains a 16-bit count value and a 16-bit modify value.\(^1\) The count value is stored in the upper 16 bits (16-31) and the modify in the lower (0-15). If a two-dimensional DMA is enabled, then this register contains the modify and count values for the X dimension only. The value of X count must always be the number of normal (32-bit) words to be transferred. Likewise, the modify value is the number of normal words to modify the count. For example, if we wanted to transmit four quad-words (16 normal words), then the count value would be 0x10 and the modify value 0x4 if the operand length in the DP register is set to quad-word. If the operand length (in the DP register) was set to long-word, the modify value would be 0x2. Programming the DMA TCB parameters is not limited to those as previously described. Figure 7-4 on page 7-7 shows some of the many options available for DMA operation.

There are two restrictions that need to be considered when programming the DMA TCBs. The first is that the pointer must fall on an aligned boundary as specified by the operand length in the DP.

\(^1\) When TY=011 or 111, the count field is 10 bits and the modify field is 22 bits.
field. For example, it is not possible to set up a DMA TCB with an operand length of quad-word and a modify value of two as this results in a quad-word being transmitted from a non-quad-aligned address and a software exception being generated. The second is that the count values entered into the $DX$ and $DY$ fields must be an integer multiple of the operand length. So if the operand length is programmed to long-word then the count values need to be a multiple of two.

The $DY$ register is used in conjunction with the $DX$. This register contains the 16-bit modify and 16-bit count values for the DMA in the Y dimension.\(^1\) If two-dimensional DMA is not selected, load this register with 0x0 or a value that is consistent with the previous notes.

The $DP$ register contains all the control information for the DMA. This register is split into two main fields. The first contains all the control information and the second the chaining information. The breakdown of this register is shown in Figure 7-9 on page 7-19 and Table 7-3 on page 7-20.

The cluster bus (EP) TCBs are loaded by writing to the $DCSx$ registers for the source TCB and the $DCDx$ registers for the destination TCBs, where $x$ can be any value ranging from 0 to 3. The $DCx$ registers are written-to, in order to load the TCBs for the link ports or the AutoDMA registers. The $X$ can be any value from 4 to 13 depending on whether you are writing to link port receive, transmit, or AutoDMA channels.

\(^1\) When $TY=011$ or 111, the count field is 10 bits and the modify field is 22 bits.
Direct Memory Access

Figure 7-4. DMA Data Transfer Examples
DMA Controller Features

There are 14 DMA channels dedicated to six types of transfers:

Table 7-1. DMA Transfer Types

<table>
<thead>
<tr>
<th>Transfer Type</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Internal memory &lt;-&gt; Cluster bus</td>
<td>This transfer can be done in both directions and requires programming two Transfer Control Blocks (TCB). One is a transmitting TCB and the other a receiving TCB.</td>
</tr>
<tr>
<td>External memory &lt;-&gt; External I/O Device</td>
<td>This transfer can be done in both directions and requires programming two Transfer Control Blocks. One is a transmitting TCB and the other a receiving TCB.</td>
</tr>
<tr>
<td>Auto DMA register --&gt; Internal memory</td>
<td>This requires one receiver TCB.</td>
</tr>
<tr>
<td>Internal/external memory --&gt; Link ports</td>
<td>This requires one transmitter TCB.</td>
</tr>
<tr>
<td>Link ports --&gt; Internal/external memory</td>
<td>This requires one receiver TCB.</td>
</tr>
<tr>
<td>Link port --&gt; Link port</td>
<td>This requires one receiver TCB.</td>
</tr>
</tbody>
</table>

Cluster Bus Transfers

Cluster bus data transfers move data between the TigerSHARC processor internal memory and external memory or external I/O devices.

For internal to external memory transfers, transmitter DMA TCB registers are programmed with the internal memory address, the address increment, the number of transfers, and the control bits. Receiver DMA TCB registers are programmed with the external memory address, the address increment, the number of transfers, and the control bits.
Direct Memory Access

For external to internal memory transfers, transmitter DMA TCB registers are programmed with the external memory address, the address increment, the number of transfers, and control bits. Receiver DMA TCB registers are programmed with the internal memory address, the address increment, the number of transfer, and control bits.

Once setup programming is complete, DMA transfers start automatically and either continue until the entire block is transferred, or until one transaction after each DMAR3-0 pulse. The programming of the TCBs determines the instructions to be followed.

For external device and external memory transfers, an additional DMA capability allows the TigerSHARC processor to support data transfers between an external device and external memory (flyby transactions). This transfer does not interfere with internal TigerSHARC processor operations.

An external I/O device, unlike memory, gives an indication of readiness for data transfer. A source device is ready if it has data to write. A receiver is ready when it has space in its write buffer. There are two techniques available for synchronizing these devices with the TigerSHARC processor DMA channels:

- The source or receiver can assert a DMA Request input (DMARx) every time it is ready to transfer new data. The DMA, requests are accumulated in the DMA and a transaction is issued on the corresponding channel per DMA request. Up to 15 requests may be accumulated.

- A source that can be a master on the cluster bus can write to an AutoDMA register. After data is written to the AutoDMA register, the AutoDMA channel transfers the data according to its initialization.
DMA Controller Features

AutoDMA Transfers

Receiver DMA TCB registers are programmed with the internal memory address, the address increment, the number of transfers, and the control bits.

Once setup is complete, when an external master (either a host or another TigerSHARC processor) writes to the AutoDMA register, a DMA request is initiated and the DMA channels transfers the data to the internal memory.

Link Port Transfers

Link port DMA transfers handle data transmitted and received through the TigerSHARC processor’s link ports.

For external or internal memory to link port transfers, transmitter DMA TCB registers are programmed with the external/internal memory address, the address increment, the number of transfers, and the control bits.

Once transmitter TCB programming is complete, DMA transfers start automatically and continue until the block transfer is completed. The transmitting link port generates a DMA request if it is ready to receive data for transmission. The DMA from external/internal memory to the link port is then completed. This process is repeated until the entire data block is transferred.

For link port to external or internal memory transfers, receiver DMA TCB registers are programmed with the external/internal memory address, the address increment, the number of transfers, and the control bits.

Once receiver TCB programming is completed, DMA transfers start automatically and the link port waits until data is received. The link port then generates a DMA request, and the DMA transfer to external/internal memory completes. This process continues until the block transfer is completed.

7-10 ADSP-TS201 TigerSHARC Processor Hardware Reference
Direct Memory Access

For link port to external/internal memory transfers, receiver DMA TCB registers are programmed with the external/internal memory address, the address increment, the number of transfers, and the control bits.

Once receiver TCB programming is completed, DMA transfers start automatically and the link port waits until data is received. The link port then generates a DMA request and the DMA transfer to external/internal memory completes. This process continues until the block transfer is completed.

Two-Dimensional DMA

The DMA is able to address and transfer two-dimensional memory arrays, where X and Y array dimensions are defined in transmitter and receiver TCB registers.

- Receiver array dimensions can differ from those of the transmitter, as long as the total number of words in source and destination are equal.

- A two-dimensional memory array can be moved to a one-dimensional array and vice versa, as long as the total number of words in source and destination are equal.

- A two-dimensional block in memory can be transmitted via links, or a block received through links or from AutoDMA can be placed in memory as a two-dimensional array.

Chained DMA

Chained DMA allows for the DMA controller to auto-initialize multiple DMA transfers, thus minimizing loading on the processor core. The chain pointer field of the DMA control bits points to the address in memory that contains the next TCB to be loaded (which contains the address,
DMA Architecture

increment, number of transfers, and control bits for the next block of transfers). The new TCB is loaded when the current DMA transfer completes.

The TigerSHARC processor DMA controller has a great flexibility of options with chained DMA sequences.

- Single channel chained DMA
  When the chain pointer in the current TCB points to TCB data used to set up another DMA on the same channel.

- Cross channel chained DMA on link ports
  When one link port DMA channel sets up a DMA transfer on a different link port DMA channel.

- Chain insertion
  When a high priority DMA operation or another DMA chain may be inserted into an already active DMA chain.

DMA Architecture

DMA transfers are characterized by the direction of data flow, that is, from the transmitter (source) to the receiver (destination). If the transmitter or receiver is memory, it is characterized by a TCB register. Link and AutoDMA channels have one TCB register each. A transmit channel has one source TCB register. A receive link or AutoDMA channel has one destination TCB register. Feedthrough, from the receiver link to another transmitter link, can be caused by programming a receiver link channel TCB to send data to the target transmitter link buffer.

DMA initiates a transaction as specified by TCB programming and requests. The TCB programming determines if the DMA operates continuously or on requests (handshake mode). The link channels and
Direct Memory Access

AutoDMA channels always work by requests, which are generated internally by the link or by the AutoDMA registers. The memory-to-memory DMA channels can work either autonomously after initialization, or in handshake mode, using DMAR3-0 input pins. The bus transaction is always initiated by the DMA; the source of the transaction is the transmitter; and the transaction destination is the receiver.

Transactions that involve link or transactions where the transmitter is the internal memory are issued on the internal bus. Transactions where the transmitter is external memory are driven by the DMA directly to the EBIU. For more information, see “Processor Microarchitecture” on page 8-5.

DMA Request (DMAR3-0) Pins

The DMA request pins (DMAR3-0) allow I/O devices to request DMA services from the TigerSHARC processor. As a response to a DMA request, the TigerSHARC processor performs DMA transfers according to DMA channels initialization. Any DMA request from an uninitialized channel is ignored. The DMAR3-0 inputs are edge-sensitive.

The TigerSHARC processor supports external DMA request input signals, DMAR0, DMAR1, DMAR2, and DMAR3, and output signals, TORD, TOWN, and TOEN, to support DMA transfers between external peripheral devices. In flyby transactions, internal memory or peripherals are not involved. Only channel 0 should be defined to work in flyby mode.

By asserting a DMAR3-0 pin and waiting for the appropriate bus transaction to be performed by the TigerSHARC processor, an I/O device can transfer data to TigerSHARC processor internal memory or links. Any flyby output can be used to transfer data between an I/O device and external memory—in this case, the TigerSHARC processor performs a bus transaction but does not read or write data. “Handshake Mode” on page 7-64 provides additional information regarding DMAR I/O.
Terminology

There are a number of terms that repeatedly appear in the DMA discussions in this chapter. These terms include:

External Port Input FIFO (IFIFO)

Refers to the TigerSHARC processor external bus interface unit (EBIU) input FIFO. It is used for all externally-supplied data by bus masters (other TigerSHARC processors or a host processor), direct writes or external reads. The IFIFO also holds on-chip destination addresses and data attributes.

External Port Output FIFO (OFIFO)

Refers to the TigerSHARC processor EBIU output FIFO. It is used for all outgoing external port addresses, data, and transaction control signals, including DMA transfers to and from external address space.

Transfer Control Block (TCB)

A quad-word that defines a set of parameters for the DMA operation.

DMA TCB register

A quad-word register that contains a Transfer Control Block.

TCB chain loading

The process in which the TigerSHARC processor’s DMA controller downloads a TCB from memory and autoinitializes the DMA TCB register.

AutoDMA registers

Two virtual DMA registers that can only be accessed by a cluster bus master. These registers allow data to be moved to a block defined by the channel’s TCB register. For more information, see “AutoDMA Register Control” on page 7-32.
Direct Memory Access

Setting Up DMA Transfers

DMA operations can be programmed by the TigerSHARC processor core processor, by an external host processor, or by the (external) TigerSHARC processor bus master. The operation is programmed by writing to the memory-mapped DMA TCB registers. The TCB register is a quad-word register that indicates the DMA block transfer. A DMA channel is set up by writing a quad-word to each of the DMA TCB registers. Each register must be loaded with a starting address for the block, an address modifier, and a word count. Similar to link DMA channels, an AutoDMA register channel has only one TCB register.

In the case of link ports, once a DMA block is set up and enabled, data words received are automatically transferred to the internal memory or external memory or to another link port transmit buffer. Likewise, when the transmitting link is ready to transmit data, a quad-word is automatically transferred from internal or external memory to the transmitter link buffer. These transfers continue until the entire data buffer is received or transmitted.

If the TCB is programmed to issue an interrupt, then DMA interrupts are generated when an entire block of data has been transferred. This occurs when the DMA channel’s count register has decremented to zero and the last element of data has been transferred. For more information, see “DMA Interrupts” on page 7-49.

DMA Transfer Control Block Registers

The TCB registers used to control and configure DMA operations are part of the memory-mapped register sets (“DMA Control and Status Register Group” on page 2-51). These can be accessed as quad-words only.
DMA Transfer Control Block Registers

The remaining sections in this chapter describe the different operating modes of the DMA controller together with the associated control registers and bits. DMA TCB registers are detailed in the following subsections.

DMA Channel Control

Each of the four external memory DMA channels is controlled by a TCB quad-word register pair. In addition, each of the four link ports to or from memory DMA channels is controlled by a TCB quad-word register. Finally, each of the two AutoDMA registers-to-memory DMA channels is controlled by a TCB quad-word register.

Transfer Control Block (TCB) Registers

Each TCB register is 128 bits long and is divided into four 32-bit registers. These registers are illustrated in Figure 7-5 and listed by channel in Table 7-2.

- DMA index (DI) register
- DMA X dimension count and increment (DX) register
- DMA Y dimension count and increment (DY) register
- DMA control and chaining pointer (DP) register

![Figure 7-5. TCB Register](image-url)
Direct Memory Access

These four registers form a quad-word TCB that can be loaded as an aligned quad-word from memory via chaining or by the core. The following conditions apply (note only quad accesses are allowed).

- Loading an inactive TCB (TY field in DP register = 000) into a TCB register reinitializes the state of the DMA channel, clearing the channel status bits to zero. The DMA request counters are flushed and any internal DMA states are reset.

- Loading an active TCB into an active channel TCB register generates an interrupt. The DSTAT register (see “DMA Status Register (DSTAT/DSTATC)” on page 7-23) can be read to determine if the channel is active.

To initiate a new master DMA sequence after the current one has finished, the program must write new parameters to the TCB registers, re-enabling the DMA. (For chained DMA operations, this is done automatically.)

Table 7-2. DMA Channels and Associated TCB Registers

<table>
<thead>
<tr>
<th>DMA Channel</th>
<th>TCB Registers</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>DCS0</td>
<td>External port channel 0 source TCB</td>
</tr>
<tr>
<td></td>
<td>DCD0</td>
<td>External port channel 0 destination TCB</td>
</tr>
<tr>
<td>1</td>
<td>DCS1</td>
<td>External port channel 1 source TCB</td>
</tr>
<tr>
<td></td>
<td>DCD1</td>
<td>External port channel 1 destination TCB</td>
</tr>
<tr>
<td>2</td>
<td>DCS2</td>
<td>External port channel 2 source TCB</td>
</tr>
<tr>
<td></td>
<td>DCD2</td>
<td>External port channel 2 destination TCB</td>
</tr>
<tr>
<td>3</td>
<td>DCS3</td>
<td>External port channel 3 source TCB</td>
</tr>
<tr>
<td></td>
<td>DCD3</td>
<td>External port channel 3 destination TCB</td>
</tr>
<tr>
<td>4</td>
<td>DC4</td>
<td>Link port 0 transmit TCB</td>
</tr>
<tr>
<td>5</td>
<td>DC5</td>
<td>Link port 1 transmit TCB</td>
</tr>
<tr>
<td>6</td>
<td>DC6</td>
<td>Link port 2 transmit TCB</td>
</tr>
<tr>
<td>7</td>
<td>DC7</td>
<td>Link port 3 transmit TCB</td>
</tr>
</tbody>
</table>
DMA Transfer Control Block Registers

Table 7-2. DMA Channels and Associated TCB Registers (Cont’d)

<table>
<thead>
<tr>
<th>DMA Channel</th>
<th>TCB Registers</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>8</td>
<td>DC8</td>
<td>Link port 0 receive TCB</td>
</tr>
<tr>
<td>9</td>
<td>DC9</td>
<td>Link port 1 receive TCB</td>
</tr>
<tr>
<td>10</td>
<td>DC10</td>
<td>Link port 2 receive TCB</td>
</tr>
<tr>
<td>11</td>
<td>DC11</td>
<td>Link port 3 receive TCB</td>
</tr>
<tr>
<td>12</td>
<td>DC12</td>
<td>AutoDMA channel 0 TCB</td>
</tr>
<tr>
<td>13</td>
<td>DC13</td>
<td>AutoDMA channel 1 TCB</td>
</tr>
</tbody>
</table>

DIx Register

This is the 32-bit Index register for the DMA. It can point to the address of external, internal memory, or link ports.

![DIx Register](image)

Figure 7-6. DIx Register

DXx Register

If a two-dimensional (2D) DMA is disabled, this register contains the 16-bit modify (LSBs) and 16-bit count (MSBs) values for the DMA. If a two-dimensional (2D) DMA is enabled, it contains the 16-bit modify and 16-bit count X dimension values, where X count is the number of normal words to be transferred, and X modify is the number of normal words to modify the address pointer.
Direct Memory Access

**DYx Register**

This register is used together with the DX register when a two-dimensional DMA is enabled. It contains the 16-bit modify and the 16-bit Y count dimension values, where Y modify is the difference between the address of the last data element in a row and the address of the first element in the following row. If a two-dimensional (2D) DMA is disabled, this register is loaded with 0x0 or a valid value as previously described.

**DPx Register**

This register is split into two fields, where the first is dedicated to DMA control and the second is dedicated to chaining.

--

1 When TY=011 or 111, the count field is 10 bits and the modify field is 22 bits.
DMA Transfer Control Block Registers

Figure 7-9. DPx (Upper) Register Bit Descriptions (Cont’d)

1 The CHPT field includes bits 18–0 (bits 15–0 not shown).

Table 7-3. DPx Register Bit Descriptions

<table>
<thead>
<tr>
<th>Bit</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>CHPT</td>
<td>Chain Pointer&lt;br/&gt;These bits contain Bits20–2 of the address in memory where the next TCB register contents are to be found. Bits1–0 of the memory address are not required as the address must be quad-word aligned. For example if the next TCB register contents are stored at memory location 0x87128–0x8712B then the CHPT field would contain b00100001110001001010 (0x21C4A).</td>
</tr>
<tr>
<td>CHTG</td>
<td>Chaining destination channel&lt;br/&gt;This field only applies for link port DMAs and specifies the DMA channel TCB registers in which the quad-word pointed to by the CHPT field should be loaded. 000 – Link Receive Channel 0, 001 – Link Receive Channel 1, 010 – Link Receive Channel 2, 011 – Link Receive Channel 3, 100 – Link Transmit Channel 0, 101 – Link Transmit Channel 1, 110 – Link Transmit Channel 2, and 111 – Link Transmit Channel 3</td>
</tr>
<tr>
<td>CHEN</td>
<td>Chaining Enabled&lt;br/&gt;0 – No Chaining&lt;br/&gt;1 – DMA loads chaining targets channel TCB registers from internal memory&lt;br/&gt;This bit disables chaining or enables chaining. If set the DMA loads the chaining destination channel TCB registers from the internal memory location pointed to by the CHPT and MS fields.</td>
</tr>
</tbody>
</table>
### Direct Memory Access

#### Table 7-3. DPx Register Bit Descriptions (Cont’d)

<table>
<thead>
<tr>
<th>Bit</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DRQ</td>
<td><strong>DMA Request Enable</strong>&lt;br&gt;0 – Once the DMA channel is enabled, the whole block is transferred&lt;br&gt;1 – DMA issues transactions only upon request&lt;br&gt;If cleared (=0) this bit disables the functionality of the DMARx pins on external port DMA transfers. This results in the entire block of data being transferred once the TCB registers have been written to.&lt;br&gt;If enabled (=1) this bit results in external port transactions occurring only upon a request signaled on the DMARx pins.&lt;br&gt;For AutoDMA channels this bit must always be set.</td>
</tr>
<tr>
<td>INT</td>
<td><strong>Interrupt Enable</strong>&lt;br&gt;0 – DMA interrupt is disabled&lt;br&gt;1 – The DMA interrupts the core after transferring the whole block&lt;br&gt;If cleared (=0) this bit disables the generation of an interrupt on completion of the DMA block transfer.&lt;br&gt;When enabled (=1) an interrupt is generated upon completion of the DMA. The interrupt is generated when the count field of the DMA channels TCB decrements to zero.</td>
</tr>
<tr>
<td>LEN</td>
<td><strong>Operand Length</strong>&lt;br&gt;00 – Reserved&lt;br&gt;01 – Normal (32-bit) word&lt;br&gt;10 – Long (64-bit) word&lt;br&gt;11 – Quad (128-bit) word&lt;br&gt;These bits specify the length of the operand to be transferred on each DMA transaction. The original source or destination address specified in the DIx register must be aligned to a word boundary as specified with the setting of these bits.&lt;br&gt;Thus if operand length is set to quad, then the address originally loaded in the DIx register must be divisible by four.</td>
</tr>
<tr>
<td>2DDMA</td>
<td><strong>Two-Dimensional DMA</strong>&lt;br&gt;0 – One-dimensional DMA: DY register is meaningless&lt;br&gt;1 – Two-dimensional DMA: DX controls X dimension addresses, increment, and count. The DY register controls Y dimension addresses, increment, and count.&lt;br&gt;In this mode of operation, the DX register contains the X dimension addresses, increment and count. The DY register controls the Y dimension addresses, increment, and count. When disabled (cleared = 0), the contents of the DY register are simply ignored, and a normal one-dimensional DMA is performed.</td>
</tr>
</tbody>
</table>
## DMA Transfer Control Block Registers

### Table 7-3. DPx Register Bit Descriptions (Cont'd)

<table>
<thead>
<tr>
<th>Bit Description</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>PR Bit28</td>
<td>Determines Priority&lt;br&gt;0 – DMA request priority is normal&lt;br&gt;1 – DMA request priority is high&lt;br&gt;This bit determines the priority of the DMA. When set (PR = 1), the DMA request is given a high priority. If the TCB refers to an external address, then the Priority Access (DPA) pin is set. See cluster bus definition for more information on the DPA pin. If the TCB refers to an internal address, the DMA asserts its high priority internal bus request. When the priority bit is cleared (PR=0), the DMA request priority is normal. This bit also indicates the priority in the DMA channel arbitration.</td>
</tr>
<tr>
<td>TY Bits31–29</td>
<td>Specify the Device Type&lt;br&gt;000 – DMA disabled&lt;br&gt;001 – I/O link port DMA; select this type when performing link-port-to-link-port transfers.&lt;br&gt;010 – Internal memory (with 16-bit modify and 16-bit count); select this type in the source TCB if the source is internal memory; select this type in the destination TCB if the destination is internal memory. If a &gt;16-bit modify value is needed, use the 011 type.&lt;br&gt;011 – Internal memory (with 22-bit modify and 10-bit count²); select this type in the source TCB if the source is internal memory; select this type in the destination TCB if the destination is internal memory. Unless a &gt;16-bit modify value is needed, use the 010 type.&lt;br&gt;100 – External memory (with 16-bit modify and 16-bit count); select this type in the source TCB if the source is external memory; select this type in the destination if the destination is external memory. If a &gt;16-bit modify value is needed, use the 111 type.&lt;br&gt;101 – External I/O device (Flyby)&lt;br&gt;110 – Boot EPROM This type allows for DMA accesses to the boot EPROM memory. The BMS pin is used to select the device when this type is set.&lt;br&gt;111 – External memory (with 22-bit modify and 10-bit count²); select this type in the source TCB if the source is external memory; select this type in the destination if the destination is external memory. Unless a &gt;16-bit modify value is needed, use the 100 type.</td>
</tr>
</tbody>
</table>

1 Link channel TCBs CHTG field may be programmed with any other link channel. However, CHTG should not be cross chained with any other DMA channels.<br>2 When the TY = 011 or 111 transfer types are selected, the DXx and DYx registers change from holding a 16-bit count and 16-bit modify to holding a 10-bit count and 22-bit modify.
DMA Control and Status Registers

The following registers act as status and control registers for the DMA:

- DMA Status Register – \texttt{DSTAT}
- DMA Control Registers – \texttt{DCNT, DCNTST, DCNTCL}

DMA Status Register (\texttt{DSTAT/DSTATC})

The \texttt{DSTAT} register allocates a field of three status bits per channel, indicating whether the channel is currently disabled or active; whether the DMA has completed or not; and whether an error has been detected. The bits are read only and a field is cleared by loading the respective TCB register.

- DMA active status is set when the DMA channel is enabled (when the \texttt{TY} field of the \texttt{DPx} register is set).
- DMA completion status is set when the last DMA transaction is completed.
- DMA error status is set if an illegal or error condition is detected.

There are three DMA channel status states in which a hardware interrupt may be generated. These states are reached if one or more of the following conditions occur:

- Writing to already active DMA channels TCB registers
- Writing an illegal configuration to DMA channels TCB registers (DP register)
- Initializing a DMA channel to read from broadcast memory space
If any of these conditions occur a hardware interrupt is generated if enabled and the status bits can only be cleared by reading from the `DSTATC` register. The `DSTAT` and `DSTATC` registers must be read as either a long or quad-word. Normal word reading of these registers is not permitted.
Direct Memory Access

Figure 7-12. DSTAT (Bits 47-32) Register Bit Descriptions

Figure 7-13. DSTAT (Bits 15-0) Register Bit Descriptions
DMA Control and Status Registers

DMA Control Registers

There are three DMA control registers—DCNT, DCNST, and DCNTCL.

DCNT Register

This 32-bit control register controls the DMA flow. Setting DCNT bits stop DMA flow at the end of the current transaction; clearing DCNT bits resumes a suspended transaction. Initial value of the DCNT after reset is 0x0.

Figure 7-14. DCNT (Lower) Register

1 Bits 31-18 (not shown) are reserved.

7-26 ADSP-TS201 TigerSHARC Processor Hardware Reference
Direct Memory Access

DCNTST Register

The DCNTST register is a 32-bit register alias used to set DCNT bits. The value written to DCNTST is ORed with DCNT, and the result is loaded into DCNT. For example, writing a 1 to bit \( n \) in the register sets the corresponding bit \( n \) in the DCNT register.

DCNTCL Register

The DCNTCL register is a 32-bit register alias used to clear DCNT bits. The value written to DCNTCL is ANDed with DCNT, and the result is loaded into DCNT. For example, writing a 0 to bit \( n \) in the register resets the corresponding bit \( n \) in the DCNT register.

DMA Control Register Restrictions

The following restrictions apply to the DMA Control register usage.

Operand Length Setup (Len) Restrictions

In external port (EP) channels, the LEN field (normal word, long word, or quad word) must be the same in both TCBs. If the TY setup of either source or destination TCB is boot EPROM, the LEN field must be 0x1 (normal word).

In link (receive and transmit) channels the LEN field must always be quad-word (0x3). For AutoDMA channels, the LEN field must be the same as the transactions to the AutoDMA register.

Count (DXx Count and DYx Count) Restrictions

For EP channels, the total counts should be identical.

- \( DX\_COUNT \): when the 2DDMA bit is cleared
- \( DX\_COUNT \times DY\_COUNT \): when the 2DDMA bit is set

The 2DDMA bit may be set in one of the TCBs and cleared in the other.
DMA Control and Status Registers

There is no restriction on count registers in link DMA channels. In all cases, if a channel is not idle, the \( \text{DX\_COUNT} \) cannot be zero. If \( \text{2DDMA} \) is set, \( \text{DY\_COUNT} \) cannot be 0 or 1.

**Type Setup – Links Transmit (Channels 4 to 7) Restrictions**

For DMA control registers, link transmit channels can be set up as only one of the following:

- 2 (internal memory)
- 4 (external memory)

**Type Setup – Links Receive (Channels 8 to 11) Restrictions**

For DMA control registers, link receive channels can be set up as only one of the following:

- 1 (link port),
- 2 (internal memory), or
- 4 (external memory)
Direct Memory Access

Type Setup – EP (Channels 0 to 3) Restrictions

The legal DMA type Source>>Destination pairs are:

<table>
<thead>
<tr>
<th>Source Type</th>
<th>Destination Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>2 (internal memory—16-bit count/16-bit modifier)</td>
<td>4, 6, or 7</td>
</tr>
<tr>
<td>3 (internal memory—10-bit count/22-bit modifier)</td>
<td>4, 6, or 7</td>
</tr>
<tr>
<td>4 (external memory—16-bit count/16-bit modifier)</td>
<td>2, 3, or 5</td>
</tr>
<tr>
<td>5 (external I/O flyby—16-bit count/16-bit modifier)</td>
<td>4 or 7</td>
</tr>
<tr>
<td>6 (boot EPROM—16-bit count/16-bit modifier)</td>
<td>2 or 3</td>
</tr>
<tr>
<td>7 (external memory—10-bit count/22-bit modifier)</td>
<td>2, 3, or 5</td>
</tr>
<tr>
<td>0 (channel disabled)</td>
<td>Any</td>
</tr>
<tr>
<td>Any</td>
<td>0 (channel disabled)</td>
</tr>
</tbody>
</table>

Type Setup – AutoDMA (Channels 12, 13) Restrictions

AutoDMA is limited to internal memory. The type must be 2 (internal memory).

DMA Request Restrictions

In both AutoDMA channels, the DRQ bit (DMA Request) must be set.

In EP channels (channels 0 to 3), the DRQ bit must be the same in both TCB registers.

Alignment Restrictions

If the LEN field is 2 (long-word), the following fields must be even (bit 0 cleared):

- DI
- DX_MODIFY
DMA Control and Status Registers

- **DY_MODIFY** (if relevant)
- **DX_COUNT**

If the **LEN** field is 3 (quad-word), the values must all be multiplicands of 4 (Bits1–0 cleared).

For type = 6 (boot EPROM), these registers must all be multiplicands of 4 (Bits1–0 cleared).

**Address Range Restrictions**

The following correlation must exist between the **TY** field in the **DP** register and the address any time the TCB is active:

- **TY = 1** (link) ⇒ address is of one of the link transmit registers (see “Link Port Register Groups” on page 2-65).
- **TY = 2** (internal memory) ⇒ address is in internal memory range, and not in the registers (see “Internal Address Space” on page 2-7).
- **TY = 4** (external memory) ⇒ address is in external memory range (address31–22 nonzero). It can be in host, external memory, or multiprocessing space.

When a link DMA channel (transmit or receive channel) becomes active, if the transmit buffer is empty (for transmit channel) or if the receive buffer is not empty (for receive channel), a DMA request is immediately issued.
DMA Controller Operations

The DMA controller supports:

- “Link Port DMA Control” on page 7-31
- “External Port DMA Control” on page 7-32
- “AutoDMA Register Control” on page 7-32

Link Port DMA Control

The TigerSHARC processor’s four link ports are able to use DMA transfers to handle transmit and receive data. Two DMA channels are assigned to each link port, where the link port DMA request specifies the DMA channel that serves them.

- The link port input channel has a receiver TCB register.
- The link port output channel has a transmitter TCB register.

The number of words to be transferred is defined as X count or X count times Y count (if 2DDMA is set), but in any case, only quad-word transfers are allowed. A receive link port can signal the DMA channel to complete the count before the X and Y counters expire. The DMA signals a transmit port that the last data item has been sent (see “Link Port Communication Protocol” on page 9-12).

Discussions in this chapter assume normal block completion; X count = Y count = 0x0000.

The link DMA channel is enabled as long as the TY field is not set to 000 in the TCB register DP. Control bits and chaining address are as defined in TCB register DP, where if chaining is enabled, one quad-word TCB register is loaded. The link’s functionality is specified in “Link Ports” on page 9-1. Finally, DMA request priority is fixed—link 3 is highest and link 0 is the lowest.
DMA Controller Operations

External Port DMA Control

The TigerSHARC processor’s external port is able to use DMA transfers to transmit and receive data. Four DMA channels are assigned to external port operations. Each channel can work in handshake mode, using its \( DMARx \) pin. Either 32-bit (word), 64-bit (long-word), or 128-bit (quad-word) internal data widths can be used when transferring data between internal and external memory.

Table 7-4. Pin Definitions – External Port DMA/Flyby

<table>
<thead>
<tr>
<th>Signal</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DMAR3-0</td>
<td>DMA Request Pins. Enable external I/O devices to request DMA services from the DSP. In response to ( DMARx ), the DSP performs DMA transfers according to the DMA channel’s initialization. The DSP ignores DMA requests from uninitialized channels.</td>
</tr>
<tr>
<td>IOWR</td>
<td>I/O Write. When a DSP DMA channel initiates a flyby mode read transaction, the DSP asserts the ( IOWR ) signal during the data cycles. This assertion makes the I/O device sample the data instead of the TigerSHARC processor.</td>
</tr>
<tr>
<td>IORD</td>
<td>I/O Read. When a DSP DMA channel initiates a flyby mode write transaction, the DSP asserts the ( IORD ) signal during the data cycle. This assertion with the ( IOEN ) makes the I/O device drive the data instead of the TigerSHARC processor.</td>
</tr>
<tr>
<td>IOEN</td>
<td>I/O Device Output Enable. Enables the output buffers of an external I/O device for fly-by transactions between the device and external memory. Active on fly-by transactions.</td>
</tr>
</tbody>
</table>

AutoDMA Register Control

The AutoDMA is a technique used to implement a slave mode DMA. There are two virtual registers—AutoDMA0 and AutoDMA1. Each of the AutoDMAs has an AutoDMA channel associated with it, channels 12 and 13, respectively. When an external master (host or another TigerSHARC processor) writes to one of the AutoDMA registers, this register issues a DMA request to its associated channel and the DMA channel transfers the data to internal memory according to its TCB programming. Either 32-,
64- or 128-bit internal data width can be used when transferring data between the data registers and internal memory. The DMA data registers channel is enabled as long as the \( T_Y \) field is not set to 000 in the TCB register \( DP \). Control bits and chaining address are as defined in TCB register \( DP \). If chaining is enabled, one TCB register is loaded.

If a write to AutoDMA occurs when the \textit{channel is uninitialized} and \textit{chaining is disabled}, the data written is lost, an error interrupt is issued, and an error is indicated in the \textit{SYSTAT} register. If a write to AutoDMA occurs when the \textit{channel is disabled} and \textit{chaining is enabled}, the data written is buffered and forwarded to its destination when the TCB load for the channel completes (no interrupt, no error). Refer to the “\textit{System Status (SYSTAT) Register}” on page 2-77 for additional information.

**DMA Transfers**

The following subsections discuss the operation of the TigerSHARC processor’s DMA controller and DMA transfers.

**Internal Memory Buses**

DMA controller operations are carried out on any one of the three internal memory buses. All I/O ports (link and external) are connected to the internal memory via the three memory buses. \textit{For more information, see “Processor Microarchitecture” on page 8-5.} The DMA controller generates an internal memory access on one of these buses.
DMA Controller Operations

DMA Channels

Memory-to-memory DMA channels consist of two TCB registers (one for receive and one for transmit) that specify:

- A data buffer in internal and external memory
- Control fields
- The method of requesting a transfer

Link memory DMA channels consist of one TCB register that also specifies a data buffer in internal or external memory, control fields, and the hardware required to request DMA service. Any receiver link channel TCB register can define another link port as a memory-mapped device.

AutoDMA register memory is similar to link memory except that AutoDMA registers are always receivers. The AutoDMA register memory TCB cannot define another link port as a memory-mapped device. The destination must always be to internal memory.

The DMA controller contains priority logic to determine which channel or I/O can drive which bus in any given cycle.

DMA Memory Addressing

The DMA is able to access the full address space. Each channel includes an Index register (DIx) and a Modify register (DMx) in its TCB register, which are used to set up a data buffer in the memory.

The 32-bit TCB register DI holds the block’s initial address and must be initialized with a starting address for the data buffer. The address in the DI register is directed to one of the internal memories, to a link register address, or to the external memory space. This must be consistent with the TCB transfer type—TY field in DPx register. (See “DPx Register” on page 7-19.)
The data transferred may be a normal, long, or quad-word. This is specified in the LEN field of the TCB control register DPx. (See “DPx Register” on page 7-19.) The LEN field must be unique per channel. (Both external port channel’s TCBs must have the same LEN value.)

After each data transfer to or from internal memory, the DMA controller adds the modify value to the Index register to generate the address for the next DMA transfer. The modify value is added to the index value and written back into the Index register. The modify value in the X modify field in the TCB register DX is a 16-bit signed integer, allowing both increments and decrements. Similarly the Y modify field in the TCB register DY is a 16-bit signed integer. The 16 bits are sign-extended when used with the 32-bit Index register.

If the Index register is modified out of the address range of an existing memory allocation or changes range between internal and external memory space, results are unpredictable.

Each DMA channel has a Count register (the X count field in the TCB register DX) that has to be initialized with the number of words to be transferred, regardless of the data item length (32, 64, or 128 bits). The count register is decremented after each DMA transfer on that channel. The decrement value is 1, 2, or 4 depending on the normal, long, or quad-word access specified. When the count reaches zero, the DMA is complete, and the interrupt for that channel can be generated.

If the X count field in the TCB register DX is initialized with zero, DMA transfers on that channel are not disabled. Moreover, $2^{16}$ transfers are performed. This occurs because the first transfer is initiated before the count value is tested. The correct way to disable a DMA channel is to clear its DMA enable bit in the corresponding control register.

Each DMA channel TCB also has a chain pointer—the CHPT field in the TCB register DP. The chain pointer is used in chained DMA operations. (See “DMA Chaining” on page 7-40.)
DMA Controller Operations

When DMA is from internal/external memory to the external/internal memory, two TCB registers are used. One is used for the internal DMA address generation; the second is used to generate 32-bit addresses to be driven out of the external port. If the two-dimensional DMA option is enabled, the count value equals the TCB register $DX \times$ count times the TCB register $DY \times$ count. It is important to stress that different count values cause the DMA to cease once the transfer of the shorter block is complete.

“Transfer Control Block (TCB) Registers” on page 7-16 lists the TCB registers for each DMA channel. The TCB registers are disabled following a hardware reset, except for the boot channel TCB. For more information, see “Processor Booting Methods” on page 11-2.

DMA Channel Prioritization

The overall DMA prioritization is defined by internal bus priority as well as DMA channels priority.

Internal Memory Bus Priority

DMA arbitration resolves priority conflicts between DMA channels. Internal memory buses have different arbitration levels that define priority between internal memory bus masters. The internal memory bus priority is described in “Processor Microarchitecture” on page 8-5.

DMA Channel Priority

Because more than one DMA channel may have a request active in a particular cycle, a prioritization scheme is used to select the channel to service. The TigerSHARC processor always uses a fixed prioritization between I/O groups. Table 7-5 on page 7-37 lists the DMA groups in descending order of priority.
Priority is also set according to these considerations:

- Incoming data has higher priority than outgoing data, in order to reduce the possibility of stalling the cluster bus. For this reason, the AutoDMA register channels are assigned the highest priority. Receiving links have priority over transmitting links.

- The links have higher priority than the external ports because their lower bandwidth is not likely to block the bus.

- External port channels have a rotating priority. The sequence in which they rotate is 3-2-1-0.

- If the PR bit is set in TCB register DP, channel \( x \) asserts the highest priority inside its group. If more than one channel has one of its PR bits set in the TCB register DP, priority is resolved according to the priority list.

- The DMA controller determines the highest priority channel/device(s) that is requesting during every cycle. If the DMA is transferring a block and a higher priority channel requests DMA services, arbitration is carried out and the transfer is stopped.

- TCB chain loading receives the channel’s priority. For example, a higher priority channel data transfer is performed before a lower priority TCB chain loading.

### Table 7-5. DMA Channel Priority

<table>
<thead>
<tr>
<th>Channel</th>
<th>Description</th>
<th>Priority</th>
</tr>
</thead>
<tbody>
<tr>
<td>Channel 13</td>
<td>AutoDMA Registers</td>
<td>Highest</td>
</tr>
<tr>
<td>Channel 12</td>
<td></td>
<td>↓</td>
</tr>
</tbody>
</table>

Direct Memory Access
DMA Controller Operations

Table 7-5. DMA Channel Priority  (Cont’d)

<table>
<thead>
<tr>
<th>Channel</th>
<th>Description</th>
<th>Priority</th>
</tr>
</thead>
<tbody>
<tr>
<td>Channel 11 (Link 3)</td>
<td>Receiving Links</td>
<td>⬇</td>
</tr>
<tr>
<td>Channel 10 (Link 2)</td>
<td></td>
<td>⬇</td>
</tr>
<tr>
<td>Channel 9 (Link 1)</td>
<td></td>
<td>⬇</td>
</tr>
<tr>
<td>Channel 8 (Link 0)</td>
<td></td>
<td>⬇</td>
</tr>
<tr>
<td>Channel 7 (Link 3)</td>
<td>Transmitting Links</td>
<td>⬇</td>
</tr>
<tr>
<td>Channel 6 (Link 2)</td>
<td></td>
<td>⬇</td>
</tr>
<tr>
<td>Channel 5 (Link 1)</td>
<td></td>
<td>⬇</td>
</tr>
<tr>
<td>Channel 4 (Link 0)</td>
<td></td>
<td>⬇</td>
</tr>
<tr>
<td>Channel 3 (DMAR3)</td>
<td>External Ports (EPs)</td>
<td>⬇</td>
</tr>
<tr>
<td>Channel 2 (DMAR2)</td>
<td></td>
<td>⬇</td>
</tr>
<tr>
<td>Channel 1 (DMAR1)</td>
<td></td>
<td>⬇</td>
</tr>
<tr>
<td>Channel 0 (DMAR0)</td>
<td></td>
<td>Lowest</td>
</tr>
</tbody>
</table>

Rotating Priority

The priority of channels 0 to 3 rotates—that is, every time a new selection is made, the priority is set again. The priority of the channels after reset is:

- channel 3 (highest)
- channel 2
- channel 1
- channel 0 (lowest)
Every time a new selection is made, the priority changes. The newly selected channel receives the highest priority, where priority descends in the same order as above, but in a cyclical fashion. For example, if channel 1 becomes the next selected channel, the priority is:

- channel 1 (highest)
- channel 0
- channel 3
- channel 2 (lowest)

If the next selected channel is channel 0, the priority scheme is:

- channel 0 (highest)
- channel 3
- channel 2
- channel 1 (lowest)

The priority within channels 0 to 3 is kept while higher priority channels (one or more of channels 4–13) are selected. When the higher priority channel (or channels) completes its transfer, the last priority order for channels 0 to 3 is kept unchanged.

This is under the assumption that all channels do not set the priority bit in TCB, or that all channels do set it. If one or more of the channels is set to high priority, it is selected at a separate arbitration level. In this case there is one round robin for the higher priority channels, and another round robin for the lower priority channels. For example, if after reset channel 2, the TCB priority bit is set, the priority is:

- channel 2 (priority bit set)
- channel 3 (highest)
DMA Controller Operations

- channel 1
- channel 0 (lowest)

If channel 1 requests and the request is granted, the priority scheme is:
- channel 2 (priority bit set)
- channel 1 (highest)
- channel 0
- channel 3 (lowest)

If during the operation of channel 1, channel 2 makes a request, it is granted and the channel 1 operation is interrupted. When it completes executing its transactions, the scheme of the lower priority channels is unchanged, and channel 1 wins the arbitration and continues its transfers.

If there is more than one channel with priority bit set, the arbitration between these channels is in a similar round robin manner.

DMA Chaining

DMA chaining allows the TigerSHARC processor’s DMA controller to auto-initialize itself between multiple DMA transfers. Using chaining, multiple DMA operations can be set up, in which each operation can have different attributes and I/Os.

There are some restrictions on chaining DMA across DMA channels. For internal/external memory DMA transfers, cross channel DMA is not permitted. DMA processes between internal/external memory can only be chained within the same channel. Link port DMA processes, however, may be chained across channels.

If chaining across link port DMA channels, the system should enable the DMA error interrupt. This interrupt should be enabled to detect cross-channel chaining to an already active channel.
Direct Memory Access

In chained DMA operations, the TigerSHARC processor automatically sets up another DMA transfer when the entire contents of the current buffer have been transmitted or received. It is important to understand how the following terms apply to chained DMA operations:

**Transfer Control Block (TCB)**

The chain pointer (field CHPT in TCB control register DP) is used to point to the next set of DMA parameters stored in internal memory. The chain target field (CHTG) in the TCB indicates the channel to be initiated.

Each TCB has a chaining enable bit (CHEN) in register DP. The channel is selected according to the value of the CHTG field in TCB register DP. The address in the CHPT field of the same register selects the TCB’s internal memory address. (See “DPx Register” on page 7-19.)

For external/internal memory transfers, the chaining between the internal and external channels must be coordinated. Both channels must have the chaining enable bit set.

**TCB chain loading**

The DMA controller automatically reads the TCB from internal memory and loads the values into the channel TCB registers to set up the next DMA sequence at the end of the present one. This process is called TCB chain loading.

**DMA sequence**

A DMA sequence is defined as the sum of the DMA transfers for a single channel, from the TCB registers initialization to when the count register decrements to zero.
DMA Controller Operations

Enabling and Disabling Chaining

DMA transfers are initiated by writing a quad-word from memory to the TCB register (EP channels require both TCB registers to be loaded). Chaining occurs if the chain enable bit is set, and the chain pointer is a valid address.

The CHPT field in TCB register DP can be loaded at any time during the DMA sequence. This allows a DMA channel to have chaining disabled (when the CHEN bit is set to 0 in TCB register DP) until some event occurs that loads the CHPT field with a target address, sets the CHEN bit, and defines the CHTG field. DMA chaining operations can be performed across I/O devices. While updating an active TCB, it must be paused by writing to the channel field in the DCNT register.

Generating a DMA Complete Interrupt While Chaining

If the INT (interrupt enable) bit in TCB register DP is high, an interrupt occurs at the end of the current block transfer. Interrupts occur at the end of the chain sequence, instead of after each transfer, when the INT bit is set in the last chained TCB register.

Transfer Control Blocks and Chain Loading

During TCB chain loading, the DMA channel TCB registers are loaded with values retrieved from internal memory. The TCB is stored in four consecutive locations of an aligned quad-word.

The TCB for each DMA channel can be loaded under core control by executing a quad-word memory read instruction to the DMA TCB registers. The TCB can also be loaded through chaining or by an external direct write. For transfers between internal and external memories, two TCBs must be loaded.
Direct Memory Access

TCB chain loading is requested like all other DMA operations. A TCB loading request is latched and held in the DMA controller until it becomes the highest priority request. If multiple chaining requests are present, the highest priority DMA channel is dealt with first. DMA channel request priorities are listed in Table 7-5 on page 7-37.

Setting Up and Starting the Chain

To set up and initiate a chain of DMA operations, the program should establish this sequence:

- Set up all TCBs in internal memory
- Write to the appropriate DMA control (DCNT) register

The TCB setup in the DP register includes:

- Set the CHEN (chaining enable) bit
- Define the CHTG (channel target) bits
- Define the CHPT (chaining pointer) bits, noting that the TCB’s address is four times the value of this field

Chain Insertion

A high priority DMA operation or chain can be inserted into an active DMA chain. In order to do so the current channel transactions must be stalled by setting the corresponding pause bit in the DCNT register. A new DMA chain can be inserted into the current chain without affecting the current DMA transfer. The new chain is inserted by the TigerSHARC processor core by writing the CHEN, CHTG, and CHPT bits in the TCB register DP. Once the TCB is updated, the xpause bit in register DCNT is reset and the data transfer continues from where it left off.
Two-Dimensional DMA

This section describes the changes in functionality that occur when the TigerSHARC processor is placed in two-dimensional DMA mode. Two-dimensional DMA mode is enabled by setting the 2DDMA bit in the TCB DP control register. This mode applies to external or internal addresses. The benefit of two-dimensional addressing is reordering data from a row-major orientation into a column-major representation allowing easier access to the columns.

Figure 7-15. DMA Channel 0 Chain Insertion Example
Direct Memory Access

Two-Dimensional DMA Channel Organization

In two-dimensional mode, two-dimensional DMA array addressing can be performed on any DMA channel, receiver or transmitter. The index (DI) register is loaded with the first address in the data array and maintains the current address by adding the X modify value after each transfer as long as the X count field has not reached zero. The X modify value in the DX register contains the offset added to the current address to point to the next element in the X dimension (next column). The X count field in the DX register and the column index (CIX) register\(^1\) both initially contain the number of words in the X dimension. The DMA transfer size of normal, long, or quad word accesses can only be applied in the X dimension. The DX register is decremented after each transfer and contains the number of words left in the current row. The count field of the CIX register is used to reload DX register count field when it decrements to zero.

The Y modify value in the DY register contains the offset added to the current address to point to the next element in the Y dimension (first location in next row). When the X count field reaches zero, only the Y modify value is added to the current address on the following cycle, and the Y count field is decremented. The X modifier is not added. The Y modify value in the DY register is a signed integer, allowing both increments and decrements. The modify bits are sign-extended when used with the 32-bit index register. The Y count field in the DY register initially contains the number of words in the Y dimension (number of rows) and is decremented each time the X count field reaches zero. After the Y count field reaching zero, the DMA block transfer is complete, spanning a matrix of the size X count times transfer size times Y count. If chaining is enabled, the chain pointer field (CHPT) in the DP register points to the start of a buffer in internal memory containing the next set of DMA parameters.

Following formula will help generating the DX and the DY counts and modifiers.

\(^1\) The CIX register is a DMA controller, internal usage only register.
DMA Controller Operations

The `DX` register contains the column count (COL) and the transfer size (TXS) in the X direction, where the upper 16 bits of `DX` contain the count and the lower 16 bits the modifier or stride, which is influenced by the number of rows (ROW):

```c
// DX
DX = TXS*COL << 16;; // word count in x direction
DX |= TXS*ROW;; // column stride is row length
```

The modifier for the `DY` register can be positive or negative. In this case, a negative value has been used to return to the original location offset set by one element of the transfer size TXS, also the last modifier in X direction (columns) was not applied, so we have to subtract it from the total number of locations. Finally, as the lowest 16 bits contain only the negative modifier, the upper 16 bits of the counts must be masked before both halves are logically ORed:

```c
// DY
DY = ROW << 16;; // word count in y
DY |= (-TXS*(((COL-1)*ROW)-1))&0xffff;; // row stride backwards
```

Two-Dimensional DMA Operation

A two-dimensional DMA transfer occurs in the following manner.

In the first phase,

- The current address stored in the `DI` register is output and a DMA memory cycle is initiated.
- In the same cycle, the X modify value stored in the `DX` register modify field is added to the current address in the `DI` register.
Direct Memory Access

- The \( DX \) register count field is decremented.
- If the decremented \( DX \) register count field is zero, the second phase is executed and no modify value from the \( DX \) register modify field is added.

In the second phase,
- The X count is restored into the \( DX \) register count field from the count field of the \( CIX \) register.\(^1\)
- The Y increment value in the \( DY \) register modify field is added to the current address in the \( DI \) register.
- The \( DY \) register count field is decremented.
- If the Y count is zero, the DMA sequence is ended, and the channel becomes inactive until the next pointer is written again.

A key feature of two-dimensional DMA sequence (or any DMA sequence) is the first DMA transfer begins before the address is modified. This means the DMA cannot be disabled by setting either the count in the \( DX \) register or the count in the \( DY \) register to zero. To disable two-dimensional DMA, the \( 2DDMA \) bit must be set to zero in the \( DP \) control register. To disable the DMA channel, the \( TY \) field should be reset in the \( DP \) register.

When the X count becomes zero, but the Y count is nonzero, the X count must be reloaded with the original value. The count field of the \( DX \) register functions as the initial count register while \( DX \) count decrements. The count field of the \( DX \) register holds the original count value that is restored in \( DX \) count register.

---

\(^1\) The \( CIX \) register is a DMA controller, internal usage only register.
Examples of Two Dimensional DMA Transfers

Using two dimensional DMA is an efficient method of rearranging data received in row-major fashion, so that an algorithm, requiring sequential access to the column elements (column-major) can benefit from the reordering.

First Example: 2 columns, 6 rows, transfer size is 1:

2col x 6row memory layout (linear addressing, with word #):

<table>
<thead>
<tr>
<th>1</th>
<th>2</th>
<th>1-3-5-7-9-11-</th>
</tr>
</thead>
<tbody>
<tr>
<td>3</td>
<td>4</td>
<td>-2-4-6-8-10-12</td>
</tr>
<tr>
<td>5</td>
<td>6</td>
<td></td>
</tr>
<tr>
<td>7</td>
<td>8</td>
<td></td>
</tr>
<tr>
<td>9</td>
<td>10</td>
<td></td>
</tr>
<tr>
<td>11</td>
<td>12</td>
<td></td>
</tr>
</tbody>
</table>

COL = 2, ROW = 6, TXS = 1 results in following parameters:

\[ DX = (1 * 2) \ll 16; \] // (TXS * COL)
\[ DX |\!| (1 * 6); \] // (TXS * ROW)

So the value placed in DX will be 0x00020006. For DY following calculations are necessary:

\[ DY = 6 \ll 16;; \] // ROW
\[ DY |\!| -1*(((1)*6)-1)&0xffff;; \] // -TXS*(((COL-1)*ROW)-1)

So the value placed in DY will be 0x0006fffb.

Second Example

3col x 4 row memory layout (linear addressing, with word #):

<table>
<thead>
<tr>
<th>1</th>
<th>2</th>
<th>3</th>
<th>1-4-7-10-</th>
</tr>
</thead>
<tbody>
<tr>
<td>4</td>
<td>5</td>
<td>6</td>
<td>-2-5-8-11-</td>
</tr>
<tr>
<td>7</td>
<td>8</td>
<td>9</td>
<td>-3-6-9-12</td>
</tr>
<tr>
<td>10</td>
<td>11</td>
<td>12</td>
<td></td>
</tr>
</tbody>
</table>

7-48 ADSP-TS201 TigerSHARC Processor Hardware Reference
Direct Memory Access

COL = 3, ROW = 4, TXS = 1 results in following parameters:

\[
\begin{align*}
DX &= (1 \times 3) \ll 16; \quad \text{//(TXS \times COL)} \\
DX &= (1 \times 4); \quad \text{//(TXS \times ROW)} \\
\end{align*}
\]

So the value placed in DX will be 0x00030004. For DY following calculations are necessary:

\[
\begin{align*}
DY &= 4 \ll 16; \quad \text{// ROW} \\
DY &= -1\ast(((2)\times4)-1)\&0xffff; \quad \text{// -TXS*(((COL-1)\times ROW)-1)} \\
\end{align*}
\]

So the value placed in DY will be 0x0004fff9.

DMA Interrupts

A DMA transfer complete interrupt is generated by the DMA if the INT bit in the channel’s DPx register is set. (See “DPx Register” on page 7-19.) The DMA generates the interrupt when an active DMA channel’s transfer count (X count field in the channel’s DXx register) decrements to zero and the transfer completes. When the DMA destination is in internal memory, transfer complete means that the last element of data has reached its destination in internal memory. When the DMA destination is external, transfer complete means that the last element of data has reached the external peripheral’s buffer (either the external port OFIFO or the link port transmit buffer).

The count register(s) must decrement to zero as a result of the actual DMA transfers—merely writing zero to a count register does not generate the interrupt.
DMA Controller Operations

Each DMA channel transfer control block has its own interrupt. The DMA interrupts are latched in the ILAT register and are enabled in the IMASK register. For more information, see “DMA Complete Interrupts” on page 6-20.

Although the EP DMA channel access priority rotates, the interrupt priorities of all DMA channels are fixed.

As an alternative to interrupts, polling the DSTAT register can be used to determine when a single DMA sequence has completed. If chaining is enabled, however, polling the DSTAT should not be used because the next DMA sequence may be underway by the time the polled status is returned.
Starting and Stopping DMA Sequences

This section discusses starting, ending, suspending, and resuming DMA sequences.

Starting a DMA Sequence

A DMA sequence starts when one of the following occurs:

- The DMA channel is enabled, chaining is disabled, the DMA request bit is set, and a DMA request input transitions from high to low.

- The DMA channel is enabled, chaining is disabled, and the DMA request bit is cleared.

- Chaining is enabled, the DMA request bit is set, and a DMA request input transitions from high to low. Once the block transfer is completed, TCB chain loading of the channel TCB registers occurs and the DMA starts a new DMA sequence.

- Chaining is enabled and the DMA request bit is reset. Once the block transfer is completed, TCB chain loading of the channel TCB registers occurs and the DMA starts a new DMA sequence.

To start a new DMA sequence after the current one is finished, the program must write new parameters to the TCB registers.

Writing an active TCB to an active TCB register causes a hardware error interrupt.
External Port DMA

Ending a DMA Sequence

A DMA sequence ends when one of the following occurs:

- A channel is disabled by resetting the \( TY \) field in one of the TCB and \( DP \) registers.
- The count registers decrement to zero and chaining ends.

Suspending a DMA Sequence

A DMA sequence is suspended when the pause bit in one of the \( DCNT \) registers is set. (See “DCNTST Register” on page 7-27.)

Resuming a DMA Sequence

A DMA sequence is resumed when the pause bit in one of the \( DCNT \) registers is reset. (See “DCNTCL Register” on page 7-27.)

Whenever the DMA request goes low again, the DMA sequence continues from where it left off.

External Port DMA

Four DMA channels define internal/external memory transfers, where each channel has two TCB \( DP \) registers and where the \( LEN \) field in each one must be the same. These registers define the following.

- A channel is enabled if both \( TY \) fields in the TCB \( DP \) registers are not set to 000.
- A channel’s priority is high if the \( PR \) bit is set in either one of the TCB and \( DP \) registers.
- DMA interrupt is enabled if the \( INT \) bit is set in either one of the TCB \( DP \) registers.
Direct Memory Access

- DMA request mode is enabled if the DRQ bit is set in either one of the TCB DP registers.

- Internal and external memory transfer is set up in the TY field of the TCB and DP registers. This allows efficient data transfers between the TigerSHARC processor’s internal memory and external memory or devices.

The CHTG field must point to the operating channel in both TCBs’ DP registers if chaining is enabled—for example, in TCBs of channel 1, the CHTG must be 1 in both TCBs. Each TCB register is loaded according to the respective CHPT fields.

Internal and External Address Generation

DMA transfers between the TigerSHARC processor internal memory and external memory require the DMA controller to generate addresses for both. The external address is generated according to the information in the external memory TCB register. (See “DMA Channel Control” on page 7-16.) The internal memory address is generated according to the information in the internal memory TCB register, where both TCB registers belong to the same channel. If the 2DDMA bit in TCB DP register is set for one of the TCBs, the address is also updated with the modifier values in the TCB DY register. Internal-to-internal memory and external-to-external memory transfers are not supported, although external memory to and from external device transfers are supported.
External Port DMA

External Port DMA Transfer Types

The $TY$ field in the TCBx $DP$ register specifies the type of DMA transfer to perform. Each DMA channel has a transmitter and a receiver TCB register, where transmitters drive data and receivers get it.

External to Internal Memory

There are two ways to move data from external to internal memory:

- Program one of the four DMA channels to move a block from one memory to the other. In this case, the transmitter and receiver TCBs should be programmed.

  The configurations for transmitter and receiver TCBs in this case are described in Table 7-6 on page 7-55 and Table 7-7 on page 7-56 respectively.

- Use one of the two AutoDMA register channels. In this case, the external device first programs the channel’s TCB registers and then writes data to the target AutoDMA data register. Writing to this address invokes the corresponding DMA channel.

  The configuration for the receiver TCB in this case is described in Table 7-8 on page 7-57.
# Direct Memory Access

## Table 7-6. Dual TCB Channel – External Memory TCB

<table>
<thead>
<tr>
<th>Register</th>
<th>Field</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DI</td>
<td></td>
<td>External memory address</td>
</tr>
<tr>
<td>DX</td>
<td></td>
<td>Number of words to transfer and address modifier. The DX register also refers to X dimension data when the 2DDMA bit is set in the DP register.</td>
</tr>
<tr>
<td>DY</td>
<td></td>
<td>Number of Y dimension words to transfer and address modifier when the 2DDMA bit is set in the DP register; otherwise irrelevant.</td>
</tr>
<tr>
<td>DP</td>
<td>TY</td>
<td>External memory</td>
</tr>
</tbody>
</table>
| DP       | PR    | Sets the channel priority  
1 – Increases the channel priority – the cluster bus transaction is signaled as a DFA (Priority Access) transaction |
| DP       | 2DDMA | Sets the two-dimensional mode  
1 – Enables the two-dimensional mode – DY register becomes relevant  
DMA mode. (See "Two-Dimensional DMA" on page 7-44.) |
| DP       | LEN   | Can be word, double word or quad-word. Note the lowest internal bus utilization is achieved with quad-words. |
| DP       | INT   | Sets interrupt  
1 – Interrupts the core after the complete block is transferred. (Enabled if set in either the source or destination TCB DP register.) |
| DP       | DRQ   | Sets transfer mode  
1 – Transfers each data item per DMARx assertion. (Enabled if set in either the source or destination TCB DP register.) |
| DP       | CHEN  | Sets chaining mode  
1 – Enables chaining. |
| DP       | CHTG  | Defines the TCB register to be loaded; must be the same channel. |
| DP       | CHPT  | Chaining pointer – relevant when chaining is enabled. |
### Table 7-7. Dual TCB Channel – Internal Memory TCB

<table>
<thead>
<tr>
<th>Register</th>
<th>Field</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DI</td>
<td></td>
<td>Internal memory address</td>
</tr>
<tr>
<td>DX</td>
<td></td>
<td>Number of words to transfer and address modifier. The DX register also refers to X dimension data when the 2DDMA bit is set in the DP register.</td>
</tr>
<tr>
<td>DY</td>
<td></td>
<td>Number of Y dimension words to transfer and address modifier when the 2DDMA bit is set in the DP register; otherwise irrelevant.</td>
</tr>
<tr>
<td>DP</td>
<td>TY</td>
<td>Internal memory</td>
</tr>
<tr>
<td>DP</td>
<td>PR</td>
<td>Channel priority 1 – Increases the channel priority – internal bus DMA request priority is high.</td>
</tr>
<tr>
<td>DP</td>
<td>2DDMA</td>
<td>Sets the two-dimensional DMA mode 1 – Enables the two-dimensional DMA mode; the DY register becomes relevant (See &quot;Two-Dimensional DMA&quot; on page 7-44).</td>
</tr>
<tr>
<td>DP</td>
<td>LEN</td>
<td>Must be the same as the value specified in the LEN field of the transmitter’s TCB DP register.</td>
</tr>
<tr>
<td>DP</td>
<td>INT</td>
<td>Sets interrupt 1 – Interrupts the core once the complete block is transferred. (Enabled if set in either the source or destination TCB DP register.)</td>
</tr>
<tr>
<td>DP</td>
<td>DRQ</td>
<td>Sets transfer mode 1 – Transfers each data item per DMARx assertion. (Enabled if set in either the source or destination TCB DP register.)</td>
</tr>
<tr>
<td>DP</td>
<td>CHEN</td>
<td>Sets chaining mode; 1 – Enables chaining This setting must be identical to transmitter TCB.</td>
</tr>
<tr>
<td>DP</td>
<td>CHTG</td>
<td>Defines TCB register to be loaded; must be the same channel.</td>
</tr>
<tr>
<td>DP</td>
<td>CHPT</td>
<td>Chaining pointer – relevant when chaining is enabled.</td>
</tr>
</tbody>
</table>
Consider the case where the transfer direction is from internal to external memory. The configurations for transmitter and receiver TCBs are described in Table 7-9 on page 7-58 and Table 7-10 on page 7-59 respectively.

<table>
<thead>
<tr>
<th>Register</th>
<th>Field</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DI</td>
<td>Internal memory address</td>
<td></td>
</tr>
<tr>
<td>DX</td>
<td>Number of words to transfer and address modifier. The DX register also refers to X dimension data when the 2DDMA bit is set in the DP register.</td>
<td></td>
</tr>
<tr>
<td>DY</td>
<td>Number of Y dimension words to transfer and address modifier when 2DDMA bit is set in the DP register; otherwise irrelevant.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>TY Internal memory</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>PR Channel priority – must always be set to 1</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>2DDMA Sets the two-dimensional DMA mode 1 – Enables two-dimensional DMA mode; DY register becomes relevant. (See “Two-Dimensional DMA” on page 7-44.)</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>LEN Can be word, double word, or quad-word.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>INT Sets interrupt 1 – Interrupts the core once the complete block is transferred.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>DRQ Must always be set to 1.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>CHEN Sets chaining mode 1 – Enables chaining</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>CHTG Defines TCB register to be loaded; must be the same channel.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>CHPT Chaining pointer – relevant when chaining is enabled.</td>
<td></td>
</tr>
</tbody>
</table>
Table 7-9. Internal Memory TCB

<table>
<thead>
<tr>
<th>Transmitter TCB Configuration</th>
<th>Register</th>
<th>Field</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DI</td>
<td></td>
<td></td>
<td>Internal memory address</td>
</tr>
<tr>
<td>DX</td>
<td></td>
<td></td>
<td>Number of words to transfer and address modifier. The DX register also refers to X dimension data when the 2DDMA bit is set in the DP register.</td>
</tr>
<tr>
<td>DY</td>
<td></td>
<td></td>
<td>Number of Y dimension words to transfer and address modifier when the 2DDMA bit is set in the DP register; otherwise irrelevant.</td>
</tr>
<tr>
<td>DP</td>
<td>TY</td>
<td></td>
<td>Internal memory</td>
</tr>
</tbody>
</table>
| DP                            | PR       |       | Channel priority  
1 – Increases the channel priority – internal bus DMA request priority is high. |
| DP                            | 2DDMA    |       | Sets the two-dimensional DMA mode  
1 – Enables two-dimensional DMA mode; the DY register becomes relevant. (See “Two-Dimensional DMA” on page 7-44.) |
| DP                            | LEN      |       | Can be word, double word, or quad-word. Note the lowest internal bus utilization is achieved with quad-words. |
| DP                            | INT      |       | Sets interrupt  
1 – Interrupts the core once the complete block is transferred. |
| DP                            | DRQ      |       | Sets transfer mode  
1 – Transfers each data item per request. |
| DP                            | CHEN     |       | Sets chaining mode  
1 – Enables chaining. |
| DP                            | CHTG     |       | Defines TCB register to be loaded; must be the same channel. |
| DP                            | CHPT     |       | Chaining pointer – relevant when chaining is enabled. |
### Direct Memory Access

#### Table 7-10. External Memory TCB

<table>
<thead>
<tr>
<th>Register</th>
<th>Field</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DI</td>
<td></td>
<td>External memory address</td>
</tr>
<tr>
<td>DX</td>
<td></td>
<td>Number of words to transfer and address modifier. The DX register also refers to X dimension data when the 2DDMA bit is set in the DP register.</td>
</tr>
<tr>
<td>DY</td>
<td></td>
<td>Number of Y dimension words to transfer and address modifier when the 2DDMA bit is set in the DP register; otherwise irrelevant.</td>
</tr>
<tr>
<td>DP</td>
<td>TY</td>
<td>External memory</td>
</tr>
</tbody>
</table>
| DP       | PR    | Channel priority  
1 – Increases the channel priority – cluster bus transaction is signaled as a Priority Access transaction. |
| DP       | 2DDMA | Sets the two-dimensional DMA mode  
1 – Enables two-dimensional DMA mode; the DY register becomes relevant.  
(See “Two-Dimensional DMA” on page 7-44.) |
| DP       | LEN   | Must be the same as the value specified in the LEN field of the transmitter’s TCB DP register. |
| DP       | INT   | Sets interrupt  
1 – Interrupts the core once the complete block is transferred.  
Irrelevant when the INT bit is set in the transmitter’s TCB DP register. |
| DP       | DRQ   | Sets transfer mode  
1 – Transfers each data item per request.  
Irrelevant when the DRQ bit is set in the transmitter’s TCB DP register. |
| DP       | CHEN  | Sets chaining mode  
1 – Enables chaining  
The setting must be identical to transmitter TCB. |
| DP       | CHTG  | Defines TCB register to be loaded; must be the same channel. |
| DP       | CHPT  | Chaining pointer – relevant when chaining is enabled. |
External Port DMA

External I/O Device to External Memory (Flyby)

Consider the case where the transfer direction is from an external I/O device to the external memory. There is no internal memory access. The configurations for transmitter and receiver TCBs are described in Table 7-11 on page 7-60 and Table 7-12 on page 7-61 respectively. For more information, see “Flyby Transactions” on page 8-74.

Table 7-11. I/O Device TCB
– External I/O Device to External Memory (Flyby)

<table>
<thead>
<tr>
<th>Transmitter TCB Configuration</th>
<th>Register</th>
<th>Field</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DI</td>
<td></td>
<td>DI</td>
<td>Irrelevant</td>
</tr>
<tr>
<td>DX</td>
<td></td>
<td>DX</td>
<td>Number of words to transfer and address modifier; modify must be 0.</td>
</tr>
<tr>
<td>DY</td>
<td></td>
<td>DY</td>
<td>Irrelevant</td>
</tr>
<tr>
<td>DP</td>
<td>TY</td>
<td>TY</td>
<td>External I/O device (Flyby)</td>
</tr>
<tr>
<td>DP</td>
<td>PR</td>
<td>PR</td>
<td>Irrelevant</td>
</tr>
<tr>
<td>DP</td>
<td>2DDMA</td>
<td>2DDMA</td>
<td>Must be cleared.</td>
</tr>
<tr>
<td>DP</td>
<td>LEN</td>
<td>LEN</td>
<td>Can be either word or double word.</td>
</tr>
<tr>
<td>DP</td>
<td>INT</td>
<td>INT</td>
<td>Sets interrupt 1 – Intermittently the core once the complete block is transferred.</td>
</tr>
<tr>
<td>DP</td>
<td>DRQ</td>
<td>DRQ</td>
<td>Sets transfer mode 1 – Transfers each data item per request.</td>
</tr>
<tr>
<td>DP</td>
<td>CHEN</td>
<td>CHEN</td>
<td>Sets chaining 1 – Enables chaining.</td>
</tr>
<tr>
<td>DP</td>
<td>CHTG</td>
<td>CHTG</td>
<td>Defines TCB register to be loaded; must be channel 0.</td>
</tr>
<tr>
<td>DP</td>
<td>CHPT</td>
<td>CHPT</td>
<td>Chaining pointer – relevant when chaining is enabled.</td>
</tr>
</tbody>
</table>
Table 7-12. External Memory TCB
– External I/O Device to External Memory (Flyby)

<table>
<thead>
<tr>
<th>Register</th>
<th>Field</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DI</td>
<td>External memory address</td>
<td></td>
</tr>
<tr>
<td>DX</td>
<td>Number of words to transfer, increment is irrelevant.</td>
<td></td>
</tr>
<tr>
<td></td>
<td>The DX register also refers to X dimension data when the 2DDMA bit is set in the DP register.</td>
<td></td>
</tr>
<tr>
<td>DY</td>
<td>Number of Y dimension words to transfer and address modifier when the 2DDMA bit is set in the DP register; otherwise irrelevant.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>TY</td>
<td>External memory</td>
</tr>
<tr>
<td>DP</td>
<td>PR</td>
<td>Channel priority</td>
</tr>
<tr>
<td></td>
<td>1 – Increases the channel priority – cluster bus transaction is signaled as a Priority Access transaction</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>2DDMA</td>
<td>Sets the two-dimensional DMA mode</td>
</tr>
<tr>
<td></td>
<td>1 – Enables two-dimensional DMA mode; the DY register becomes relevant.</td>
<td></td>
</tr>
<tr>
<td></td>
<td>(See “Two-Dimensional DMA” on page 7-44.)</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>LEN</td>
<td>Must be the same as the value specified in the LEN field of the transmitter’s TCB DP register.</td>
</tr>
<tr>
<td>DP</td>
<td>INT</td>
<td>Sets interrupt</td>
</tr>
<tr>
<td></td>
<td>1 – Interrupts the core once the complete block is transferred.</td>
<td></td>
</tr>
<tr>
<td></td>
<td>Irrelevant when INT bit is set in the transmitter’s TCB DP register.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>DRQ</td>
<td>Sets transfer mode</td>
</tr>
<tr>
<td></td>
<td>1 – Transfers each data item per request.</td>
<td></td>
</tr>
<tr>
<td></td>
<td>Irrelevant when DRQ bit is set in transmitter’s TCB DP register.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>CHEN</td>
<td>Sets chaining mode</td>
</tr>
<tr>
<td></td>
<td>1 – Enables chaining.</td>
<td></td>
</tr>
<tr>
<td></td>
<td>Irrelevant when the CHEN bit is set in the transmitter’s TCB DP register.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>CHTG</td>
<td>Defines TCB register to be loaded; must be channel 0.</td>
</tr>
<tr>
<td>DP</td>
<td>CHPT</td>
<td>Chaining pointer – relevant when chaining is enabled.</td>
</tr>
</tbody>
</table>
External Port DMA

External Memory to External I/O Device (Flyby)

Consider the case where the transfer direction is from external memory to an external I/O device. There are no internal memory accesses and the data held in the OFIFO is irrelevant. The configurations for transmitter and receiver TCBs are described in Table 7-13 on page 7-62 and Table 7-14 on page 7-63 respectively. For more information, see “Flyby Transactions” on page 8-74.

Table 7-13. External Memory TCB
– External Memory to External I/O Device (Flyby)

<table>
<thead>
<tr>
<th>Register</th>
<th>Field</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DI</td>
<td></td>
<td>External memory address</td>
</tr>
<tr>
<td>DX</td>
<td></td>
<td>Number of data elements to transfer, increment is irrelevant. The DX register also refers to X dimension data when the 2DDMA bit is set in the DP register.</td>
</tr>
<tr>
<td>DY</td>
<td></td>
<td>Number of Y dimension words to transfer and address modifier when the 2DDMA bit is set in the DP register; otherwise irrelevant.</td>
</tr>
<tr>
<td>DP</td>
<td>TY</td>
<td>External memory</td>
</tr>
<tr>
<td>DP</td>
<td>PR</td>
<td>Sets channel priority – 1 – Increases the channel priority – cluster bus transaction is signaled as a PA (Priority Access) transaction.</td>
</tr>
</tbody>
</table>
| DP       | 2DDMA | Sets the two-dimensional mode  
  1 – Enables two-dimensional DMA mode; the DY register becomes relevant. (See “Two-Dimensional DMA” on page 7-44.) |
| DP       | LEN   | Can be either word or double word.                                          |
| DP       | INT   | Sets interrupt – 1 – Interrupts the core once the complete block is transferred |
| DP       | DRQ   | Sets transfer mode – 1 – Transfers each data item per request.              |
| DP       | CHEN  | Sets chaining mode – 1 – Enables chaining.                                  |
| DP       | CHTG  | Defines TCB register to be loaded; must be channel 0.                       |
| DP       | CHPT  | Chaining pointer – relevant when chaining is enabled.                       |
Direct Memory Access

Table 7-14. I/O Device TCB – External Memory to External I/O Device (Flyby)

<table>
<thead>
<tr>
<th>Register</th>
<th>Field</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DI</td>
<td></td>
<td>Irrelevant</td>
</tr>
<tr>
<td>DX</td>
<td></td>
<td>Number of words to transfer and address modifier.</td>
</tr>
<tr>
<td>DY</td>
<td></td>
<td>Irrelevant</td>
</tr>
<tr>
<td>DP</td>
<td>TY</td>
<td>External I/O device (Flyby)</td>
</tr>
<tr>
<td>DP</td>
<td>PR</td>
<td>Irrelevant</td>
</tr>
<tr>
<td>DP</td>
<td>2DDMA</td>
<td>Must be cleared.</td>
</tr>
<tr>
<td>DP</td>
<td>LEN</td>
<td>Must be the same value as the LEN field in the transmitter’s TCB DP register.</td>
</tr>
<tr>
<td>DP</td>
<td>INT</td>
<td>Sets interrupt 1 – Interrupts the core once the complete block is transferred. Irrelevant if the INT bit is set in the transmitter’s TCB DP register.</td>
</tr>
<tr>
<td>DP</td>
<td>DRQ</td>
<td>Sets transfer mode 1 – Transfers each data item per request. Irrelevant if the DRQ bit is set in the transmitter’s TCB DP register.</td>
</tr>
<tr>
<td>DP</td>
<td>CHEN</td>
<td>Sets chaining 1 – Enables chaining. Irrelevant if the CHEN bit is set in the transmitter’s TCB DP register.</td>
</tr>
<tr>
<td>DP</td>
<td>CHTG</td>
<td>Defines TCB register to be loaded; must be channel 0.</td>
</tr>
<tr>
<td>DP</td>
<td>CHPT</td>
<td>Chaining pointer – relevant when chaining is enabled.</td>
</tr>
</tbody>
</table>

DMA Semaphores

Since DMA channels are a system resource, a device that wants to use a DMA channel must determine whether one is currently available. It is recommended this be done through software, using an agreed upon location
External Port DMA

in each processor’s memory to specify which channels are currently available. This memory location should only be modified using a read-modify-write operation.

Handshake Mode

Each of the four external port DMA channels can be driven by a request control—DMAR0, DMAR1, DMAR2 and DMAR3.

An access is requested when the external device pulls DMARx low. The falling edge is detected by the TigerSHARC processor and synchronized to the processor’s I/O clock. In order to be recognized in a particular cycle, the DMARx low transition must meet the setup time specified in the data sheet, otherwise it may take effect in the following cycle. The DMA channel x (same channel as DMARx) issues the transactions according to the programming in its TCBs. If the source is external, or the transaction is flyby, the transaction is issued to the cluster bus via OFIFO. If the source is internal, the transaction is issued on the internal bus, and the data read from the source is written to the cluster bus via OFIFO.

The external device does not have to wait for the actual bus transaction before making another request. The requests are stored in a working counter maintained internally by the TigerSHARC processor. The counter holds a maximum of 15 requests, so the external device can make up to 15 requests before the first one has been serviced.

More than 15 requests without a DMA transfer is likely to cause unpredictable results.

The associated DMARx signal is ignored if a DMA channel is disabled.
Direct Memory Access

Flyby Mode

In flyby mode, the TigerSHARC processor operates as an independent DMA controller. Flyby mode transfers are similar to standard DMA transfers; although it is important that the data width match the external I/O device’s width.

For these transactions, the TigerSHARC processor relinquishes the data bus during the transaction and then activates in the external memory and the I/O devices in parallel. Memory is controlled in the regular way, and the I/O device is controlled with the \texttt{TURD}, \texttt{TOWR}, and \texttt{TOWN} pins. The external (SDRAM) memory access behaves the same as all other SDRAM accesses—the FIFOs drive or latch data if the transfer is to or from internal memory. Finally, the TCB for the DMA channel must be preloaded to generate the external memory addresses and word count.

The \texttt{TY} field in the channel 0 \texttt{DCD0:DP} or \texttt{DSDO:DP} register (which is external I/O device) and the \texttt{DRQ} field in the same register (where DMA request is enabled) allow the DMA to be driven by the DMA request lines, and the I/O device to be controlled by the control signals. This way, the external port bus interface unit (EBIU) can interface external memory and external I/O devices in a single cycle. For more information, see “Flyby Transactions” on page 8-74.
Link Port DMA

Four DMA channels define link ports to internal memory, external memory, or other link ports transfers. Another four channels define internal memory, external memory, or other link ports to link ports transfers.

Link Ports DMA Transfer Types

The $TY$ field in the TCBx $DP$ register specifies the type of DMA transfer to be used. Receiving link DMA channels have a receiver TCB register, and transmitting link DMA channels have a transmitter TCB register.

Link Port to Internal/External Memory

In order to move data from link ports to internal or external memory, the receiver TCB must be programmed. The DMA initiates a transfer by making a request of the internal bus. As a result, data is transferred from the link port to internal or external memory (OFIFO). The TCB configuration is described in Table 7-15 on page 7-67.
### Direct Memory Access

Table 7-15. Link to Internal/External Memory TCB

<table>
<thead>
<tr>
<th>Register</th>
<th>Field</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DI</td>
<td>Internal/external memory address</td>
<td></td>
</tr>
<tr>
<td>DX</td>
<td>Number of words to transfer and address modifier. The DX register also refers to X dimension data when the 2DDMA bit is set in the DP register.</td>
<td></td>
</tr>
<tr>
<td>DY</td>
<td>Number of Y dimension words to transfer and address modifier when the 2DDMA bit is set in the DP register; otherwise irrelevant.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>TY Internal/external memory</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>PR Channel priority 1......Increases the channel priority – internal/cluster bus DMA request priority is high.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>2DDMA Sets the two-dimensional DMA mode 1 – Enables two-dimensional DMA mode; the DY register becomes relevant. (See “Two-Dimensional DMA” on page 7-44.)</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>LEN Always set to quad-word.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>INT Sets interrupt 1 – Interrupts the core once the complete block is transferred.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>DRQ Don’t care (set in hardware).</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>CHEN Sets chaining 1 – Enables chaining.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>CHTG Defines the TCB register to be loaded; may be any link channel.</td>
<td></td>
</tr>
<tr>
<td>DP</td>
<td>CHPT Chaining pointer – relevant if chaining is enabled.</td>
<td></td>
</tr>
</tbody>
</table>
Link Port DMA

Internal/External Memory to Link

Consider the case where the transfer direction is from internal or external memory to link port.

- Internal memory to Link
  The DMA initiates a transfer by requesting data from internal memory. At this stage, data is loaded to the link port when it is available on the internal data bus.

- External memory to Link
  The DMA initiates an internal transfer to fill up the OFIFO buffer, and the external port starts an output to the external device (one or more times depending on the packing mode). Link address is put into IFIFO with read data, whereupon the IFIFO requests an internal bus access to link port. Once the bus is granted, it drives the data on the data bus. The transmitter TCB configuration is described in Table 7-16 on page 7-69.
Direct Memory Access

Table 7-16. Internal/External Memory to Link TCB

<table>
<thead>
<tr>
<th>Transmitter TCB Configuration</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>DI Field</td>
<td>Internal/external memory address</td>
</tr>
<tr>
<td>DX</td>
<td>Number of words to transfer and address modifier. The DX register also refers to X dimension data when the 2DDMA bit is set in the DP register.</td>
</tr>
<tr>
<td>DY</td>
<td>Number of Y dimension words to transfer and address modifier when the 2DDMA bit is set in the DP register; otherwise irrelevant.</td>
</tr>
<tr>
<td>DP TY</td>
<td>Internal/external memory</td>
</tr>
<tr>
<td>DP PR</td>
<td>Channel priority</td>
</tr>
<tr>
<td></td>
<td>1 – Increases the channel priority – internal/cluster bus DMA request priority is high.</td>
</tr>
<tr>
<td>DP 2DDMA</td>
<td>Sets the two-dimensional DMA mode</td>
</tr>
<tr>
<td></td>
<td>1 – Enables two-dimensional DMA mode; DY register becomes relevant. (See &quot;Two-Dimensional DMA&quot; on page 7-44.)</td>
</tr>
<tr>
<td>DP LEN</td>
<td>Always set to quad-word.</td>
</tr>
<tr>
<td>DP INT</td>
<td>Sets interrupt</td>
</tr>
<tr>
<td></td>
<td>1 – Interrupts the core once the complete block is transferred.</td>
</tr>
<tr>
<td>DP DRQ</td>
<td>Don’t care (set in hardware)</td>
</tr>
<tr>
<td>DP CHEN</td>
<td>Sets chaining</td>
</tr>
<tr>
<td></td>
<td>1 – Enables chaining.</td>
</tr>
<tr>
<td>DP CHTG</td>
<td>Defines TCB register to be loaded; may be any link channel.</td>
</tr>
<tr>
<td>DP CHPT</td>
<td>Chaining pointer – relevant when chaining is enabled.</td>
</tr>
</tbody>
</table>

Receiving Link Port to Link Port

In order to move data from one link port to another link port, the receiver TCB should be programmed as a link TCB. See Table 7-17 on page 7-70. The DMA initiates a transfer by requesting the internal bus, and data is transferred from the link port that requested the DMA services to the receiver link port.
Table 7-17. Link to Link TCB

<table>
<thead>
<tr>
<th>Register</th>
<th>Field</th>
<th>Description</th>
</tr>
</thead>
</table>
| DI       | Link port address | - 0x1F04A0 for link #0  
- 0x1F04A8 for link #1  
- 0x1F04B0 for link #2  
- 0x1F04B8 for link #3 |
| DX       | Number of words to transfer and address modifier – the address modifier must be set to 0. |
| DY       | Irrelevant |
| DP       | TY      | I/O link ports or DMA data registers |
| DP       | PR      | Channel priority  
1 – Increases the channel priority—internal bus DMA request priority is high. |
| DP       | 2DDMA   | Cleared. |
| DP       | LEN     | Always set to quad-word. |
| DP       | INT     | Sets interrupt  
1 – Interrupts the core once the complete block is transferred. |
| DP       | DRQ     | Don't care (set in hardware). |
| DP       | CHEN    | Sets chaining  
1 – Enables chaining. |
| DP       | CHTG    | Defines TCB register to be loaded; may be any link channel. |
| DP       | CHPT    | Chaining pointer – relevant if chaining is enabled. |
Direct Memory Access

DMA Throughput

This section discusses overall DMA throughput when several DMA channels are trying to access internal or external memory at the same time.

Internal Memory DMA

The DMA channels and I/O devices arbitrate for access to the TigerSHARC processor’s internal memory and buses. The DMA controller determines, on a cycle-by-cycle basis, which channel is allowed access to the three internal buses or OFIFO, and consequently which channel reads or writes to internal memory. The priority of the DMA channels is discussed in “DMA Channel Prioritization” on page 7-36.

There is no overall throughput loss in switching between channels. Any combination of link ports and external port transfers has the same maximum internal transfer rate, which is one DMA transfer per cycle.

The internal memory clock rate and bus widths are at least twice the external port clock rate and bus width. Thus the bandwidth of each internal bus is at least four times greater than the external port bandwidth.

In the case of link ports, assuming all the links operate at half the core frequency, the maximum link port bandwidth is 1/16 of an internal bus bandwidth.

External Memory DMA

When the DMA transfer is between the TigerSHARC processor internal memory and external memory, the external memory may have one or more wait states. External memory wait states, however, do not reduce the
DMA Operation on Boot

DMA operation on booting is described in “Processor Booting Methods” on page 11-2.
DMA Examples

This section provides the following DMA examples:

- Example Code for External Port DMA Transfers With Interrupts (Listing 7-1)
- Example Code for Chained External Port DMA Transfers With Interrupts (Listing 7-2)
- Example Code for Cross Channel Link Port DMA Transfers With Interrupts (Listing 7-3)

Listing 7-1. External Port DMA Transfers With Interrupts

```assembly
#include "defts201.h"
#define N 1024 /* Number of 32-bit words to DMA */
#define MODIFY 4 /* Modify for quad transfers */

.section data1a
.align 4
.var register_store[8]; /* Internal memory used to store registers within ISR */
.align 4
.var register_store_2[4]; /* Internal memory used to store registers within ISR */
.align 4
```
DMA Examples

.var DC0_Source_TCB[4] =
    tx_data, /* DI register */
    N<<16 | MODIFY, /* DX register */
    0x0, /* DY register */
    TCB_INTMEM | TCB_QUAD | TCB_INT; /* DP register */
.align 4;
.var DC0_Destination_TCB[4] =
    ext_data, /* DI register */
    N<<16 | MODIFY, /* DX register */
    0x0, /* DY register */
    TCB_EXTMEM | TCB_QUAD | TCB_INT; /* DP register */
.align 4;
.var DC1_Source_TCB[4] =
    ext_data, /* DI register */
    N<<16 | MODIFY, /* DX register */
    0x0, /* DY register */
    TCB_EXTMEM | TCB_QUAD | TCB_INT; /* DP register */
.align 4;
.var DC1_Destination_TCB[4] =
    rx_data, /* DI register */
    N<<16 | MODIFY, /* DX register */
    0x0, /* DY register */
    TCB_INTMEM | TCB_QUAD | TCB_INT; /* DP register */

.section data2a:
.var tx_data[N]; /* Buffer containing data to transmit to external memory */
.var rx_data[N]; /* Buffer containing data received from external memory */
.section sdram0:
.var ext_data[N]; /* Buffer in external memory where DMA data is written to and read from */

.section program:
Direct Memory Access

Initialize_Bus_SDRAM:
/* Initializes SYSCON and SDRCON */
xr0 = SYSCON_MP_WID64 | SYSCON_MEM_WID64 |
    SYSCON_MSH_PIPE2 | SYSCON_MSH_WT0 | SYSCON_MSH_IDLE |
    SYSCON_MS1_PIPE1 | SYSCON_MS1_WT0 | SYSCON_MS1_IDLE |
    SYSCON_MS0_SLOW | SYSCON_MS0_WT3 | SYSCON_MS0_IDLE ::
SYSCON = xr0;;

xr0 = SDRCON_INIT | SDRCON_RAS2PC5 | SDRCON_PC2RAS2 |
    SDRCON_REF3700 | SDRCON_PG256 | SDRCON_CLAT2 |
    SDRCON_ENBL ::
SDRCON = xr0;;

/* Initialize the buffer with incremental data */
LC0 = N:: /* Initialize loop counter 0 */
j0 = tx_data:: /* pointer to buffer */
xr0 = 1:: /* Increment value */
xr1 = 0:: /* Initial value in buffer */

_Fill_Tx_Data:
/* Fills the source buffer with data to DMA */
xr1 = r1 + r0:: /* Increment initial value with increment value */
[j0+=1] = xr1:: /* Write to buffer */
if NLCOE, jump _Fill_Tx_Data:: /* If loop counter has not expired then repeat */

_Setup_DMA_interrupt_Vectors:
j0 = j31 + _DMA0_ISR::
IVDMA0 = j0:: /* Set DMA Channel 0 Interrupt Vector */
j0 = j31 + _DMA1_ISR::
IVDMA1 = j0:: /* Set DMA Channel 1 Interrupt Vector */
DMA Examples

_Unmask_DMA_Interrupts:
/* Unmasks DMA channel 0 and 1 Interrupts without altering previously masked or unmasked interrupts */
  xr0 = IMASKH;;
  xr1 = INT_DMA0 | INT_DMA1;;
  xr0 = r0 or r1;;
  IMASKH = xr0;;

_Enable_Hardware_Interrupts:
/* Enable the Hardware Interrupts Enable bit in SQCTL */
  SQCTLST = SQCTL_GIE;;

_Load_DMA0_TCBs:
/* Load values into DMA0 transmit and receive TCBs */
  j0 = DC0_Source_TCB;; /* Load pointer to TCB data */
  xr3:0 = q[j0+j31];
  j0 = DC0_Destination_TCB;; /* Load pointer to TCB data */
  yr3:0 = q[j0+j31];;

  /* Only once both source and destination TCBs are loaded will the dma then commence */
  DCS0 = xr3:0;; /* Load DMA0 source TCB */
  DCD0 = yr3:0;; /* Load DMA0 destination TCB */

_Enable_Flag_Out:
/* Enable FLAG2 and FLAG3 as outputs */
  FLAGREGST = FLAGREG_FLAG2_EN | FLAGREG_FLAG3_EN;;

_Done:
/* Endless loop */
  idle;nop;nop;nop;:
  jump _Done;nop;nop;nop;;

_DMA0_ISR:
Direct Memory Access

/* Interrupt service routine for DMA Channel 0 Interrupt */
j0 = j31 + register_store;;
q[j0+=4] = xr3:0;; /* Store registers used in ISR to memory */
q[j0+=4] = yr3:0;; /* Store registers used in ISR to memory */

/*****************************/
/* Add main function of ISR here */
/*****************************/
xr1 = flagreg;;
xr2 = btgl r1 by FLAGREG_FLAG2_OUT_P;;
flagreg = xr2;; /* Toggle Flag2 LED */
j0 = DC1_Source_TCB;; /* Load pointer to TCB data */
xr3:0 = q[j0+j31];
j0 = DC1_Destination_TCB;; /* Load pointer to TCB data */
yr3:0 = q[j0+j31];;

/* Only once both source and destination TCBs are loaded will the
dma then commence */
DCS1 = xr3:0;; /* Load DMA1 source TCB */
DCD1 = yr3:0;; /* Load DMA1 destination TCB */

j0 = j31 + register_store;;
xr3:0 = q[j0+=4];; /* Restore registers previously saved to mem-
ory */
yr3:0 = q[j0+=4];; /* Restore registers previously saved to mem-
ory */
RTI(ABS);;

_DMA1_ISR:
/* Interrupt service routine for DMA Channel 1 Interrupt */
j0 = j31 + register_store_2;;
q[j0+=4] = xr3:0;;
DMA Examples

Listing 7-2. Example Code for Chained External Port DMA Transfers With Interrupts

```assembly
#include "defts201.h"
#define N 1024
#define MODIFY 4

.section data1a;
.var chain_flag = 0x0; /* Used to check if DMA channel 1 chain has started */
.align 4;
.var register_store[8]; /* Internal memory used to store registers within ISR */
.align 4;
.var register_store_2[4]; /* Internal memory used to store registers within ISR */
```

Direct Memory Access

/* Each of the following TCBs CHPT field points to it's own address. Thus the exact same DMA transfer is repeated endlessly */
.align 4;
.var DC0_Source_TCB[4] =
    tx_data, /* DI register */
    N<<16 | MODIFY, /* DX register */
    0x0, /* DY register */
    TCB_INTMEM | TCB_QUAD | TCB_INT | /* DP register */
    TCB_CHAIN | DC0_Source_TCB >> 2;
.align 4;
.var DC0_Destination_TCB[4] =
    ext_data, /* DI register */
    N<<16 | MODIFY, /* DX register */
    0x0, /* DY register */
    TCB_EXTMEM | TCB_QUAD | TCB_INT | /* DP register */
    TCB_CHAIN | DC0_Destination_TCB >> 2;
.align 4;
.var DC1_Source_TCB[4] =
    ext_data, /* DI register */
    N<<16 | MODIFY, /* DX register */
    0x0, /* DY register */
    TCB_EXTMEM | TCB_QUAD | TCB_INT | /* DP register */
    TCB_CHAIN | DC1_Source_TCB >> 2;
.align 4;
.var DC1_Destination_TCB[4] =
    rx_data, /* DI register */
    N<<16 | MODIFY, /* DX register */
    0x0, /* DY register */
    TCB_INTMEM | TCB_QUAD | TCB_INT | /* DP register */
    TCB_CHAIN | DC1_Destination_TCB >> 2;
.section data2a;
DMA Examples

.var tx_data[N]; /* Buffer containing data to transmit to external memory */
.var rx_data[N]; /* Buffer to store data read from external memory */

.section sdram0:
.var ext_data[N]; /* Buffer in external memory where DMA data is written to and read from */

.section program:
Initialize_Bus_SDRAM: /* Initialises SYSCON and SDRCON */
xr0 = SYSCON_MP_WID64 | SYSCON_MEM_WID64 |
   SYSCON_MSH_PIPE2 | SYSCON_MSH_WT0 | SYSCON_MSH_IDLE |
   SYSCON_MS1_PIPE1 | SYSCON_MS1_WT0 | SYSCON_MS1_IDLE |
   SYSCON_MS0_SLOW | SYSCON_MS0_WT3 | SYSCON_MS0_IDLE;:
SYSCON = xr0;:

xr0 = SDRCON_INIT | SDRCON_RAS2PC5 | SDRCON_PC2RAS2 |
   SDRCON_REF3700 | SDRCON_PG256 | SDRCON_CLAT2 |
   SDRCON_ENBL;:
SDRCON = xr0;:

/* Initialize the buffer with incremental data */
.LCO = N;:: /* Initialize loop counter 0 */
j0 = tx_data:: /* pointer to buffer */
xr0 = 1;:: /* Increment value */
xr1 = 0;:: /* Initial value in buffer */

_FILL_Tx_Data:
/* Fills the source buffer with data to DMA */
xr1 = r1 + r0;:: /* Increment initial value with increment value */
[j0+=1] = xr1;:: /* Write to buffer */
Direct Memory Access

if NLCOE, jump _Fill_Tx_Data;: /* If loop counter has not expired then repeat */

_Setup_DMA_interrupt_Vectors:
  j0 = j31 + _DMA0_ISR;;
  IVDMA0 = j0;; /* Set DMA Channel 0 Interrupt Vector */
  j0 = j31 + _DMA1_ISR;;
  IVDMA1 = j0;; /* Set DMA Channel 1 Interrupt Vector */

_Unmask_DMA_Interrupts:
/* Unmasks DMA channel 0 and 1 Interrupts without altering previously masked or unmasked interrupts */
  xr0 = IMASKH;;
  xr1 = INT_DMA0 | INT_DMA1;;
  xr0 = r0 or r1;;
  IMASKH = xr0;;

_Enable_Hardware_Interrupts:
/* Enable the Hardware Interrupts Enable bit in SQCTL */
  SQCTLST = SQCTL_GIE;;

_Load_DMA0_TCBs:
/* Load values into DMA0 transmit and receive TCBs */
  j0 = DC0_Source_TCB;; /* Load pointer to TCB data */
  xr3:0 = q[j0+j31];
  j0 = DC0_Destination_TCB;; /* Load pointer to TCB data */
  yr3:0 = q[j0+j31];

/* We only need to write to these TCBs once as the TCB chain points to itself resulting in an endless DMA chain. Only once both source and destination TCBs are loaded will the dma then commence. */
  DCS0 = xr3:0;; /* Load DMA0 source TCB */
  DCD0 = yr3:0;; /* Load DMA0 destination TCB */
DMA Examples

_Enable_Flag_Out:
/* Enable FLAG2 and FLAG3 as outputs */
FLAGREGST = FLAGREG_FLAG2_EN | FLAGREG_FLAG3_EN;

_Done:
/* Endless loop */
nop;nop;nop;nop;;
jump _Done;nop;nop;nop;;

_DMA0_ISR:
/* Interrupt service routine for DMA Channel 0 Interrupt */
j0 = j31 + register_store;;
q[j0+=4] = xr3:0;; /* Store registers used in ISR to memory */
q[j0+=4] = yr3:0;; /* Store registers used in ISR to memory */
/******************************************/
/* Add main function of ISR here */
/******************************************/
xr1 = flagreg;;
xr2 = btgl r1 by FLAGREG_FLAG2_OUT_P;;
flagreg = xr2;; /* Toggle Flag2 LED */

j0 = chain_flag;; /* Load pointer to chain_flag */
xr0 = [j0+j31];; /* Load value of chain_flag */
xr1 = 1;;
r0 = r1-r0;; /* Check if equal zero */
if nxaeq, jump _Start_DMA1_Chain;; /* If not equal to zero then load TCBs */

j0 = j31 + register_store;;
xr3:0 = q[j0+=4];; /* Restore registers previously saved to memory */
yr3:0 = q[j0+=4];; /* Restore registers previously saved to memory */
Direct Memory Access

RTI(ABS);;

_Start_DMA1_Chain:
/
  If the chain_flag is not set then we need to start the DMA for
  DMA Channel 1, we only need to write to these TCBs once as the TCB
  chain points to itself resulting in an endless DMA chain. */
  xr0 = 1;; /* Once we have reached this point we need to set the
  chain_flag */
  [j0 + j31] = xr0;;
  j0 = DC1_Source_TCB;; /* Load pointer to TCB data */
  xr3:0 = q[j0+j31];;
  j0 = DC1_Destination_TCB;; /* Load pointer to TCB data */
  yr3:0 = q[j0+j31];;

  /* Only once both source and destination TCBs are loaded will the
  dma then commence */
  DCS1 = xr3:0;; /* Load DMA1 source TCB */
  DCD1 = yr3:0;; /* Load DMA1 destination TCB */
  j0 = j31 + register_store;;
  xr3:0 = q[j0+=4];; /* Restore registers previously saved to mem-
  ory */
  yr3:0 = q[j0+=4];; /* Restore registers previously saved to mem-
  ory */
  RTI(ABS);;

_DMA1_ISR:
/
  Interrupt service routine for DMA Channel 1 Interrupt */
  j0 = j31 + register_store_2;;
  q[j0+=4] = xr3:0;;

  /*******************************
  */ Add main function of ISR here */
  /*******************************
  xr1 = flagreg;;
DMA Examples

xr2 = btgl r1 by FLAGREG_FLAG3_OUT_P;
flagreg = xr2;; /* Toggle Flag3 LED */

j0 = j31 + register_store_2;;
xr3:0 = q[j0+4];;
RTI(ABS);;

Listing 7-3. Example Code for Cross Channel Link Port DMA Transfers With Interrupts

/*****************************/
/*
/* Cross Channel Link Port DMA transfers with interrupts */
/*
/*****************************/
#include "defts201.h"
#define N 1024
#define MODIFY 4

.section data1a;
.align 4:
.var register_store[4]; /* Internal memory used to store registers within ISR */
.align 4:
.var register_store_2[4]; /* Internal memory used to store registers within ISR */
.align 4:
.var register_store_3[4]; /* Internal memory used to store registers within ISR */
.align 4:
.var register_store_4[4]; /* Internal memory used to store registers within ISR */
.align 4:
Direct Memory Access

.var register_store_5[4]: /* Internal memory used to store registers within ISR */

/* Each of the following TCBs CHPT field points to the next link port TCB storage address thus cross channeling a DMA transfer from link port 0 to link port 1 to link port 2 and finally onto link port 3. The final link port 3 TCB does not have chaining enabled and thus the chain stops after the transfer through all four link ports has completed. */

.align 4;
.var DC4_Source_TCB[4] =
    tx_data, /* DI register */
    N<<16 | MODIFY, /* DX register */
    0x0, /* DY register */
    TCB_INTMEM | TCB_QUAD | TCB_INT | /* DP register */
    TCB_CHAIN | DC5_Source_TCB >> 2 |
    TCB_DMA5DEST;
.align 4;
.var DC5_Source_TCB[4] =
    tx_data, /* DI register */
    N<<16 | MODIFY, /* DX register */
    0x0, /* DY register */
    TCB_INTMEM | TCB_QUAD | TCB_INT | /* DP register */
    TCB_CHAIN | DC6_Source_TCB >> 2 |
    TCB_DMA6DEST;
.align 4;
.var DC6_Source_TCB[4] =
    tx_data, /* DI register */
    N<<16 | MODIFY, /* DX register */
    0x0, /* DY register */
    TCB_INTMEM | TCB_QUAD | TCB_INT | /* DP register */
    TCB_CHAIN | DC7_Source_TCB >> 2 |
    TCB_DMA7DEST;
DMA Examples

.align 4;
.var DC7_Source_TCB[4] =
    tx_data, /* DI register */
    N<<16 | MODIFY, /* DX register */
    0x0, /* DY register */
    TCB_INTMEM | TCB_QUAD | TCB_INT, /* DP register */

.section data2a;
.var tx_data[N]; /* Buffer containing data to transmit to link ports */

.section program;
/* Initialize the buffer with incremental data */
LC0 = N;; /* Initialize loop counter 0 */
j0 = tx_data;; /* pointer to buffer */
xr0 = 1;; /* Increment value */
xr1 = 0;; /* Initial value in buffer */

.Fill_Tx_Data:
/* Fills the source buffer with data to DMA */
xr1 = r1 + r0;; /* Increment initial value with increment value */
[j0+=1] = xr1;; /* Write to buffer */
if NLCOE, jump _Fill_Tx_Data;; /* If loop counter has not expired then repeat */

.Setup_DMA_interrupt_Vectors:
j0 = j31 + _DMA4_ISR;;
IVDMA4 = j0;; /* Set DMA Channel 4 Interrupt Vector */
j0 = j31 + _DMA5_ISR;;
IVDMA5 = j0;; /* Set DMA Channel 5 Interrupt Vector */
j0 = j31 + _DMA6_ISR;;
IVDMA6 = j0;; /* Set DMA Channel 6 Interrupt Vector */
j0 = j31 + _DMA7_ISR;;
Direct Memory Access

IVDMA7 = j0;; /* Set DMA Channel 7 Interrupt Vector */

j0 = j31 + _HWERR_ISR;;

IVHW = j0;; /* Set Hardware Error Interrupt Vector */

_Unmask_DMA_Interrupts:
/* Unmasks DMA channels for all link port transmit interrupts
without altering previously masked or unmasked interrupts. Also
the HW interrupt in the event that an active DMA channel is writ-
ten to with an active value */

xr0 = IMASKH;;
xr1 = INT_DMA4 | INT_DMA5 | INT_DMA6 | INT_DMA7 | INT_HWERR;;
xr0 = r0 or r1;;
IMASKH = xr0;;

_Enable_Hardware_Interrupts:
/* Enable the Hardware Interrupts Enable bit in SQCTL */

SQCTLST = SQCTL_GIE;;

_Set_Up_Linkports:
/* Set link port transmission characteristics */

xr0 = LTCTL_TCLKDIV2 | LTCTL_TDSIZE | LTCTL_TEN;;
LTCTL0 = xr0;;
LTCTL1 = xr0;;
LTCTL2 = xr0;;
LTCTL3 = xr0;;

_Load_DMA4_TCB:
/* Load values into DMA0 transmit and receive TCBs */

j0 = DC4_Source_TCB;; /* Load pointer to TCB data */
xr3:0 = q[j0+j31];;
DMA Examples

/* We only need to write to the TCB once as the TCB chain points to the next TCB for the next link port resulting in a DMA chain transitioning from link port 0 to link port 1 to link port 2 to link port 3 */
DC4 = xr3:0;; /* Load DMA0 source TCB */

_Enable_Flag_Out:
/* Enable FLAG2 and FLAG3 as outputs */
FLAGREGST = FLAGREG_FLAG2_EN | FLAGREG_FLAG3_EN;;

_Done:
/* Endless loop */
nop;nop;nop;nop;;
jump _Done:nop;nop;nop;;

_DMA4_ISR:
/* Interrupt service routine for DMA Channel 4 Interrupt */
j0 = j31 + register_store;;
q[j0+=4] = xr3:0;; /* Store registers used in ISR to memory */

 till here
/* Add main function of ISR here */
 till here
xr1 = flagreg;;
xr2 = btgl r1 by FLAGREG_FLAG2_OUT_P;;
flagreg = xr2;; /* Toggle Flag2 LED */

 till here
j0 = j31 + register_store;;
xr3:0 = q[j0+=4]:: /* Restore registers previously saved to memory */
RTI(ABS);;

_DMA5_ISR:
/* Interrupt service routine for DMA Channel 5 Interrupt */
j0 = j31 + register_store_2;;
q[j0+=4] = xr3:0;; /* Store registers used in ISR to memory */

/***********************/
/* Add main function of ISR here */
/***********************/
xr1 = flagreg;;
xr2 = btgl r1 by FLAGREG_FLAG2_OUT_P;;
flagreg = xr2;; /* Toggle Flag2 LED */

j0 = j31 + register_store_2;;
xr3:0 = q[j0+=4];; /* Restore registers previously saved to memory */
RTI(ABS);;

_DMA6_ISR:
/* Interrupt service routine for DMA Channel 6 Interrupt */
j0 = j31 + register_store_3;;
q[j0+=4] = xr3:0;; /* Store registers used in ISR to memory */

/***********************/
/* Add main function of ISR here */
/***********************/
xr1 = flagreg;;
xr2 = btgl r1 by FLAGREG_FLAG2_OUT_P;;
flagreg = xr2;; /* Toggle Flag2 LED */

j0 = j31 + register_store_3;;
xr3:0 = q[j0+=4];; /* Restore registers previously saved to memory */
RTI(ABS);;

_DMA7_ISR:
/* Interrupt service routine for DMA Channel 7 Interrupt */
DMA Examples

j0 = j31 + register_store_4;;
q[j0+=4] = xr3:0;; /* Store registers used in ISR to memory */

/**********************************
/* Add main function of ISR here */
/**********************************
xr1 = flagreg;;
xr2 = btgl r1 by FLAGREG_FLAG2_OUT_P;;
flagreg = xr2;; /* Toggle Flag2 LED */

j0 = j31 + register_store_4;;
xr3:0 = q[j0+=4];; /* Restore registers previously saved to memory */
    RTI(ABS);;

_HWERR_ISR:
/* Interrupt service routine for Hardware Error Interrupt */
j0 = j31 + register_store_5;;
q[j0+=4] = xr3:0;; /* Store registers used in ISR to memory */

/**********************************
/* Add main function of ISR here */
/**********************************
xr1 = flagreg;;
xr2 = btgl r1 by FLAGREG_FLAG3_OUT_P;;
flagreg = xr2;; /* Toggle Flag3 LED */

j0 = j31 + register_store_5;;
xr3:0 = q[j0+=4];; /* Restore registers previously saved to memory */
    RTI(ABS);;
The TigerSHARC processor can be used as a single processor or as one element in a multiprocessor system. The processor can act as a standalone TigerSHARC processor or system, or it can be controlled by a host computer (for example, the TigerSHARC processor system can be an add-in board on a PC). The system architecture is flexible, and can be implemented according to the application requirements. The software features for the different options lie within the TigerSHARC processor. This chapter focuses on the external bus interface of the TigerSHARC processor, which includes the bus arbitration logic and the external address, data, and control buses.

The fastest protocol is the pipelined protocol. The TigerSHARC processor uses this protocol to interface with other TigerSHARC processors within a cluster. The TigerSHARC processor can also use this protocol to interface with host and intelligent memory systems. The protocol has a peak performance of one data transfer every bus clock cycle and can sustain a throughput close to this peak performance because there is no restriction of address range for continuous flow in full throughput.

Another fast protocol is the SDRAM protocol. This protocol is defined by industry-standard SDRAM chips. The TigerSHARC processor has an on-chip SDRAM controller that drives all the SDRAM control signals (RAS, CAS, SDWE, SDCKE, LOOM, and HDOM) and handles the initialization and refresh of the SDRAM chips. The peak data throughput to the SDRAM is one data transfer every clock cycle. The throughput can be kept close to the maximum if sequential accesses are kept in the same page. This restriction is also applicable to block transfers by DMA. The overhead of a single access to SDRAM is high.
The TigerSHARC processor also supports slow device protocol. This protocol should be used for non-performance critical devices. In most systems, these devices should be connected on a secondary bus, since it may increase the load on the bus and slow the more critical accesses. It is possible, however, to put the slower devices directly on the external bus.

The external bus accommodates both 32-bit and 64-bit data bus widths and 32-bit address and control signals. Most signals are bidirectional since the TigerSHARC processor can be either the master or the slave on the external bus.

The external bus interface unit can be configured to many different setups by writing to the different external port control registers—SYSCON and SDRCON (see “External Bus Interface Register Groups” on page 2-72). These two registers can be initialized any time during system operation or they can be restricted to be written only once. The control for this write feature depends on the setting of the strap pin SYS_REG_WE on reset. The setup of these two registers should be identical in all TigerSHARC processors in a given system. A broadcast write is recommended when initializing the SYSCON and SDRCON registers.

The SYSCON and SDRCON registers can always be written in emulation mode.

The external bus includes:

- Bus width: 64- or 32-bits, configured separately for memory, multiprocessing or host interface
- Pipelined transactions with a programmable number of pipeline stages
- Programmable IDLE states
External Port and SDRAM Interface

- Protocol supporting wait cycles inserted by the target slave, using the ACK pin
- EPROM and FLASH interface: 8-bit data bus with a fixed number of wait cycles, read or write
- Host interface
- SDRAM interface: no wait cycles required
- Support for slow I/O devices
- Glueless multiprocessing with other TigerSHARC processors, based on distributed bus arbitration
- Support of DMA transactions for external I/O devices through handshake mode
- Support for flyby mode transactions between SDRAM memory and I/O in DMA
- Support for cluster bus arbitration (response only) while processor is in the software reset state.

Bus Interface I/O Pins

This section presents external bus interface unit (EBIU) I/O pins in Table 8-1 on page 8-4. Some related information sources on the EBIU pins are:

- Multiprocessing interface pins in Table 8-3 on page 8-35
- SDRAM interface I/O pins in Table 8-4 on page 8-55.
- External port data alignment in Figure 8-24 on page 8-64 and Figure 8-23 on page 8-60
## Table 8-1. Pin Definitions – External Port Bus Controls

<table>
<thead>
<tr>
<th>Signal</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>ADDR31–0</td>
<td>Address Bus. The DSP issues addresses for accessing memory and peripherals on these pins. In a multiprocessor system, the bus master drives addresses for accessing internal memory or I/O processor registers of other ADSP-TS201s. The DSP inputs addresses when a host or another DSP accesses its internal memory or I/O processor registers.</td>
</tr>
<tr>
<td>DATA63–0</td>
<td>External Data Bus. The DSP drives and receives data and instructions on these pins. Pullup/down resistors on unused DATA pins are unnecessary.</td>
</tr>
<tr>
<td>RD</td>
<td>Memory Read. RD is asserted whenever the DSP reads from any slave in the system, excluding SDRAM. When the DSP is a slave, RD is an input and indicates read transactions that access its internal memory or universal registers. In a multiprocessor system, the bus master drives RD. RD changes concurrently with ADDR pins.</td>
</tr>
<tr>
<td>WRL</td>
<td>Write Low. WRL is asserted in two cases: When the ADSP-TS201 writes to an even address word of external memory or to another external bus agent; and when the ADSP-TS201 writes to a 32-bit bus (host, memory, or DSP programmed to 32-bit bus). An external master (host or DSP) asserts WRL for writing to a DSP’s low word of internal memory. In a multiprocessor system, the bus master drives WRL. WRL changes concurrently with ADDR pins. When the DSP is a slave, WRL is an input and indicates write transactions that access its internal memory or universal registers.</td>
</tr>
<tr>
<td>WRH</td>
<td>Write High. WRH is asserted when the ADSP-TS201 writes a long word (64 bits) or writes to an odd address word of external memory or to another external bus agent on a 64-bit data bus. An external master (host or another DSP) must assert WRH for writing to a DSP’s high word of 64-bit data bus. In a multiprocessor system, the bus master drives WRH. WRH changes concurrently with ADDR pins. When the DSP is a slave, WRH is an input and indicates write transactions that access its internal memory or universal registers.</td>
</tr>
<tr>
<td>ACK</td>
<td>Acknowledge. External slave devices can de-assert ACK to add wait states to external memory accesses. ACK is used by I/O devices, memory controllers and other peripherals on the data phase. The DSP can de-assert ACK to add wait states to read and write accesses of its internal memory. The pullup is 50 Ω on low-to-high transactions and is 500 Ω on all other transactions.</td>
</tr>
<tr>
<td>BMS</td>
<td>Boot Memory Select. BMS is the chip select for boot EPROM or flash memory. During reset, the DSP uses BMS as a strap pin (EBOOT) for EPROM boot mode. In a multiprocessor system, the DSP bus master drives BMS.</td>
</tr>
</tbody>
</table>
External Port and SDRAM Interface

Table 8-1. Pin Definitions – External Port Bus Controls (Cont’d)

<table>
<thead>
<tr>
<th>Signal</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>MS1–0</td>
<td>Memory Select. MS0 or MS1 is asserted whenever the DSP accesses memory banks 0 or 1 respectively. MS1–0 are decoded memory address pins that change concurrently with ADDR pins. When ADDR31–27 = 0b00110, MS0 is asserted. When ADDR31–27 = 0b00111, MS1 is asserted. In multiprocessor systems, the master DSP drives MS1–0.</td>
</tr>
<tr>
<td>MSH</td>
<td>Memory Select Host. MSH is asserted whenever the DSP accesses the host address space (ADDR31 = 0b1). MSH is a decoded memory address pin that changes concurrently with ADDR pins. In a multiprocessor system, the bus master DSP drives MSH.</td>
</tr>
<tr>
<td>BRST</td>
<td>Burst. The current bus master (DSP or host) asserts this pin to indicate that it is reading or writing data associated with consecutive addresses. A slave device can ignore addresses after the first one and increment an internal address counter after each transfer. For host-to-DSP burst accesses, the DSP increments the address automatically while BRST is asserted.</td>
</tr>
</tbody>
</table>

Processor Microarchitecture

The TigerSHARC processor functions differently depending on the type of system in which it is used. It can perform as a single processor or in a multiprocessor system on a common external bus (as shown in Figure 8-1 on page 8-6). There may be a host (or host interface) in the system as well, where the arbitration between multiple TigerSHARC processors and between TigerSHARC processors and the host is accomplished using a distributed arbitration logic on the TigerSHARC processors themselves.

In a multiprocessor system, each TigerSHARC processor must be identified by its own ID. The ID is set by three input pins that are connected to a constant value. There must always be a single processor in a system with ID=000 because the ID0 processor is required to initialize any external SDRAM memory. Also, the ID0 processor has internal pull-up and pull-down resistors that are active during and after reset. For more infor-
Figure 8-1. Typical Multiprocessing Cluster Configuration
External Port and SDRAM Interface

mation on these features, see the *ADSP-TS201 TigerSHARC Embedded Processor Data Sheet*. For more information, see “Multiprocessing Interface” on page 8-34.

Other agents defined on the external bus can be memory chips, I/O devices, or bridges to other buses. Additionally, every TigerSHARC processor and the host in the system can be accessed as slaves.

Transaction on the external bus can be word (32-bits), long (64-bits) or quad (128-bits) length. The address must be aligned to its size—that is, quad-word transactions must be quad-word aligned and long-word must be long-word aligned. Since the bus width (32- or 64-bits) may be smaller than the transaction size, a transaction may be spread over a few external bus cycles in order to be transferred in its entirety.

As shown in Figure 8-2, the external port (EBIU) can be a bus master or a bus slave on the SOC bus. It is important to note that EBIU register accesses must cross a number of clock domains when reading or writing register data. Corresponding response for EBIU register accesses may vary with SOC bus loading. For more information on clock domains, see “SOC Interface” on page 3-1.

External port input FIFO (IFIFO) refers to the TigerSHARC processor EBIU input FIFO. It is used for all:

- Externally-supplied data by bus masters (other TigerSHARC processors or a host processor)
- External reads (when the TigerSHARC processor is the external bus master)

The IFIFO also holds on-chip destination addresses and data attributes (when the TigerSHARC processor is a bus slave).
External port output FIFO (OFIFO) refers to the TigerSHARC processor BIU output FIFO. It is used for all outgoing external port addresses, data, and transaction control signals, including DMA transfers to and from external address space.
External Port and SDRAM Interface

The source of the cluster bus transaction can be either from the processor core (instruction fetch or operand access) or the DMA controller. In order to optimize the transactions flow, it is important to have a general understanding of the BIU. The EBIU data transfers are illustrated in Figure 8-3 on page 8-10.

An external transaction is initiated by the internal master (DMA or core). The transaction is strobed by the OFIFO and, in turn, is driven on the external bus. If the transaction is a write transaction, the transaction is completed at this point. If it is a read transaction, the data read on the external transaction is strobed in the IFIFO and is written later to its target (defined by the original internal transaction).

Transactions can be initiated on the external bus and driven into internal TigerSHARC processor elements. For example, memory and Uregs can be read and written by an external bus master—another TigerSHARC processor or host. The transaction is identified according to the memory map (see “Multiprocessor Space” on page 2-6) and processor ID, strobed by the IFIFO, and driven on the internal bus in turn. If the external transaction is read, the data is returned to the master that initiated the transaction through the slave’s OBUF.

On DMA transactions (reads or writes), addresses are written to the OFIFO through a dedicated address bus. All external transfers use the system on chip (SOC) interface.

The internal S-bus is used for both external and internal SOC interface transactions, where the S-bus can execute a new transaction every cycle. The S-bus provides the interface between the SOC interface and memory. (See Figure 8-2 on page 8-8.) The cause of a transaction can be an instruc-
Figure 8-3. External Bus Interface Unit (EBIU) Data Transfers
External Port and SDRAM Interface

A transaction fetch, a DMA transfer, a load or store instruction, or an IFIFO transaction (internal address accessed by an external master or data returned from an external read).

On previous TigerSHARC processors, the buses are associated with memory blocks, rather than the buses being associated with functional blocks. This difference leads to performance differences in applications that are optimized for previous TigerSHARC processors, rather than those optimized for the ADSP-TS201.

The processor uses the following priority order when arbitrating between two (or more) internal buses making simultaneous accesses to the same internal memory block. Each transaction source can come from a bus or the bus’s FIFO. In the following priority order, a FIFO indicates a transaction (held in the FIFO) that was not acknowledged by the memory block on the first access, while a bus represents the first access itself.

1. S-bus FIFO (SOC interface FIFO, high priority)
2. S-bus (SOC interface, high priority)
3. J-bus/K-bus FIFO
4. J-bus
5. K-bus
6. S-bus FIFO (SOC interface FIFO, low priority)
7. S-bus (SOC interface, low priority)
8. I-bus (instruction fetch FIFO)
9. I-bus (instruction fetch)
Processor Microarchitecture

A transaction from SOC interface is high priority in the following cases:

- Read transaction
- DMA transaction marked as high priority (TCB was high)
- When there are at least two entries in the SOC-interface IFIFO
- When a single transaction in the IFIFO has waited there for 16 CCLK cycles

The processor uses the following priority order when arbitrating between two (or more) internal masters making simultaneous accesses to the SOC bus.

1. High priority cluster bus interface transactions (highest)
2. High priority DMA transactions
3. SOC interface transactions
4. Low priority cluster bus transactions
5. Low priority DMA transactions (lowest)

The DMA request priority is determined by the source TCB priority bit (see “DPx Register” on page 7-19).

The cluster bus interface is high priority in the following cases.

- The IFIFO has more than two waiting transactions
- A write transaction has been waiting unserved for 10 CCLK cycles
- The IFIFO returns DMA data (from a high-priority DMA)

In all other cases, the IFIFO request has low priority.
SYSCON Register Programming

The SYSCON register, described in “System Configuration (SYSCON) Register” on page 2-73, is the system configuration register and must be programmed before bus traffic begins (if default values need to be changed). In User mode, SYSCON is write once after reset, unless the SYS_REG_WE strap pin is enabled. In Supervisor mode, SYSCON has no write restrictions.

For each of the banks there are mode bits—the fields are shown in Figure 2-35 on page 2-74 and Figure 2-36 on page 2-75. The setup of these fields is orthogonal—each of the fields can be programmed to a different value.

Bus Width

The bus width is configured separately for memory access (using MS1–0 or MSSD3–0), host—master or slave—transactions (using MSH), and multiprocessing memory access (using multiprocessor addresses). For a 32-bit bus, the HME bits =000, and for a 64-bit bus, the HME bits =111. Note that if either the host or memory bus width is 64-bits, the multiprocessing width must also be 64-bits. The valid width settings are shown in Table 8-2.
SYSCON Register Programming

Table 8-2. Host, Multiprocessor, and External Bus Width Settings

<table>
<thead>
<tr>
<th>SYSCON Bits</th>
<th>Bus Width</th>
</tr>
</thead>
<tbody>
<tr>
<td>(H) (M) (E)</td>
<td></td>
</tr>
<tr>
<td>0 0 0</td>
<td>32-bit wide Host, Multiprocessor, and External memory bus</td>
</tr>
<tr>
<td>0 0 1</td>
<td>Illegal</td>
</tr>
<tr>
<td>0 1 0</td>
<td>32-bit wide Host and External memory bus</td>
</tr>
<tr>
<td></td>
<td>64-bit wide Multiprocessor bus</td>
</tr>
<tr>
<td>0 1 1</td>
<td>32-bit wide Host bus</td>
</tr>
<tr>
<td></td>
<td>64-bit wide Multiprocessor and External memory bus</td>
</tr>
<tr>
<td>1 0 0</td>
<td>Illegal</td>
</tr>
<tr>
<td>1 0 1</td>
<td>Illegal</td>
</tr>
<tr>
<td>1 1 0</td>
<td>64-bit wide Host and Multiprocessor bus</td>
</tr>
<tr>
<td></td>
<td>32-bit wide External memory bus</td>
</tr>
<tr>
<td>1 1 1</td>
<td>64-bit wide Host, Multiprocessor, and External memory bus</td>
</tr>
</tbody>
</table>

Slow Device Protocol for Bus

For slow device protocol setup, the slow protocol bit (Bit5 for bank 0, Bit11 for bank 1, and Bit17 for host) is set. Additionally, the pipeline depth field is set to \(0b00\) and the \(\text{IDLE}\) bit is a “don’t care” bit—it can be programmed either way. The internal wait field identifies the number of internal wait cycles on slow accesses. If the number of internal waits is zero, the external wait mechanism cannot be used for these transactions.

Pipelined Protocol for Bus

For pipelined protocol setup, the slow protocol bit is cleared and the pipeline depth field is set to the required pipeline depth (\(0b00\) for one cycle, \(0b01\) for two cycles, \(0b10\) for three cycles, and \(0b11\) for four cycles). This defines the pipeline depth for read transactions only; pipeline depth for write transactions is always one. The \(\text{IDLE}\) bit is set if there are multiple
External Port and SDRAM Interface

Slaves in the address range of this bank, in order to prevent contention on the data bus on transitions between different slaves on consecutive reads. If this bank contains a single slave, the IDLE bit should be cleared. The internal wait field is irrelevant for pipelined transactions. In pipelined protocol, there is no internal wait state programming.

For systems using SBSRAM and pipelined protocol, it is important to note that only flow through ZBT SBRAM can be gluelessly connected to cluster bus.

Initial Value for Bus Operation

The initial value after reset, which can be used as the default, uses a 32-bit bus width for host, multiprocessing and memory access and uses the following bus configuration:

- Bank 0: slow protocol, three wait states, idle insert
- Bank 1: pipelined protocol, one cycle pipe depth, no wait states, idle insert
- Host: pipelined protocol, two cycle pipe depth, no wait states, idle insert

Pipelined Protocol Interface

The TigerSHARC processor uses the pipelined protocol to interface with other TigerSHARC processors, the host, and fast synchronous memories. The transaction is also performed by the pipelined protocol when the TigerSHARC processor is accessed as a slave.

This protocol was created for pipelining the transactions with a throughput of one datum per cycle, although the latency of the transaction can be up to four cycles. The address and controls of a transaction are issued on the address cycle and the data is transferred a few cycles later—in the case
Pipelined Protocol Interface

of TigerSHARC processor, one to four cycles depending on the transac-
tion direction and system configuration programming. The TigerSHARC
processor can issue an address of a new transaction every cycle and doesn’t
need to wait for the data cycle of the first transaction before beginning the
address cycle of the new transaction.

Control Signals for Pipelined Transactions

The control signals used in pipelined transactions are listed below.

- **RD** – the transaction is read.
- **WRH** and **WRL** – the transaction is written if one of these is active.
The two signals also indicate which word on the data bus is valid.
- **BRST** – Indicates the next cycle belongs to the same transaction as
  the current cycle—for example, when accessing a quad-word on a
  bus width of 32-bits, **BRST** is active on the first three cycles of the
  four cycle transaction.
- **ACK** – Driven by the target slave on the data cycle. If asserted, the
  slave is ready to complete the data cycle; otherwise wait cycles are
  generated.

Using Basic Pipelined Transactions

The basic pipelined transaction is shown in Figure 8-4 on page 8-17. In
the address cycle, the address is issued with the **RD** or **WRx** (**WRL**, **WRH** or
both) asserted. The data cycle begins after one to four cycles, as specified
in the target slave configuration and the direction of the transaction.
The delay between the address cycle and the data cycle is the **pipeline depth**. In
the data cycle, the data is transferred according to the transaction direc-
tion. If the slave is ready, it asserts the **ACK** signal and either drives or
strobos the data. If the slave is not ready, it deasserts the **ACK** signal and
delays the data. The exact way the **ACK** signal behaves is discussed in “Wait
Cycles” on page 8-23.
The pipeline depth is configured according to the slave and the transaction type (read or write). For write transactions, the pipeline depth is always 1. For read from another TigerSHARC processor in multiprocessing systems, the pipeline depth is always 4. When reading from external memory banks or host memory space, the pipeline depth is user-programmable and can be up to four cycles. Pipeline depth can be selected individually for each of the banks (bank0, bank1, and host) in the SYSCON register after reset.

Single transactions take a different number of cycles according to transaction size and external bus width:

- Normal word transactions: one cycle
- Long word transactions on 64-bit bus: one cycle
- Long word transactions on 32-bit bus: two cycles
- Quad word transactions on 64-bit bus: two cycles
- Quad word transactions on 32-bit bus: four cycles
Pipelined Protocol Interface

The transaction type is determined by the RD and WR/WRH signals on the address cycle. The WR/WRH signals also determine which words of the 64-bit bus should be written to memory. See the External Port Alignment Figure 8-24 on page 8-64.

Using Burst Pipelined Transactions

The pipeline is effective when used in sequential accesses having the same pipeline depth. Sequential accesses may be the result of one transaction which is too wide for the bus width, or merely distinct pipelined transactions. The first is called burst transaction and is indicated by the BRST pin. The BRST pin is valid on the address cycle.

The BRST signal is used to indicate transaction continuation in pipelined protocol transactions. It is used to qualify consecutive accesses on the system bus and indicates that the next cycle belongs to the same transaction as the current cycle.

The BRST signal is important to the TigerSHARC processor when it is acting as a slave. The BRST signal indicates that multiple transactions that are part of a burst access are to be interpreted as a single transaction. One example can be illustrated in a multiprocessor system where one TigerSHARC processor master attempts to access another TigerSHARC processor slave’s transfer control block (TCB) registers.

Because the TCB registers may only be accessed via quad-word accesses, a multiword transfer over the system bus must be qualified by the BRST signal, so the transfer is properly interpreted as a quad-word access by the slave TigerSHARC processor. Without the BRST signal, this same access is interpreted as four normal word (32-bit) accesses, which result in an illegal TCB register access.

A generic slave device (i.e., a device other than a TigerSHARC processor acting as slave) can use the BRST signal to accept the first address in a burst transfer and then automatically increment that address as successive data words arrive.
External Port and SDRAM Interface

The BRST signal must be connected in a multiprocessor TigerSHARC processor system in order to qualify long-word and quad-word accesses between DSPs. The BRST signal may optionally be connected between a host device and a TigerSHARC processor. If burst accesses (including but not limited to long-word and/or quad-word accesses) between host and TigerSHARC processors are required, then BRST must be connected and used to qualify these accesses for the TigerSHARC processor slave.

Figure 8-5 on page 8-20 and Figure 8-6 on page 8-21 illustrate examples of different types of sequential transactions in a 32-bit bus. Figure 8-5 on page 8-20 shows a quad-word write that is executed in four cycles, where the BRST pin is asserted on the first three cycles. This transaction is followed by a word write, and then two long writes where the BRST is asserted in the first cycle of each transaction. The pipeline depth is one cycle since these are write transactions.

An idle cycle is inserted between transactions to prevent contention between drivers. Figure 8-6 on page 8-21 shows two quad-word read transactions where each transaction takes four bus cycles. The two transactions are separated by an idle cycle. Idle cycles are inserted in the following cases.

- Read followed by read from two different memory banks, different TigerSHARC processors, host, or combination of the above, if the second read’s pipeline depth is smaller or equal to the first read pipeline depth.
- Read followed by read from the same memory bank, and the IDLE bit of this memory bank in the SYSCON register is set.
- Read followed by write or write followed by read, if the second transaction’s pipeline depth is smaller or equal to the first transaction pipeline depth.
- Write followed by write to multiprocessor space.
Write to multiprocess or space followed by write to other slave.

Write or read to-or-from multiprocessor space if the next transaction is to SRAM or host memory. Idle cycles are inserted until the transaction ends. This idle insertion supports SRAMs without ACK control.

If the corresponding IDLE bit is cleared, sequences of write, a sequence of a multicycle transaction, or sequences of reads from the same slave are not separated with an idle cycle.
Sequential accesses with different pipeline depths receive special treatment. If the first transaction has a smaller pipeline depth than the second transaction, the address cycles are issued with no gap, even though there may be a gap between the data cycles. If the pipeline depth of the first transaction is larger than the pipeline depth of the second, the transactions cannot be sequential. As shown in Figure 8-7 on page 8-22, the second transaction address cycle is delayed after the first transaction. The purpose of the delay is to keep the data cycles of the two transactions separate. The delay between the two transactions is the difference in the pipeline depth if no idle cycle is inserted, or the difference between the pipeline depths.
plus one if the conditions for idle insertion are true. If the difference between the pipeline depths is one, an idle cycle is inserted. The second transaction address cycle begins two cycles after the end of the first transaction.

The illustrated sequence in Figure 8-7 assumes a different pipeline depth and a bus width of 32.

Figure 8-7. Read Followed by Read From Different Slave
Wait Cycles

In regular read transactions, if the slave is ready in time for the data cycle of the targeted transactions, the slave asserts the ACK signal in the data cycle. If the slave is not ready, it deasserts the ACK signal on the data cycle and keeps it deasserted until it can continue. ACK also may be deasserted after a write transaction if the slave cannot forward the data to its destination on time. Some important points to know about operation of the ACK signal include:

- ACK is an open drain output. The ACK signal can be driven low by multiple slave processors simultaneously without contention.

- ACK is kept high by an internal 500Ω pull-up of the DSP with ID=000.

- ACK is pulled down by a slave DSP to halt an access.

- ACK is pulled up by an internal 50Ω pull-up of the DSP with ID=000 when slave DSPs stop driving ACK low (low-to-high transitions).

The ACK signal can be issued in response to any data cycle in a multicycle transaction. There is no restriction on its length.

When the ACK signal is deasserted, it stalls all the outstanding transactions, including transactions that are targeted to other slaves. Since the bus is stalled on the cycle following the active ACK cycle, the slave has to sample any new address that the master may issue on the cycle that the ACK signal was deasserted. Figure 8-8 on page 8-25 shows an example of ACK signal usage. In this example, the datum, dal, is not ready in time and is issued
Pipelined Protocol Interface

one cycle later. The \texttt{ACK} signal is deasserted on the \texttt{da1} data cycle and asserted in the next cycle to indicate valid data. The consequences of the wait cycle are listed below.

- The master samples the data one cycle later.
- The address cycle following the \texttt{ACK} signal (\texttt{ab2}) is stretched by one cycle.
- All the slaves that have pending transactions on the cycle following the \texttt{ACK} signal are stalled for the number of cycles that the \texttt{ACK} signal was deasserted. For example, between the \texttt{ab0} address cycle and the \texttt{db0} data cycle there are five cycles, although the pipeline depth is only four.

\begin{itemize}
  \item The address and controls driven on the bus on the cycle of the \texttt{ACK} signal must be latched by the slave.
\end{itemize}

Reads from memory banks that follow a read from multiprocessor space are delayed. The \texttt{ACK} signal is high assuming that memory does not have to drive \texttt{ACK}.

The sequence illustrated in Figure 8-8 is true when the \texttt{ACK} signal is not active and Bus Width = 32.

Write transactions use the \texttt{ACK} signal wait cycles differently from read transactions. Here the slave asserts the \texttt{ACK} signal as long as its write buffer has sufficient free slots. See Figure 8-9 on page 8-26. In this example, the address cycle, \texttt{aa3}, triggers the slave to deassert the \texttt{ACK} signal. This extends the transaction pipeline for two cycles. The extended cycles begin one cycle after the \texttt{ACK} signal was deasserted—for example, \texttt{ab2} and \texttt{db1} are
extended for two additional cycles. All the slaves strobe address ab1, and the target slave of transaction samples datum, db0, in the cycle that the ACK signal is deasserted.

In Figure 8-9 on page 8-26, transaction ab0 and ab1 are targeted to different slaves. If a slave drives the ACK signal low and the following transaction is to a different slave, the target slave of the second transaction may not drive the ACK signal before the first slave has completed the transaction. This applies to the cycle following the cycle in which the ACK signal was high.
The TigerSHARC processor supports a slow device protocol that can be used for simple devices. The slow device protocol can be configured for address spaces belonging to bank0, bank1, or host. The slow device protocol is set by programming the relevant bits in the \textit{SYSCON} register.

- Slow = 1
- Pipeline depth = 0b00 (fixed when slow bit is set)
External Port and SDRAM Interface

- Wait cycles = programmed according to system requirements
- IDLE cycle = 1 (fixed when slow bit is set)

The purpose of this setup is to enable a direct connection to simple and low-performance, non-critical memories or peripherals. The protocol can work with synchronous or asynchronous devices.

The basic protocol, when the configuration is “zero wait cycles”, is shown in Figure 8-10 on page 8-28 and Figure 8-11 on page 8-29. The memory select is asserted to begin the transaction and the address is driven with it. In the next cycle, the RD or WRx signal (according to the transaction type) is asserted. Data is driven by the TigerSHARC processor if the transaction is write. If the transaction is read, the slave can start driving the data. The TigerSHARC processor latches the data at the end of this cycle. For the zero-wait-cycles configuration, wait cycles cannot be inserted by deasserting the ACK signal.

In the slow protocol, the MS0-1 signals act as control signals and not as select signals. The external memory device used with this protocol should be able to latch the data on the first rising edge of either MS0-1 or the WRx signal. One of the key requirements for this device is that it meets the hold time for both address and data. The right number of wait cycles to guarantee enough data setup time is user configurable.
Slow Device Protocol Interface

Figure 8-10. Slow Protocol Write With 0 Wait Cycles
External Port and SDRAM Interface

Figure 8-11. Slow Protocol Read With 0 Wait Cycles
In slow device protocol, wait cycles can be inserted as required. The `SYSCON` register includes a field for the number of internal wait cycles—between zero and three.
External Port and SDRAM Interface

Figure 8-12 on page 8-30 and Figure 8-13 on page 8-31 illustrate slow transactions with one or more wait cycles. This situation is similar to the zero-wait-state transaction, but the second cycle (one cycle after the memory select is asserted) is repeated according to the number of internal wait cycles.

External wait cycles can be inserted by deasserting the ACK signal. This can only be done when at least one internal wait state has been configured. To insert external wait cycles, the ACK signal must be deasserted one cycle or more before the last wait cycle. The extra wait cycles are repeated until one cycle after the ACK signal has been asserted. See Figure 8-14 on page 8-32 for an example.
The TigerSHARC processor can be configured with the EBOOT strap pin during reset to boot from an external 8-bit EPROM. In this case, the program is loaded from the EPROM into internal memory by an automatic

---

Figure 8-14. Slow Protocol Read/Write With External Wait Cycles

---

8-32 ADSP-TS201 TigerSHARC Processor Hardware Reference
External Port and SDRAM Interface

process as part of the reset sequence. The TigerSHARC processor uses DMA channel 0 to load the program. The TigerSHARC processor bus interface packs bytes to 32-bit instructions. EPROM may also be accessed at runtime (after reset) using DMA. The boot EPROM cannot be accessed by the core. The EPROM is a byte address space and is not part of the TigerSHARC processor memory space. It is limited to 16M bytes (maximum address is 0xFFFF, ADDR31–24 = 0). The data is driven on the regular data bus Bits7–0.

The TigerSHARC processor uses 16 wait cycles for each read access from the EPROM. During the boot process, the BMS pin is used as the EPROM chip select pin until completion.

Figure 8-15. Access to Boot EPROM – 16 Wait Cycles
Multiprocessing Interface

This type of transaction can be operated by the DMA channels anytime. The transaction can also operate as write for flash memory programming. The write transaction is identical to the EPROM read, but the write does not scatter the bytes. When a word is written to the flash memory, it is driven on the external bus on Bits31–0. Since the flash memory is connected only to the bottom byte, the useful data is only on Bits7–0. It is the programmer’s responsibility to distribute the data on the bottom bytes of the words. In TigerSHARC processor’s internal or external RAM, the data should be organized as shown in Figure 8-16. After the last boot EPROM bus cycle, there are three idle cycles (for slow EPROM disconnect time).

![Figure 8-16. Data Preparation for Write to 8-Bit FLASH](image)

Multiprocessing Interface

The TigerSHARC processor incorporates distributed arbitration logic that supports multiprocessing. There can be up to eight processors plus a host connected to the same external bus and each of the bus agents can become the master of the bus.

The I/O pins that are related to multiprocessing are listed in Table 8-3.
## External Port and SDRAM Interface

### Table 8-3. Pin Definitions – External Port Arbitration

<table>
<thead>
<tr>
<th>Signal</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>BR7–0</td>
<td>Multiprocessing Bus Request Pins. Used by the DSPs in a multiprocessor system to arbitrate for bus mastership. Each DSP drives its own BRx line (corresponding to the value of its ID2–0 inputs) and monitors all others. In systems with fewer than eight DSPs, set the unused BRx pins high (VDD_IO).</td>
</tr>
<tr>
<td>ID2–0</td>
<td>Multiprocessor ID. Indicates the DSP’s ID, from which the DSP determines its order in a multiprocessor system. These pins also indicate to the DSP which bus request (BR0–BR7) to assert when requesting the bus: 000 = BR0, 001 = BR1, 010 = BR2, 011 = BR3, 100 = BR4, 101 = BR5, 110 = BR6, or 111 = BR7. ID2–0 must have a constant value during system operation and can change during reset only.</td>
</tr>
<tr>
<td>BM</td>
<td>Bus Master. The current bus master DSP asserts BM. For debugging only. At reset this is a strap pin.</td>
</tr>
<tr>
<td>BOFF</td>
<td>Back Off. A deadlock situation can occur when the host and a DSP try to read from each other’s bus at the same time. When deadlock occurs, the host can assert BOFF to force the DSP to relinquish the bus before completing its outstanding transaction.</td>
</tr>
<tr>
<td>BUSLOCK</td>
<td>Bus Lock Indication. Provides an indication that the current bus master has locked the bus. At reset, this is a strap pin.</td>
</tr>
<tr>
<td>HBR</td>
<td>Host Bus Request. A host must assert HBR to request control of the DSP’s external bus. When HBR is asserted in a multiprocessor system, the bus master relinquishes the bus and asserts HBG once the outstanding transaction is finished.</td>
</tr>
<tr>
<td>HBG</td>
<td>Host Bus Grant. Acknowledges HBR and indicates that the host can take control of the external bus. When relinquishing the bus, the master DSP three-states the ADDR31–0, DATA63–0, MSH, MSSID3–0, MS1–0, RD, WRD, WRT, BMS, BRST, TORD, IOWR, IOEN, RAS, CAS, SDWE, SDA10, SDCKE, LDQM and HDQM pins, and the DSP puts the SDRAM in self-refresh mode. The DSP asserts HBG until the host deasserts HBR. In multiprocessor systems, the current bus master DSP drives HBG, and all slave DSPs monitor it.</td>
</tr>
</tbody>
</table>
All on-chip arbiters in a cluster of TigerSHARC processors operate in lock-step on a cycle-by-cycle basis. In this way, every TigerSHARC processor tracks and follows the arbitration sequence of the shared bus. Bus arbitration is accomplished with the use of the BR7-0, HBR and HBG pins. The BR7-0 pins arbitrate between TigerSHARC processors, and HBR and HBG pins pass control of the external bus to the host. If the system has less than eight TigerSHARC processors, any unused BRx pins must be deasserted.

Every TigerSHARC processor has three identification input pins used to distinguish between TigerSHARC processors. These are the ID2-0 pins, where the ID2-0 pins of each TigerSHARC processor are set to a unique number and used as the processor ID. A rotating bus priority is implemented to ensure bus fairness between TigerSHARC processors. After reset, the TigerSHARC processor with ID = 000 becomes the master and priority rotates in a round robin fashion, going up from the present mas-
External Port and SDRAM Interface

ter. Priority rotation is interrupted either when the host that has highest priority in the system requests the bus, or when a TigerSHARC processor requests the bus by activating the CPA and DPA (Priority Access) pins.

In previous SHARC® processors (ADSP-2106x and ADSP-2116x families), the use of a processor with ID = 001 was mandatory for multiprocessing systems to function correctly. The SHARC processors use ID = 001 to select the bus master after reset and reserve ID = 000 for single processor systems.

In TigerSHARC processors, the ID pins operate differently. Multiprocessing TigerSHARC systems must always have a processor with ID = 000, which is the bus master after reset. Under some circumstances, multiprocessing TigerSHARC systems cannot function properly unless a processor with ID = 000 is present. For example, the ID = 000 processor must be present in any system including SDRAM because this processor performs the initialization (Mode Register Set MRS) of the SDRAM. Also, there are issues related to pull-downs, pull-ups, and open drain pull-ups which are only enabled on the processor with ID = 000.

The TigerSHARC processor can lock the bus in certain cases. For more information, see “Bus Lock (BUSLOCK) Register” on page 8-44. The following arbitration flow occurs when there is no bus lock.

System priority in descending order is:

- Host
- TigerSHARC processor (BR along with CPA asserted)
- TigerSHARC processor (BR along with DPA asserted)
- TigerSHARC processor (BR asserted)
Multiprocessing Interface

The current master yields the bus under these conditions:

- The current master does not request the bus and another slave asserts its BR or HBR.
- The current master requests the bus, but another slave asserts its BR and CPA. The current master does not assert CPA.
- The current master requests the bus, but another slave asserts its BR and DPA. The current master does not assert CPA or DPA.
- The current master requests the bus, but the host asserted its HBR.
- The current master requests the bus, but the value of the BMAX register expires and another slave asserts its BR. The BMAX register holds a count of cycles for which the TigerSHARC processor may keep the bus. The count is in internal cycles (CCLK frequency). See “Bus Master Maximum (BMAX) Register” on page 2-80 for more details.

Bus Arbitration Protocol

When one of the TigerSHARC processors needs to become a master, it asserts its own BR line, while at the same time monitoring the other BR lines. The current master, after completing its transaction, deasserts its own BR. Then, bus mastership is passed to the new requester. The new bus master retains the bus mastership by maintaining its BR, which is asserted continuously. When the current master relinquishes the bus, it is assigned the lowest priority. When other slaves want to become masters, each of them asserts their own BR and the slave with the highest priority gains the mastership of the bus. The remaining competing slaves keep their BRs asserted until gaining bus mastership according to their priority. There is no pipeline between masters. The current master completes all its pending transactions (transactions that have begun executing on the cluster bus).
before relinquishing the bus. Because of master change cycles, an IDLE cycle occurs when mastership is passed from the current master to a new master.

The bus arbitration allows any TigerSHARC processor to gain temporary priority over the current master. This is achieved by two wired OR pins—CPA and DPA.

![Diagram of External Bus Arbitration Sequence, CPA/DPA/HBR Inactive](image)

Figure 8-17. External Bus Arbitration Sequence, CPA/DPA/HBR Inactive

The host can be a bus master on the TigerSHARC processor’s external bus—it uses HBR and HBG for gaining control of the bus. In order for the host to become bus master, the host must assert HBR and wait until HBG is asserted by the current TigerSHARC processor master. The TigerSHARC processor relinquishes the external bus and indicates this by asserting HBG.
Multiprocessing Interface

After completing all outstanding transactions, the host keeps HBR asserted for as long as it needs the bus. The master that last relinquished the bus keeps its BR line asserted, so that when the host deasserts HBR, it becomes the master again.

Core Priority Access (CPA) Pin

The core priority access (CPA) pin is asserted when the TigerSHARC processor core accesses external memory. This allows a slave TigerSHARC processor to interrupt background transfers of a DMA channel belonging to a master TigerSHARC processor and gain control of the external bus. The current master in this case terminates its transaction and passes the bus mastership to the requesting TigerSHARC processor by deasserting its BR. When CPA is asserted, only TigerSHARC processors with core transactions can request the bus. The other requesting TigerSHARC processors deassert their BRs when they sense that CPA is asserted. When more than one TigerSHARC processor requests the bus by asserting their BRs along with CPA, the TigerSHARC processor with the highest priority gains bus mastership.
Figure 8-18. External Bus Arbitration Sequence; Only One Active DPA; HBR Inactive
Multiprocessing Interface

The DMA Priority Access, DPA, is asserted when a TigerSHARC processor high priority DMA channel accesses external memory—only if CPA is not asserted. This allows a high priority DMA channel belonging to a slave TigerSHARC processor to interrupt background transfers of a regular priority DMA channel belonging to a master TigerSHARC processor and gain control of the external bus. The current master in this case terminates its transaction and passes the bus mastership to the requesting TigerSHARC processor by deasserting its BR. When DPA is asserted, only TigerSHARC processors with high priority DMA transactions can request the bus. The other requesting TigerSHARC processors deassert their BRs.

Figure 8-19. External Bus Arbitration Sequence; More Than One Active CPA; HBR Active

**DMA Priority Access (DPA) Pin**

8-42 ADSP-TS201 TigerSHARC Processor Hardware Reference
when they sense DPA is asserted. When more than one TigerSHARC processor requests the bus by asserting their BRs along with DPA, the TigerSHARC processor with the highest priority gains the bus mastership.

When a priority access (CPA or DPA) finishes, and no other priority access is being requested – there are two idle cycles for the previous master (that was interrupted) to gain control of the bus. One cycle is to sense that the priority access is over, the second is to take control of the bus. This is illustrated in Figure 8-19 on page 8-42.

In the arbitration process, requesting bus agents with an active CPA have higher priority over requesting bus agents with an active DPA. This implies that requesting bus agents with an active DPA deassert their BRs upon sensing that a CPA is asserted by other bus agents.

Once the bus is owned by the TigerSHARC processor with the bus lock, the bus lock priority is almost the highest in the system. The TigerSHARC processor relinquishes the bus when the HBR and BOFF pins are asserted according to the protocol described in “Back Off (BOFF) Pin” on page 8-48. When bus lock is set, the bus is not relinquished by either a CPA, DPA, or a HBR assertion, nor by the BMAX count expire.

In an TigerSHARC processor cluster system, when one of the slave DSPs requests a priority access CPA or DPA, the TigerSHARC processor that has control of the bus deasserts the bus request line. Once the priority access is completed, there are two turnover cycles for the original TigerSHARC processor to regain control of the bus—one for the TigerSHARC processor to sense the priority access is done, and another to take control of the bus. This is illustrated in Figure 8-19 on page 8-42, which shows only one turnover cycle. This is because there are two priority accesses being requested. It only takes one turnover cycle to transfer from one master to another.
Bus Fairness (BMAX) Register

The system designer may want to limit the time a single TigerSHARC processor holds the bus, in order to prevent bus starvation of other masters in the system. The BMAX register defines the period that the TigerSHARC processor may own the bus. The time is defined in SOCCLK (or CCLK/2) cycles. The BMAX counter is initiated to the defined value when the TigerSHARC processor gets hold of the bus and starts counting down. Countdown is paused while the host gets HBG, so the amount of time that the host is the bus master is not included as a part of the specific TigerSHARC processor share in the bus. When the BMAX count expires and there is another bus request asserted by another TigerSHARC processor in the system, the master TigerSHARC processor relinquishes the bus for at least one cycle. If no other bus request is asserted, the BMAX counter is reloaded and the master continues to own the bus. When the BMAX counter is initialized with the value 0xFFFF (the reset value), the BMAX counter is disabled.

The BMAX count is not precise due to synchronization between clock domains over the SOC bus, there may be a skew in the period. Additionally, when the BMAX count expires, the bus is not relinquished until the current transaction has been completed. When the bus lock (BUSLK) bit in the BUSLOCK register is set, the BMAX expire is ignored until the bus lock is cleared. Once it is cleared, the bus is relinquished.

Bus Lock (BUSLOCK) Register

The bus lock feature is implemented in order to support atomic read-modify-write operations. When a TigerSHARC processor needs to access a semaphore, the processor requests a bus lock by setting its BUSLK bit in the BUSLOCK register (see “Bus Lock (BUSLOCK) System Control Register” on page 2-76). Setting this bit causes the bus arbitration logic to request the bus; and maintains BR assertion as long as the BUSLK bit is set. Once the BUSLK bit is cleared, arbitration resumes the regular path. Before accessing a semaphore, the TigerSHARC processor should check whether
External Port and SDRAM Interface

it is the current bus master by reading the BUSMSTR bit field in the SYSTAT register. If the TigerSHARC processor is the current master, writing to the semaphore location can be executed safely.

The system indicates whether or not the bus is locked by the TigerSHARC processor via a BUSLOCK pin. This enables the system to propagate the bus lock indication by way of a bridge to another bus.

There is also support for bus lock in the interrupt mechanism. When the bus lock bit is set and the TigerSHARC processor gets hold of the bus, a bus lock interrupt is set. For more information, see “Bus Lock Interrupt Register” on page 6-21.

Software (Processor Core) Reset Operation

The external port continues to operate even while the processor is in a software reset state (SWRST bit =1). While in software reset, the bus interface provides:

- Bus arbitration – The arbitration process on the cluster bus continues, but the processor that is in software reset cannot request the bus.

- Transaction response – The external port IFIFO replies to cluster bus transactions that are targeted to the processor, but these transactions are held in the IFIFO until the software reset is completed. Then, the transactions are executed by the processor’s SOC bus.

- Static status – The processor’s control and status registers may not change value as result of the software reset.

- Emulation response – If during software reset an emulation exception is issued, the emulation response is delayed until the processor exits software reset.
Host Processor Interface

The host processor interface connects a host to the TigerSHARC processor’s external bus. When the host is a slave, it uses the same protocol as the TigerSHARC processor—pipelined or slow device protocol depending on the host interface configuration specified in the SYSCON register. The host can access the TigerSHARC processor in the same pipelined protocol as any other TigerSHARC processor. The host can also access any of the slaves in the system using its own protocol. There are some slight differences between the host’s behavior and the TigerSHARC processor’s behavior. These differences are detailed below.

When a transaction to any address space (except the host space) follows a read from the host, the non-host transaction is serialized—for example, the non-host transaction takes place only after all host transactions have completely terminated. This is illustrated in Figure 8-20.

When the host accesses a TigerSHARC processor, it should comply with all the rules of TigerSHARC processor pipelined transactions. The most important rules are:

- Pipeline depth is always four for read and one for write.
- There are IDLE cycle restrictions.
- No read from broadcast address space is allowed.
- The host must operate with an identical system configuration as other TigerSHARC processors on the cluster bus.
External Port and SDRAM Interface

The host does not receive bus grant before the TigerSHARC processor (ID #0) has initialized the SDRAM. (See “SDRAM Programming” on page 8-67.)

- The host does not receive bus grant before the TigerSHARC processor (ID #0) has initialized the SDRAM. (See “SDRAM Programming” on page 8-67.)

- The TigerSHARC processor puts the SDRAM in self-refresh mode before the processor relinquishes the bus to the host. The host, in order to use the SDRAM, should change the SDRAM mode to normal work. Before returning the bus to the TigerSHARC processor, the host should put the SDRAM into self-refresh mode again.
When the host executes burst transactions to and from the TigerSHARC processor, the TigerSHARC processor allows for a continuous burst of more than four data units. The host issues a starting burst address. As long as BRST is asserted, the TigerSHARC processor increments the address internally. The BRST pin is used to indicate transaction continuation—in this case the transaction is ended on every quad-word address. The first transaction should begin and the last transaction should end in quad-word aligned addresses.

**Back Off (BOFF) Pin**

The host, one of the masters in the system, can request the bus, as explained in “Bus Arbitration Protocol” on page 8-38. In some cases, a deadlock situation may occur. This happens when the host and the TigerSHARC processor are trying to read from each other’s bus at the same time—see Figure 8-21 on page 8-49. In this case, the TigerSHARC processor has control of its system bus, the host has control of its system bus, and both issue a read transaction from each other. This results in the host bridge requesting both buses to execute both transactions. Neither of the buses is relinquished before the current master completes its own transaction. Likewise, neither of the transactions is completed because the buses are not relinquished. To resolve the deadlock, one of the requesters has to give up the bus, allowing the other requester to complete its read transaction. The TigerSHARC processor is designed to yield the bus to the host when the latter asserts BOFF, signaling the TigerSHARC processor to yield. In response, the TigerSHARC processor asserts HBG, allowing the host to execute its transactions. After the host completes the transactions and relinquishes the bus, the TigerSHARC processor performs the read transactions that were suspended when BOFF was asserted.

The backoff mechanism may not be used to stop write transactions to host.
The backoff mechanism can only be active when the TigerSHARC processor has reached the data cycle of transactions targeted to the host; this occurs after the host has deasserted the **ACK** pin. If the host is programmed to be accessed by the TigerSHARC processor in slow protocol with zero wait states, the backoff mechanism cannot be used because in this case the **ACK** signal cannot be asserted for the transaction.

Backoff also overrides a bus lock. If the host can access resources that are protected by a semaphore, there should be a system solution for the semaphore. One possible solution is to implement semaphores on the bridge in a way that does not require bus lock. If the semaphore is set when read and the read data is the previous value, there is no need for bus lock on the semaphore access. The host interface can be notified of a bus lock by the **BUSLOCK** pin.

![Diagram of bus lock scenarios](image)

Figure 8-21. Deadlock Scenario
The ADSP-TS201 TigerSHARC processor has a dedicated address space to support SDRAM. The banks in this address space are selected by the MSSD3–0 (Memory Select SDRAM) select lines, which can be used for direct interface with SDRAM devices. The SDRAM is accessed on memory address range 0x4000 0000 to 0x7FFF FFFF (see “External Memory Bank Space” on page 2-4).

The ADSP-TS201 processor supports standard (3.3V) SDRAM and low-power (2.5V) SDRAM.

SDRAM, unlike conventional DRAM, is synchronous to the SCLK (system clock). All inputs are sampled and all outputs are valid at the rising edge of the clock. The synchronous interface allows data transfer every cycle, yielding high throughput up to 500M Bytes/second for a 32-bit bus width, and 1G Bytes/second for a 64-bit bus width. The SDRAM has several types of burst accesses, depending on the initialization of its mode register, SDRCON. Each access type is preceded by issuance of the appropriate command to the SDRAM. The SDRAM specific modes should be initialized in the SDRCON register. (See “SDRAM Configuration (SDRCON) Register” on page 2-76.) The SDRAM interface provides a glueless interface with standard SDRAMs—6M bits, 64M bits, 128M bits, 256M bits, and 512M bits. The TigerSHARC processor directly supports a maximum of 64M words x 32 bits of SDRAM per bank for a total of 856M words.

For SDRAM, all initialization MRS (Mode Register Set) command and configuration is done by processor with ID000. This ID should always be present in any TigerSHARC processor cluster.

The SDRAM is organized internally into two or four banks. The SDRAM Bank Select pins determine which bank is being addressed. The SDRAM has a programmable read latency parameter that must be initialized by the application according to the type of device and the operating clock frequency.
External Port and SDRAM Interface

In order to meet the demanding SDRAM timing requirements, the TigerSHARC processor allows the SDRAM address and controls to be pipelined. The pipeline depth bit in the SDRCON register is used for this purpose. When this bit is set, data in write accesses are delayed by one cycle, allowing the address and controls to be externally latched and optionally manipulated in one cycle (for example, selection between Dual In-line Memory Modules (DIMMs) in the array). In pipelined read accesses, data is sampled by the TigerSHARC processor one cycle later. (CAS latency plus pipeline cycle are maximum four cycles.) (Refer to the timing diagram in the ADSP-TS201 TigerSHARC Embedded Processor Data Sheet.) Pipelining can be used when connecting several SDRAM devices in parallel such that their collective load is too high to be driven by the TigerSHARC processor.

SDRAM chips are available from different vendors. Each vendor has variations on timing requirements pertaining to ACT to PRE command delays (tRAS) and PRE to ACT command delays (tRP). In order to support all major vendors and different speed grades, the SDRCON register is programmed to help the system designer meet the specific SDRAM timing requirements. The bit fields are CAS latency, PRE to RAS delay, and RAS to PRE delay. See “SDRAM Configuration (SDRCON) Register” on page 2-76 for more information regarding SDRCON register definitions.

The TigerSHARC processor programs the SDRAM to full-page bursts. This allows other bus agents that have their own SDRAM controller, for example, an external host, to perform full-page burst transactions. The TigerSHARC processor itself operates in bursts of up to four data units. Whenever possible, however, it issues a new CAS for the following transaction early enough to make the whole sequence look like a continuous burst.

The TigerSHARC processor includes a programmable refresh rate in order to meet refresh timing. This rate may be used to coordinate between the clock rate and the required refresh rate. See “Setting the Refresh Counter Value – Refresh Rate” on page 8-70.
The following are definitions used throughout this chapter.

- **Bank Activate Command.** Activates the selected bank and latches in a new row address. It must be applied before a read or write command.

- **Burst Length.** Determines the number of words that the SDRAM inputs or outputs after detecting a write or read command, respectively. Although the SDRAM device is programmed for full-page burst, the controller uses quad-word (128 bits) burst mode. For 32-bit bus width, the burst length is 4 words, and for 64-bit width, the burst length is 2 words. Only the first read or write command is accompanied with an external address that is driven by the controller until the burst is interrupted by another address.

- **Burst Type.** Determines the order in which the SDRAM delivers or stores burst data after detecting a read or write command, respectively. The processor supports sequential accesses only.

- **CAS Latency.** The delay, in clock cycles, between when the SDRAM detects the read command and when it provides the data at its output pins. The speed grade of the device and the application’s clock frequency determine the value of the CAS latency. The application must program the CAS latency value into the SDRCON register.

- **CBR Automatic Refresh (CAS before RAS) Mode.** In this mode, the SDRAM drives its own refresh cycle with no external control input. At cycle end, all SDRAM banks are precharged (IDLE).

- **HDQM/LDQM Data I/O Mask Function.** The HDQM/LDQM pins are used by the controller to mask write operations. See “Understanding DQM Operation” on page 8-79.

- **SDCON Register.** An IOP register that contains programmable SDRAM control and configuration parameters that support different vendor’s timing and power up sequence requirements.
External Port and SDRAM Interface

- **Mode Register.** The SDRAM configuration register that contains user-defined parameters corresponding to the processor SDRCON register. After initial power up and before executing a read or write command, the application must program the initialization sequence in the SDRCON register.

- **Page Size.** The size, in words, of the SDRAM page. The processor supports 1024-, 512-, and 256-word page sizes. Page size is a programmable option in the SDRCON register.

- **Precharge Command.** Precharges an active bank.

- **Refresh Rate.** Programmable value in the SDRCON register. The refresh rate enables applications to coordinate SCLK rate with the SDRAM’s required refresh rate.

- **Self-Refresh.** The SDRAM internal timer initiates automatic refresh cycles periodically, without external control input. This command places the SDRAM device in a low power mode.

- **tRAS.** Active Command time. Required delay between issuing an activate command and issuing a precharge command. A vendor-specific value. This option is programmable in the SDRCON register.

- **tRC.** Bank Cycle time. The required delay between successive Bank Activate commands to the same bank. This vendor-specific value is defined as tRC = tRP + tRAS. The processor fixes the value of this parameter so it is a non-programmable option.
SDRAM Interface

- \( t_{RCD} \): RAS to CAS Delay. The required delay between an ACT command and the start of the first read or write operation. This vendor-specific value is defined as \( t_{RCD} = CL \). The processor fixes the value of this parameter so it is a non-programmable option.

- \( t_{RP} \): Precharge Time. The required delay between issuing a pre-charge command and issuing an activate command. This vendor-specific value is programmable in SDRCON.

Figure 8-22 on page 8-54 shows the SDRAM controller interface between the internal TigerSHARC processor core and the external SDRAM device.
The TigerSHARC processor normally generates an external memory address, which then asserts the corresponding SDRAM memory select line (MSSD3–0), along with RD and WR accesses. These control signals are intercepted by the SDRAM controller. The memory access to SDRAM is based on the mapping of the addresses and memory selects. The configuration is programmed in the SDRCON register. The SDRAM controller can hold off the TigerSHARC processor core or I/O processor as determined by refresh, non-sequential access, or page miss latency overhead.

The SDRAM controller provides a glueless interconnection between the SDRAM control, address, and data pins and the TigerSHARC processor’s internal Harvard Architecture buses. The internal 32-bit address bus is multiplexed by the SDRAM controller to generate the corresponding chip select, row address, column address, and bank select signals to the SDRAM.

SDRAM Interface I/O Pins

Table 8-4 lists the I/O pins associated with the SDRAM. Other bus interface I/O pins are detailed in Table 8-1 on page 8-4.

<table>
<thead>
<tr>
<th>Signal</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>MSSD3–0</td>
<td>Memory Select SDRAM. MSSD0, MSSD1, MSSD2, or MSSD3 is asserted whenever the DSP accesses SDRAM memory space. MSSD3–0 are decoded memory address pins that are asserted whenever the DSP issues an SDRAM command cycle (access to ADDR31:30 = 0b01—except reserved spaces). In a multiprocessor system, the master DSP drives MSSD3–0.</td>
</tr>
<tr>
<td>RAS</td>
<td>Row Address Select. When sampled low, RAS indicates that a row address is valid in a read or write of SDRAM. In other SDRAM accesses, it defines the type of operation to execute according to SDRAM specification.</td>
</tr>
<tr>
<td>CAS</td>
<td>Column Address Select. When sampled low, CAS indicates that a column address is valid in a read or write of SDRAM. In other SDRAM accesses, it defines the type of operation to execute according to the SDRAM specification.</td>
</tr>
</tbody>
</table>
Table 8-4. Pin Definitions – External Port SDRAM Controller (Cont'd)

<table>
<thead>
<tr>
<th>Signal</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>LDQM</td>
<td>Low Word SDRAM Data Mask. When sampled high, three-states the SDRAM DQ buffers. LDQM is valid on SDRAM transactions when CAS is asserted, and inactive on read transactions. On write transactions, LDQM is active when accessing an odd address word on a 64-bit memory bus to disable the write of the low word.</td>
</tr>
<tr>
<td>HDQM</td>
<td>High Word SDRAM Data Mask. When sampled high, three-states the SDRAM DQ buffers. HDQM is valid on SDRAM transactions when CAS is asserted, and inactive on read transactions. On write transactions, HDQM is active when accessing an even address in word accesses or when memory is configured for a 32-bit bus to disable the write of the high word.</td>
</tr>
<tr>
<td>SDA10</td>
<td>SDRAM Address bit 10 pin. Separate A10 signals enable SDRAM refresh operation while the DSP executes non-SDRAM transactions.</td>
</tr>
<tr>
<td>SDCKE</td>
<td>SDRAM Clock Enable. Activates the SDRAM clock for SDRAM self-refresh or suspend modes. A slave DSP in a multiprocessor system does not have the pullup or pulldown. A master DSP (or ID=0 in a single processor system) has a pullup before granting the bus to the host, except when the SDRAM is put in self refresh mode. In self refresh mode, the master has a pulldown before granting the bus to the host.</td>
</tr>
<tr>
<td>SDWE</td>
<td>SDRAM Write Enable. When sampled low while CAS is active, SDWE indicates an SDRAM write access. When sampled high while CAS is active, SDWE indicates an SDRAM read access. In other SDRAM accesses, SDWE defines the type of operation to execute according to SDRAM specification.</td>
</tr>
</tbody>
</table>

**SDRAM Physical Connection**

The SDRAM address and data pins can be connected directly to the TigerSHARC processor address and data pins. The data bus is either the full 64 bits or the lower 32 bits according to the memory bus width programming in SYSCON (see “SYSCON Register Programming” on
The address connection is according to the bus width and there is an option to split the load on different pins. For 32-bit bus, the connection is listed below.

- SDRAM address Bits9–0 are connected to TigerSHARC processor ADDR9–0
- SDRAM address Bit10 is connected to TigerSHARC processor pin SDA10
- SDRAM address Bits15–11 are connected to TigerSHARC processor ADDR15–11 (or as many as required)

In a 64-bit bus, the address should be shifted. The scheme should be as follows.

- SDRAM address Bits9–0 are connected to TigerSHARC processor ADDR10–1
- SDRAM address Bit10 is connected to TigerSHARC processor pin SDA10
- SDRAM address Bits14–11 are connected to TigerSHARC processor ADDR15–12 (or as many as required)

The control signals are specific SDRAM control signals. These signals are listed below.

- MSSD3–0 – SDRAM Chip Select
- RAS – Row Address Strobe
- CAS – Column Address Strobe
- SDWE – SDRAM Write Enable
- LDQM – Data Mask for Low Word (data Bits31–0)
**SDRAM Interface**

- **HDQM** – Data Mask for High Word (data Bits 63–32)
- **SDCKE** – Clock Enable

In large memory systems, there are often many SDRAM chips that can overload the address bus. For this reason, SDRAM address Bits 15–0 are duplicated on TigerSHARC processor ADDR31–16. The scheme for a 32-bit connection is as follows:

- SDRAM address Bits 9–0 are connected to TigerSHARC processor ADDR25–16
- SDRAM address Bit 10 is connected to TigerSHARC processor pin SDA10
- SDRAM address Bits 15–11 are connected to TigerSHARC processor ADDR31–27 (or as many as required)

The scheme for a 64-bit bus is as follows.

- SDRAM address Bits 9–0 are connected to TigerSHARC processor ADDR26–17
- SDRAM address Bit 10 is connected to TigerSHARC processor pin SDA10
- SDRAM address Bits 14–11 are connected to TigerSHARC processor ADDR31–28 (or as many as required)

Another option to reduce the load is to buffer the address and control signals in registers to delay these signals in a cycle. In this case the pipeline depth bit in SDRCON should be set.
External Port and SDRAM Interface

Bank Select Pins

The connections of the address pins as bank select lines for the SDRAM device varies depending on the following factors: operational voltage of the SDRAM device being used (standard 3.3V or 2.5V low-power SDRAM) and the number of banks that the part has.

For standard SDRAMs, any of the ADDR15-11 pins can be used as bank select as long as the pins do not drive address bits used either as row or column addresses and drive the same address bits during active and read or write cycles. For two banks, BS=ADDR11 or BS can equal any address in the ADDR15-11 range. For four banks, BS1-0=ADDR12-11 or BS1-0 can equal any other address pair in the ADDR15-11 range.

For low-power SDRAMs, the use of the address lines as bank select signals is limited to ADDR15-14. For two banks, any of the two address lines can be used, BS=ADDR14 or ADDR15. For four banks, BS1-0=ADDR15-14 must be used.
**SDRAM Interface**

**Internal Address and SDRAM Physical Connection**

Figure 8-23, Table 8-5, Table 8-6, and Table 8-7 describe external port data alignment and SDRAM connections for a 64-bit wide system bus. Figure 8-24, Table 8-8, Table 8-9, and Table 8-10 describe external port data alignment and SDRAM connections for a 32-bit wide system bus.

![Data Alignment Diagram](image-url)

Figure 8-23. External Port Data Align for 64-bit
# External Port and SDRAM Interface

Table 8-5. Word Page Size 256, 64-Bit Bus Width

<table>
<thead>
<tr>
<th>Physical TigerSHARC Processor Pin</th>
<th>Bank Active Cycle Internal Address</th>
<th>Column Access Cycle Internal Address</th>
<th>Physical SDRAM Pin</th>
</tr>
</thead>
<tbody>
<tr>
<td>A0</td>
<td>8</td>
<td>0</td>
<td>NC</td>
</tr>
<tr>
<td>A1</td>
<td>9</td>
<td>1</td>
<td>A0</td>
</tr>
<tr>
<td>A2</td>
<td>10</td>
<td>2</td>
<td>A1</td>
</tr>
<tr>
<td>A3</td>
<td>11</td>
<td>3</td>
<td>A2</td>
</tr>
<tr>
<td>A4</td>
<td>12</td>
<td>4</td>
<td>A3</td>
</tr>
<tr>
<td>A5</td>
<td>13</td>
<td>5</td>
<td>A4</td>
</tr>
<tr>
<td>A6</td>
<td>14</td>
<td>6</td>
<td>A5</td>
</tr>
<tr>
<td>A7</td>
<td>15</td>
<td>7</td>
<td>A6</td>
</tr>
<tr>
<td>A8</td>
<td>16</td>
<td>8</td>
<td>A7</td>
</tr>
<tr>
<td>A9</td>
<td>17</td>
<td>9</td>
<td>A8</td>
</tr>
<tr>
<td>A10</td>
<td>18</td>
<td>10</td>
<td>A9</td>
</tr>
<tr>
<td>SDA10</td>
<td>19</td>
<td>Zero</td>
<td>A10/AP</td>
</tr>
<tr>
<td>A11</td>
<td>Irrelevant</td>
<td>Irrelevant</td>
<td>NC</td>
</tr>
<tr>
<td>A12</td>
<td>20</td>
<td>20</td>
<td>A11 or Bank</td>
</tr>
<tr>
<td>A13</td>
<td>21</td>
<td>21</td>
<td>A12 or Bank</td>
</tr>
<tr>
<td>A14</td>
<td>22</td>
<td>22</td>
<td>A13 or Bank</td>
</tr>
<tr>
<td>A15</td>
<td>23</td>
<td>23</td>
<td>A14 or Bank</td>
</tr>
</tbody>
</table>
### SDRAM Interface

#### Table 8-6. Word Page Size 512, 64-Bit Bus Width

<table>
<thead>
<tr>
<th>Physical TigerSHARC Processor Pin</th>
<th>Bank Active Cycle Internal Address</th>
<th>Column Access Cycle Internal Address</th>
<th>Physical SDRAM Pin</th>
</tr>
</thead>
<tbody>
<tr>
<td>A0</td>
<td>9</td>
<td>0</td>
<td>NC</td>
</tr>
<tr>
<td>A1</td>
<td>10</td>
<td>1</td>
<td>A0</td>
</tr>
<tr>
<td>A2</td>
<td>11</td>
<td>2</td>
<td>A1</td>
</tr>
<tr>
<td>A3</td>
<td>12</td>
<td>3</td>
<td>A2</td>
</tr>
<tr>
<td>A4</td>
<td>13</td>
<td>4</td>
<td>A3</td>
</tr>
<tr>
<td>A5</td>
<td>14</td>
<td>5</td>
<td>A4</td>
</tr>
<tr>
<td>A6</td>
<td>15</td>
<td>6</td>
<td>A5</td>
</tr>
<tr>
<td>A7</td>
<td>16</td>
<td>7</td>
<td>A6</td>
</tr>
<tr>
<td>A8</td>
<td>17</td>
<td>8</td>
<td>A7</td>
</tr>
<tr>
<td>A9</td>
<td>18</td>
<td>9</td>
<td>A8</td>
</tr>
<tr>
<td>A10</td>
<td>19</td>
<td>10</td>
<td>A9</td>
</tr>
<tr>
<td>SDA10</td>
<td>20</td>
<td>Zero</td>
<td>A10/AP</td>
</tr>
<tr>
<td>A11</td>
<td>Irrelevant</td>
<td>Irrelevant</td>
<td>NC</td>
</tr>
<tr>
<td>A12</td>
<td>21</td>
<td>21</td>
<td>A11 or Bank</td>
</tr>
<tr>
<td>A13</td>
<td>22</td>
<td>22</td>
<td>A12 or Bank</td>
</tr>
<tr>
<td>A14</td>
<td>23</td>
<td>23</td>
<td>A13 or Bank</td>
</tr>
<tr>
<td>A15</td>
<td>24</td>
<td>24</td>
<td>A14 or Bank</td>
</tr>
</tbody>
</table>
## External Port and SDRAM Interface

Table 8-7. Word Page Size 1K, 64-Bit Bus Width

<table>
<thead>
<tr>
<th>Physical TigerSHARC Processor Pin</th>
<th>Bank Active Cycle Internal Address</th>
<th>Column Access Cycle Internal Address</th>
<th>Physical SDRAM Pin</th>
</tr>
</thead>
<tbody>
<tr>
<td>A0</td>
<td>10</td>
<td>0</td>
<td>NC</td>
</tr>
<tr>
<td>A1</td>
<td>11</td>
<td>1</td>
<td>A0</td>
</tr>
<tr>
<td>A2</td>
<td>12</td>
<td>2</td>
<td>A1</td>
</tr>
<tr>
<td>A3</td>
<td>13</td>
<td>3</td>
<td>A2</td>
</tr>
<tr>
<td>A4</td>
<td>14</td>
<td>4</td>
<td>A3</td>
</tr>
<tr>
<td>A5</td>
<td>15</td>
<td>5</td>
<td>A4</td>
</tr>
<tr>
<td>A6</td>
<td>16</td>
<td>6</td>
<td>A5</td>
</tr>
<tr>
<td>A7</td>
<td>17</td>
<td>7</td>
<td>A6</td>
</tr>
<tr>
<td>A8</td>
<td>18</td>
<td>8</td>
<td>A7</td>
</tr>
<tr>
<td>A9</td>
<td>19</td>
<td>9</td>
<td>A8</td>
</tr>
<tr>
<td>A10</td>
<td>20</td>
<td>10</td>
<td>A9</td>
</tr>
<tr>
<td>SDA10</td>
<td>21</td>
<td>Zero</td>
<td>A10/AP</td>
</tr>
<tr>
<td>A11</td>
<td>Irrelevant</td>
<td>Irrelevant</td>
<td>NC</td>
</tr>
<tr>
<td>A12</td>
<td>22</td>
<td>22</td>
<td>A11 or Bank</td>
</tr>
<tr>
<td>A13</td>
<td>23</td>
<td>23</td>
<td>A12 or Bank</td>
</tr>
<tr>
<td>A14</td>
<td>24</td>
<td>24</td>
<td>A13 or Bank</td>
</tr>
<tr>
<td>A15</td>
<td>25</td>
<td>25</td>
<td>A14 or Bank</td>
</tr>
</tbody>
</table>
SDRAM Interface

Figure 8-24. External Port Data Align for 32-bit

1 FOR SDRAM ACCESSES ONLY
2 ADDRESS PINS CONNECTION (ADSP-TS201): EXTERNAL DATA = A0>A0
3 CONFIGURED IN SYSCON BUS WIDTH SETTINGS
### External Port and SDRAM Interface

Table 8-8. Word Page Size 256, 32-Bit Bus Width

<table>
<thead>
<tr>
<th>Physical TigerSHARC Processor Pin</th>
<th>Bank Active Cycle Internal Address</th>
<th>Column Access Cycle Internal Address</th>
<th>Physical SDRAM Pin</th>
</tr>
</thead>
<tbody>
<tr>
<td>A0</td>
<td>8</td>
<td>0</td>
<td>A0</td>
</tr>
<tr>
<td>A1</td>
<td>9</td>
<td>1</td>
<td>A1</td>
</tr>
<tr>
<td>A2</td>
<td>10</td>
<td>2</td>
<td>A2</td>
</tr>
<tr>
<td>A3</td>
<td>11</td>
<td>3</td>
<td>A3</td>
</tr>
<tr>
<td>A4</td>
<td>12</td>
<td>4</td>
<td>A4</td>
</tr>
<tr>
<td>A5</td>
<td>13</td>
<td>5</td>
<td>A5</td>
</tr>
<tr>
<td>A6</td>
<td>14</td>
<td>6</td>
<td>A6</td>
</tr>
<tr>
<td>A7</td>
<td>15</td>
<td>7</td>
<td>A7</td>
</tr>
<tr>
<td>A8</td>
<td>16</td>
<td>8</td>
<td>A8</td>
</tr>
<tr>
<td>A9</td>
<td>17</td>
<td>9</td>
<td>A9</td>
</tr>
<tr>
<td>A10</td>
<td>Irrelevant</td>
<td>Irrelevant</td>
<td>NC</td>
</tr>
<tr>
<td>SDA10</td>
<td>18</td>
<td>0</td>
<td>A10/AP</td>
</tr>
<tr>
<td>A11</td>
<td>19</td>
<td>19</td>
<td>A11 or Bank</td>
</tr>
<tr>
<td>A12</td>
<td>20</td>
<td>20</td>
<td>A12 or Bank</td>
</tr>
<tr>
<td>A13</td>
<td>21</td>
<td>21</td>
<td>A13 or Bank</td>
</tr>
<tr>
<td>A14</td>
<td>22</td>
<td>22</td>
<td>A14 or Bank</td>
</tr>
<tr>
<td>A15</td>
<td>23</td>
<td>23</td>
<td>A15 or Bank</td>
</tr>
</tbody>
</table>
### SDRAM Interface

Table 8-9. Word Page Size 512, 32-Bit Bus Width

<table>
<thead>
<tr>
<th>Physical TigerSHARC Processor Pin</th>
<th>Bank Active Cycle Internal Address</th>
<th>Column Access Cycle Internal Address</th>
<th>Physical SDRAM Pin</th>
</tr>
</thead>
<tbody>
<tr>
<td>A0</td>
<td>9</td>
<td>0</td>
<td>A0</td>
</tr>
<tr>
<td>A1</td>
<td>10</td>
<td>1</td>
<td>A1</td>
</tr>
<tr>
<td>A2</td>
<td>11</td>
<td>2</td>
<td>A2</td>
</tr>
<tr>
<td>A3</td>
<td>12</td>
<td>3</td>
<td>A3</td>
</tr>
<tr>
<td>A4</td>
<td>13</td>
<td>4</td>
<td>A4</td>
</tr>
<tr>
<td>A5</td>
<td>14</td>
<td>5</td>
<td>A5</td>
</tr>
<tr>
<td>A6</td>
<td>15</td>
<td>6</td>
<td>A6</td>
</tr>
<tr>
<td>A7</td>
<td>16</td>
<td>7</td>
<td>A7</td>
</tr>
<tr>
<td>A8</td>
<td>17</td>
<td>8</td>
<td>A8</td>
</tr>
<tr>
<td>A9</td>
<td>18</td>
<td>9</td>
<td>A9</td>
</tr>
<tr>
<td>A10</td>
<td>Irrelevant</td>
<td>Irrelevant</td>
<td>NC</td>
</tr>
<tr>
<td>SDA10</td>
<td>19</td>
<td>Zero</td>
<td>A10/AP</td>
</tr>
<tr>
<td>A11</td>
<td>20</td>
<td>20</td>
<td>A11 or Bank</td>
</tr>
<tr>
<td>A12</td>
<td>21</td>
<td>21</td>
<td>A12 or Bank</td>
</tr>
<tr>
<td>A13</td>
<td>22</td>
<td>22</td>
<td>A13 or Bank</td>
</tr>
<tr>
<td>A14</td>
<td>23</td>
<td>23</td>
<td>A14 or Bank</td>
</tr>
<tr>
<td>A15</td>
<td>24</td>
<td>24</td>
<td>A15 or Bank</td>
</tr>
</tbody>
</table>
SDRAM Programming

SDRAM components are available from several vendors. Each vendor has different SDRAM product requirements for the power up sequence and the timing parameters—\( t_{\text{RAS}} \) (ACT to PRE command delay), \( t_{\text{RCD}} \) and \( t_{\text{RP}} \).
SDRAM Interface

(PR) to (ACT command delay). Use only SDRAMs that comply with Joint Electronic Device Engineering Council (JEDEC) specifications. In order to support multiple vendors, the TigerSHARC processor SDRCON register can be programmed to meet these requirements. (See Figure 2-38 on page 2-77.)

The SDRCON register of the TigerSHARC processor stores the configuration information of the SDRAM interface. Writing configuration parameters initiates commands to the SDRAM that take effect immediately.

In the SDRCON register, the parameter bits support the following operations:

- Enabling SDRAM
- Selecting the CAS Latency Value – CAS
- Setting the SDRAM Buffering Option – Pipeline Depth
- Selecting the SDRAM Page Size – Page Boundary
- Setting the Refresh Counter Value – Refresh Rate
- Selecting the Precharge to RAS Delay
- Selecting the RAS to Precharge Delay
- Setting the SDRAM Power-Up Mode – Initialization Sequence
- Enabling the Extended Mode (EMR) Register

Enabling SDRAM

Bit0 should be set if there is an SDRAM in the system, otherwise it should be cleared. Any access to SDRAM while this bit is cleared causes a hardware error interrupt.
Selecting the CAS Latency Value – CAS

The CAS latency value defines the delay, in number of system clock cycles (SCLK), between the time that the SDRAM detects the read command and the time that it provides the data at its output pins. This parameter facilitates matching the SDRAM operation with the processor’s ability to latch the data output.

CAS latency does not apply to write cycles.

The CAS Bits2–1 in the SDRCON register select the CAS latency value as follows.

- 00 = 1 cycle latency
- 01 = 2 cycles latency
- 10 = 3 cycles latency
- 11 = Reserved

Generally, the frequency of the operation determines the value of the CAS latency.

Setting the SDRAM Buffering Option – Pipeline Depth

Systems that use several SDRAM devices connected in parallel may require buffering between the processor and multiple SDRAM devices in order to meet overall system timing requirements. To meet such timing requirements and enable intermediary buffering, the processor supports pipelining of SDRAM address and control signals. The pipeline depth Bit3 in the SDRCON register enables this mode:

- Pipeline depth = 0 Disable pipelining
- Pipeline depth = 1 Enable pipelining
SDRAM Interface

When set (= 1), the SDRAM controller delays the data in write accesses by one cycle, enabling the processor to latch the address and controls externally. In read accesses, the SDRAM controller samples data one cycle later.

Figure 8-25 on page 8-71 shows another single processor example in which the SDRAM interface connects to multiple banks of SDRAM to provide 512M of SDRAM in a 4-bit I/O configuration. This configuration results in 16M x 32-bit words. In this example, 0xA and 0xB output from the registered buffers are the same signal, but are buffered separately. In the registered buffers, a delay of one clock cycle occurs between the input (Ix) and its corresponding output (0xA or 0xB).

Selecting the SDRAM Page Size – Page Boundary

The processor’s SDRAM controller page boundary bits define the page size, in number of words, of the SDRAM’s banks. The setup of Bits 5–4 in the SDRCON register is as follows.

00 = 256 words
01 = 512 words
10 = 1024 words
11 = Reserved

Setting the Refresh Counter Value – Refresh Rate

Since the clock supplied to the SDRAM can vary, the processor provides a programmable refresh counter to coordinate the SOC clock rate with the SDRAM device’s required refresh rate. Bits 8–7 select between four different refresh rates calculated with the following equation:

\[ f_{cycles} = \left( \frac{sdramclock \times t_{ref}}{rows} \right) = (clock \times refresh\_rate) \]
External Port and SDRAM Interface

Figure 8-25. Single System With SDRAM Standard Devices (32-bit Bus)
SDRAM Interface

Where:

- 00 = Once every 1100 cycles
- 01 = Once every 1850 cycles
- 10 = Once every 2200 cycles
- 11 = Once every 3700 cycles

Selecting the Precharge to RAS Delay

The tRP value (precharge to RAS delay) defines the required delay, in number of system clock cycles (SCLK), between the time the SDRAM controller issues a PRE command and the time it issues an ACT command.

The setup of Bits10–9 in the SDRCON register for this parameter is shown in Table 8-11.

Table 8-11. Precharge to RAS Delay (tRP) Bits

<table>
<thead>
<tr>
<th>Delay</th>
<th>SDRCON Bits10–9</th>
</tr>
</thead>
<tbody>
<tr>
<td>2 cycles</td>
<td>00</td>
</tr>
<tr>
<td>3 cycles</td>
<td>01</td>
</tr>
<tr>
<td>4 cycles</td>
<td>10</td>
</tr>
<tr>
<td>5 cycles</td>
<td>11</td>
</tr>
</tbody>
</table>

Selecting the RAS to Precharge Delay

The tRAS value (RAS to Precharge Delay) defines the required delay, in number of system clock cycles (SCLK), between the time the SDRAM controller issues an ACT command and the time it issues a PRE command.

The setup of Bits13–11 in the SDRCON register for this parameter is shown in Table 8-12.
External Port and SDRAM Interface

Table 8-12. RAS to Precharge Delay (tRAS) Bits

<table>
<thead>
<tr>
<th>Delay</th>
<th>SDRCON Bits13–11</th>
</tr>
</thead>
<tbody>
<tr>
<td>2 cycles</td>
<td>000</td>
</tr>
<tr>
<td>3 cycles</td>
<td>001</td>
</tr>
<tr>
<td>4 cycles</td>
<td>010</td>
</tr>
<tr>
<td>5 cycles</td>
<td>011</td>
</tr>
<tr>
<td>6 cycles</td>
<td>100</td>
</tr>
<tr>
<td>7 cycles</td>
<td>101</td>
</tr>
<tr>
<td>8 cycles</td>
<td>110</td>
</tr>
</tbody>
</table>

Setting the SDRAM Power-Up Mode – Initialization Sequence

To avoid unpredictable start-up modes, SDRAM devices must follow a specific initialization sequence during power up. The processor provides two commonly used power-up options.

The Init Sequence bit (14) in the SDRCON register selects the SDRAM power-up mode. When set (=1), the SDRAM controller sequentially issues: a PRE command, eight AutoRefresh cycles, and an MRS (Mode Register Set) command. When cleared (=0), the SDRAM controller issues, in this order: a PRE command, an MRS (Mode Register Set) command, and eight AutoRefresh cycles.

Enabling the Extended Mode (EMR) Register

The EMR enabled bit 15 in the SDRCON register enables the extended mode register set. When set (=1), the SDRAM controller issues an EMRS cycle preceding the MRS command. When cleared (=0), the init sequence proceeds as explained in the “Setting the SDRAM Power-Up Mode – Initialization Sequence” on page 8-73 without the preceding EMRS command.
SDRAM Interface

Please note that the $EMR$ enable bit should only be set when interfacing to low-power SDRAM devices (2.5V).

Flyby Transactions

Flyby transactions are used by the DMA for transferring data between an external I/O device and external SDRAM memory (for example, DMA may be programmed to transfer data from an A/D device to the SDRAM). A flyby transaction is issued by a DMA channel that is programmed to execute flyby transactions (see “DPx Register” on page 7-19). The flyby transaction applies only to SDRAM.

In flyby transactions, the TigerSHARC processor relinquishes the data bus during the transaction and then activates in the external memory and the I/O devices in parallel. Memory is driven in the regular way and the I/O device is controlled with the $IORD$, $IOWR$, and $IOEN$ pins.

During flyby write transactions (memory write and I/O read), the signals $IORD$ and $IOEN$ are driven active after the rising edge of $SCLK$ one cycle before the data cycle. The I/O device samples the $IORD$ signal at the rising edge of $SCLK$, and valid data is driven after this edge. The $IOEN$ signal is driven active as long as the I/O devices drives the data bus. If there are flyby transactions interleaved with regular write transaction to the same SDRAM page, there is an IDLE cycle added between the two transactions.

Figure 8-26 shows an example of a flyby write transaction. In transactions CA0 and CA2, the current bus master drives the data. During transaction CA1, the I/O device drives the data. IDLE cycles are inserted between the CA0 and CA1 writes and are inserted between the CA1 and CA2 writes to avoid data contention. In this case, data drivers are switching from the current bus master to the I/O devices and back to the current bus master. The $IORD$ signal can be used, for example, to control FIFO pointers. The $IOEN$ signal should be used to control the I/O devices’s output buffers. $IOEN$ is driven low one cycle before the data cycle to improve the data setup and the frequency of operation.
The sequence illustrated in Figure 8-26 applies when:

- Transfer is followed by another burst write transfer from I/O to the SDRAM device
- Flyby transaction and bus width = 64

Figure 8-26. Synchronous Burst Write Transfer Followed by Another Burst Write Transfer

1 Note that if the SDRAM is set to pipelined mode (one cycle delay), the IORD, IOWR, and IOEN signals are delayed for one cycle with the transaction.

Flyby transactions are used by the DMA controller to transfer data between an external I/O device and external SDRAM memory. (For the ADSP-TS201 processor, flyby transactions are supported with the external...
SDRAM memory space only. A flyby transaction type can be from SDRAM memory to I/O (flyby read) or from I/O to SDRAM memory (flyby write). A flyby transaction is issued by a DMA channel that is programmed to execute flyby transactions using the $TY$ field of the $DPx$ register. (See “DPx Register” on page 7-19.)

In flyby transactions, the ADSP-TS201 relinquishes the data bus during the data transaction(s) and selects the external SDRAM memory and the I/O device in parallel. The SDRAM memory is controlled using the appropriate SDRAM controller signals ($MSSDx$, $RAS$, $CAS$, and $SDWE$), and the I/O device is controlled with the $TORD$, $TOWR$, and $TOEN$ pins.

The $TORD$ and $TOWR$ signals on the ADSP-TS201 processor replace the $FLYBY$ signal from previous TigerSHARC processors for flyby mode transactions.

During flyby read transactions (external SDRAM memory read and I/O write), the $TOWR$ signal is active during the data cycle. The memory drives data after the rising edge of $SCLK$ and the I/O device latches the data on the rising edge of $SCLK$ when the $TOWN$ signal is active.

**Figure 8-27** shows an example of a flyby read transaction. In transactions, CA0 and CA2 data are read by the current bus master, while the I/O device latches CA1 data. In order to support SDRAM flyby read transactions, the I/O devices should be a synchronous device.

The sequence illustrated in **Figure 8-27** applies when:

- Transfer is followed by another burst read transfer from the SDRAM to I/O device
- Flyby transaction; $CAS$ latency = 2; bus width = 64
External Port and SDRAM Interface

Table 8-13 lists the data throughput rates for the processor’s core or DMA read/write accesses to SDRAM. The following assumptions are made for the information in this table:

- CAS latency = 2 cycles (CL = 2)
- No SDRAM buffering (pipeline depth = 0)
- $t_{RP} = 2$ cycles, $t_{RAS} = 3$ cycles, and $t_{RCD}$ (fixed) = $CL = 2$ cycles
Multiprocessing Operation

In a multiprocessing environment, the SDRAM is shared among two or more TigerSHARC processors. SDRAM input signals are always driven by the bus master. The slave processors track the commands that the master processor issues to the SDRAM. This feature or function helps to synchronize the SDRAM refresh counters and to prevent needless refreshing operations.

Table 8-13. Data Throughput Rates

<table>
<thead>
<tr>
<th>Accesses</th>
<th>Operations</th>
<th>Page</th>
<th>Throughput per SDRAM Clock (64-bit words)</th>
</tr>
</thead>
<tbody>
<tr>
<td>Sequential Uninterrupted</td>
<td>Read</td>
<td>Same</td>
<td>1 word/1 cycle</td>
</tr>
<tr>
<td>Non-sequential Uninterrupted</td>
<td>Read</td>
<td>Same</td>
<td>1 word/1 cycle</td>
</tr>
<tr>
<td>Sequential Interrupted</td>
<td>Read</td>
<td>Same</td>
<td>1 word/8 cycles (6 + CL)</td>
</tr>
<tr>
<td>Sequential Uninterrupted</td>
<td>Write</td>
<td>Same</td>
<td>1 word/1 cycle</td>
</tr>
<tr>
<td>Non-sequential Uninterrupted</td>
<td>Write</td>
<td>Same</td>
<td>1 word/1 cycle</td>
</tr>
<tr>
<td>Sequential Interrupted</td>
<td>Write</td>
<td>Same</td>
<td>1 word/1 cycle</td>
</tr>
<tr>
<td>Both</td>
<td>Read to Write</td>
<td>Same</td>
<td>1 word/6 cycles</td>
</tr>
<tr>
<td>Both</td>
<td>Write to Read</td>
<td>Same</td>
<td>1 word/4 cycles (2 + CL)</td>
</tr>
<tr>
<td>Non-sequential</td>
<td>Read</td>
<td>Different</td>
<td>1 word/12 cycles (6 + tRP + tRCD + CL)</td>
</tr>
<tr>
<td>Non-sequential</td>
<td>Write</td>
<td>Different</td>
<td>1 word/6 cycles</td>
</tr>
<tr>
<td>Autorefresh before read</td>
<td>Read</td>
<td>Different</td>
<td>1 word/17 cycle (6 + 2tRP + tRAS + tRCD + CL)</td>
</tr>
<tr>
<td>Autorefresh before write</td>
<td>Write</td>
<td>Different</td>
<td>1 word/11 cycle (2 + 2tRP + tRAS + tRCD)</td>
</tr>
</tbody>
</table>

1 SYSCON external bus width configuration: 64-bit bus
2 With SDRAM buffering (pipeline depth = 1), replace any instance of (CL) with (CL+1)
External Port and SDRAM Interface

In multiprocessor systems, where the bus mastership changes, the current bus master automatically issues a precharge command (PRE) before the bus is relinquished to the new bus master. This way, the new bus master can safely start accessing the SDRAM by simply issuing an activation command (ACT).

In TigerSHARC multiprocessor systems where SDRAM is used, the Mode Register Set sequence is only issued by the TigerSHARC processor with ID = 000. Therefore, a TigerSHARC processor with ID = 000 must be present in every multiprocessor system.

Understanding DQM Operation

The HDQM/LDQM pins are used by the controller to mask write operations. HDQM three-states the SDRAM DQ buffers when performing 32-bit writes to even addresses in a 64-bit bus configuration, or when bus is configured as a 32-bit bus. LDQM three-states the SDRAM DQ buffers when performing writes to odd addresses in a 64-bit bus configuration. This data mask function does not apply for read operations, where the LDQM and HDQM pins are always low (inactive).

Powering Up After Reset

After reset, once the SDRCON register is written to in the user application code, the controller initiates the selected power up sequence. The exact sequence is determined by the Init Sequence bit and Extended Mode Register bit in the SDRCON register. In a multiprocessing environment, the power up sequence is initiated by the TigerSHARC processor with ID = 000. Note that software reset does not reset the controller and does not re-initiate a power up sequence.
SDRAM Interface

SDRAM Controller Commands

This section describes each command the SDRAM controller uses to manage the SDRAM interface. These commands are transparent to applications.

A summary of the various commands used by the on-chip controller for the SDRAM interface is as follows.

MRS

(Mode Register Set). Initializes the SDRAM operation parameters during the power up sequence.

PRE

(Precharge). Precharges the active bank.

ACT

(Bank Activate). Activates a page in the required bank.

Read

Read from SDRAM.

Write

Write to SDRAM.

REF

(Refresh). Causes the SDRAM to enter refresh mode and generate all addresses internally.

SREF

(Self-Refresh). Places the SDRAM in self-refresh mode, in which it controls its refresh operations internally.
NOP
(No Operation). Issued after a read or write to enable burst operation or to insert wait cycles for different SDRAM accesses. \texttt{MSSD3–0} is deasserted during this command.

**Mode Register Set (MRS) Command**

Mode Register Set (MRS) is a part of the system initialization sequence. MRS initializes SDRAM operation parameters by using ADDR13–0 of the SDRAM as data input. The MRS command is issued only by the TigerSHARC processor with ID = 000, and must be issued prior to the first SDRAM access after power up.

MRS consists of two optional sequences of which only one is selected, depending on the SDRCON Init Sequence bit. The availability of two optional types is to help support different SDRAM vendors.

MRS initializes the following parameters (using ADDR13–0):

- **Burst length** – full page; Bits2–0; hardwired in the TigerSHARC processor
- **Wrap type** – sequential; Bit3; hardwired in the TigerSHARC processor
- **Ltmode** – latency mode (CAS latency); Bits6–4; set according to SDRCON programming
- **Bits13–7** – always 0; hardwired in the TigerSHARC processor

When executing MRS, the SDRAM must be in IDLE state in all its banks. The SDRAM cannot accept any other access during the two clock cycles following MRS. The SDRAM pin state during the MRS command is shown in Table 8-14 below.
Previous to the MRS sequence in which the SDRAM parameters are initialized, the TigerSHARC processor SDRAM controller may also issue an additional mode register set command depending on the programmed value in the EMR bit in SDRCON.

This extra sequence is issued when the corresponding bit is enabled, which should only be the case when specifically required by the SDRAM device being interfaced to (i.e. low-power SDRAM devices). Otherwise, the EMR bit should be cleared in which case only a MRS will be issued by the controller.

Although, this is also a MRS command, the difference between the two sequences, MRS and EMRS, lies on the data sent via the address lines driven by the TigerSHARC SDRAM controller (also see “Bank Select Pins” on page 8-59).

The extended mode register controls power saving related functions and initializes the following parameters (using ADDR13–0 as for MRS):

- Partial array self-refresh (PASR) – the self-refresh coverage is set to four banks; bits 2–0; hardwired in the TigerSHARC processor
- Temperature compensated self-refresh (TCSR) – the self-refresh frequency is set to its maximum (maximum case temperature 85°C); bits 4–3; hardwired in the TigerSHARC processor

Bits 13–5 – always 0; hardwired in the TigerSHARC processor
External Port and SDRAM Interface

Table 8-14. Pin State During MRS Command

<table>
<thead>
<tr>
<th>Pin</th>
<th>State</th>
</tr>
</thead>
<tbody>
<tr>
<td>MSSD3–0</td>
<td>low</td>
</tr>
<tr>
<td>CAS</td>
<td>low</td>
</tr>
<tr>
<td>RAS</td>
<td>low</td>
</tr>
<tr>
<td>SDWE</td>
<td>low</td>
</tr>
<tr>
<td>SDCKE</td>
<td>high</td>
</tr>
<tr>
<td>A15–14 = A31–30</td>
<td>00 = MRS 10 = EMRS</td>
</tr>
</tbody>
</table>

Precharge (PRE) Command

The PRE command has two functions—terminate a read or write cycle and precharge the active bank.

Terminating Read/Write Cycles

The exact moment when the PRE command is issued to terminate a read or write cycle depends on the SDRAM latency mode.

Precharging

After powerup a PRE command to all SDRAM banks is issued (SDA10 high). However, only the active bank precharges. Only one bank at a time can be active—no single bank precharge supported.

Precharge is issued in these cases:

1. During the SDRAM Init Sequence
2. Before ACT – access to a new page
SDRAM Interface

3. Before refresh cycle

4. Before the TigerSHARC processor relinquishes the external bus

Following a PRE command, the SDRAM cannot accept any other access for the precharge to RAS delay, tRP. The appropriate field in the SDRCON register should be programmed according to the SDRAM characteristics.

The SDRAM pin state during the PRE command is shown in Table 8-15.

Table 8-15. Pin State During PRE Command

<table>
<thead>
<tr>
<th>Pin</th>
<th>State</th>
</tr>
</thead>
<tbody>
<tr>
<td>MSSD3–0</td>
<td>low</td>
</tr>
<tr>
<td>SDWE</td>
<td>low</td>
</tr>
<tr>
<td>RAS</td>
<td>low</td>
</tr>
<tr>
<td>CAS</td>
<td>high</td>
</tr>
<tr>
<td>SDCKE</td>
<td>high</td>
</tr>
<tr>
<td>SDA10</td>
<td>high</td>
</tr>
</tbody>
</table>

Bank Active (ACT) Command

The ACT (Bank Active) command precedes any read or write access to a page that was not accessed beforehand. The ACT command is preceded with a PRE command. The ACT command asserts RAS and MSSD3–0 in order to enable the SDRAM to latch the row address. The ACT command opens up the SDRAM page, which remains open until the next precharge command. This is done to keep the page open even if there are occurrences of non-SDRAM accesses within a flow of accesses to a single SDRAM page.
External Port and SDRAM Interface

Table 8-16. Pin State During ACT Command

<table>
<thead>
<tr>
<th>Pin</th>
<th>State</th>
</tr>
</thead>
<tbody>
<tr>
<td>MSSD3–0</td>
<td>low</td>
</tr>
<tr>
<td>CAS</td>
<td>high</td>
</tr>
<tr>
<td>RAS</td>
<td>low</td>
</tr>
<tr>
<td>SDWE</td>
<td>high</td>
</tr>
<tr>
<td>SDCKE</td>
<td>high</td>
</tr>
</tbody>
</table>

Read Command

A Read command is preceded by an ACT command if it is the first access to the page. In Read commands, SDWE is deasserted and CAS and MSSD3–0 are asserted to enable the SDRAM to latch the column address according to which burst start address is set.

The delay between ACT and Read commands is determined by the tRCD parameter. Data is available after the tRCD and CAS latency requirements are met.

Single transactions take a different number of cycles according to transaction size and external bus width.

- Single word read – 1 cycle
- Long word read on 64-bit bus – 1 cycle
- Long word read on 32-bit bus – 2 cycles
- Quad word read on 64-bit bus – 2 cycles
- Quad word read on 32-bit bus – 4 cycles

If no new command is issued to the SDRAM after a Read command, the TigerSHARC processor continues reading from the sequential addresses. This is called “page mode” or “burst”. If the whole transaction is more
than one cycle (for example, quad transactions on a 32-bit bus), no new command is issued to the SDRAM and the rest of the data is read from sequential addresses. When the whole transaction is completed, the SDRAM can continue in different ways:

- If there is no new SDRAM transaction, the \textit{Bstop} command is issued followed by a precharge command closing the active page.
- If there is an SDRAM read transaction to the same page, a new transaction begins in the next cycle.
- If there is an SDRAM write to the same page, \textit{Bstop} command is issued and the write begins after a delay of $t_{CAS}$ latency to prevent contention on the data bus.
- If there is an SDRAM access to another page, \textit{Bstop} command is issued and then, following the RAS-to-precharge ($t_{RP}$) delay, Pre-charge ($PRE$) and Bank Active ($ACT$) cycles are issued.

Different examples of \textit{Read} sequences are shown in Figure 8-28 on page 8-87 through Figure 8-29 on page 8-88.

Table 8-17. Pin State During Read Command

<table>
<thead>
<tr>
<th>Pin</th>
<th>State</th>
</tr>
</thead>
<tbody>
<tr>
<td>MSSD3-0</td>
<td>low</td>
</tr>
<tr>
<td>CAS</td>
<td>low</td>
</tr>
<tr>
<td>RAS</td>
<td>low</td>
</tr>
<tr>
<td>SDWE</td>
<td>high</td>
</tr>
<tr>
<td>SDCKE</td>
<td>high</td>
</tr>
</tbody>
</table>
External Port and SDRAM Interface

Bus Width = 64

For the sequence in Figure 8-28, the following is true:

- \( \text{CAS} \) latency = 2
- Bus width = 64

Figure 8-28. Bus Width = 64
(Burst Read Followed by Burst Read in the Same Page)
SDRAM Interface

Bus Width = 32

For the sequence in Figure 8-29, the following is true:

- $\overline{\text{CAS}}$ latency = 2
- Bus width = 32

Figure 8-29. Bus Width = 32
(Burst Read Followed by Burst Read in the Same Page)
Write Command

A Write command is preceded by an ACT command if it is the first access to the page. In Write commands, CAS, SDME, and MSSD3–0 are asserted to enable the SDRAM to latch the column address to set the burst start address.

The delay between ACT and Write commands is determined by the tRCD parameter. Data is driven by the TigerSHARC processor on the Write command.

Single write transactions take a different number of cycles according to transaction size and external bus width.

- Single word write – 1 cycle
- Long word write on 64-bit bus – 1 cycle
- Long word write on 32-bit bus – 2 cycles
- Quad word write on 64-bit bus – 2 cycles
- Quad word write on 32-bit bus – 4 cycles

If no new command is issued to the SDRAM after a Write command is issued, the SDRAM continues writing to sequential addresses. This is called ‘page mode’ or ‘burst’. If the whole transaction is more than one cycle—for example, quad write on a 32-bit bus—no new command is issued to the SDRAM and the rest of the data is written to sequential addresses. When the whole transaction is completed, the SDRAM can continue in different ways:

- If there is no new SDRAM transaction, Bstop command is issued, followed by a precharge command closing the active page.
- If there is an SDRAM write transaction to the same page, a new transaction begins on the next cycle.
SDRAM Interface

- If there is an SDRAM read to the same page, \texttt{Bstop} command is issued to stop the write transaction, and the read transaction begins on the next cycle.

- If there is an SDRAM access to another page, \texttt{BSTOP} command is issued and then, keeping to the RAS-to-precharge (t\textsubscript{RP}) delay constraint, Precharge (PRE) and ACT cycles are issued.

Table 8-18. Pin State During Write Command

<table>
<thead>
<tr>
<th>Pin</th>
<th>State</th>
</tr>
</thead>
<tbody>
<tr>
<td>MSSD3–0</td>
<td>low</td>
</tr>
<tr>
<td>CAS</td>
<td>low</td>
</tr>
<tr>
<td>RAS</td>
<td>high</td>
</tr>
<tr>
<td>SDWE</td>
<td>low</td>
</tr>
<tr>
<td>SDCKE</td>
<td>high</td>
</tr>
</tbody>
</table>
External Port and SDRAM Interface

Bus Width = 64

For the sequence in Figure 8-31, the following is true:

- CAS latency = 2
- Bus width = 64

Figure 8-30. Bus Width = 64
(Burst Write Followed by Burst Write in the Same Page)
**SDRAM Interface**

**Bus Width = 32**

For the sequence in Figure 8-31, the following is true:

- $\overline{\text{CAS}}$ latency = 2
- Bus width = 32

![Diagram of bus width 32](image)

Figure 8-31. Bus Width = 32
(Burst Write Followed by Burst Write in the Same Page)
External Port and SDRAM Interface

Refresh (REF) Command

The REF command is a request to the SDRAM to perform a CBR (CAS Before RAS) refresh transaction. This transaction is generated automatically by the TigerSHARC processor and causes all addresses to be produced internally in the SDRAM. Before executing the REF command, the SDRAM controller executes a precharge (PRE) command to the active bank. The next active (ACT) command is given by the controller only after a minimum delay equal to $t_{RC}$, where:

$$t_{RC} = t_{RP} + t_{RAS} + 1$$

The refresh cycle is required by the SDRAM to keep the data valid. The refresh frequency is set in the SDRCON register (see “SDRAM Programming” on page 8-67). In a multiprocessing system, the refresh is done by the current bus master. To eliminate problems caused by differences between TigerSHARC processors in refresh counter initialization, every slave TigerSHARC processor monitors the external bus and resets its own counter when identifying a refresh cycle.

Table 8-19. Pin State During REF Command

<table>
<thead>
<tr>
<th>Pin</th>
<th>State</th>
</tr>
</thead>
<tbody>
<tr>
<td>MSSD3–0</td>
<td>low</td>
</tr>
<tr>
<td>CAS</td>
<td>low</td>
</tr>
<tr>
<td>RAS</td>
<td>low</td>
</tr>
<tr>
<td>SDWE</td>
<td>high</td>
</tr>
<tr>
<td>SDCKE</td>
<td>high</td>
</tr>
</tbody>
</table>

Self-Refresh (SREF) Command

The TigerSHARC processor enters self-refresh mode before the bus is relinquished to the host. The purpose is to keep the SDRAM self-refreshed in case the host doesn’t interface with the SDRAM. If the
host can interface with the SDRAM, it releases the SDRAM from self-refresh mode and returns it to self-refresh mode before relinquishing the bus to the TigerSHARC processor. Before entering into self-refresh mode, the same conditions that apply to the refresh mode (see “Refresh (REF) Command” on page 8-93) must be sustained. Similarly, reaccessing the SDRAM after exiting from self-refresh mode is also limited by the same restrictions as for refresh mode. Self-refresh mode is entered by deasserting the \texttt{SDCKE} signal and is exited by reasserting the same signal.

The \texttt{SREF} command causes refresh operations to be performed internally by the SDRAM without any external control. Before executing the \texttt{SREF} command, the SDRAM controller precharges all banks. After exit from self-refresh mode, the SDRAM controller waits for \( t_{RC} \) number of cycles before executing a bank active command (\texttt{ACT}).

Table 8-20. Pin State During SREF Command

<table>
<thead>
<tr>
<th>Pin</th>
<th>State</th>
</tr>
</thead>
<tbody>
<tr>
<td>MSSD3–0</td>
<td>low</td>
</tr>
<tr>
<td>CAS</td>
<td>low</td>
</tr>
<tr>
<td>RAS</td>
<td>low</td>
</tr>
<tr>
<td>SDWE</td>
<td>high</td>
</tr>
<tr>
<td>SDCKE</td>
<td>low</td>
</tr>
</tbody>
</table>

Programming Example

This section provides a programming example written for the TigerSHARC processor. The example shown in Listing 8-1 on page 8-95 demonstrates how to set up the SDRAM controller to work with the ADSP-TS201 EZ-KIT Lite™.
Listing 8-1. SDRAM Controller Setup for ADSP-TS201 EZ-KIT Lite

```c
#include <defts201.h>
#include "TSE2kitDef.h"

.section program;

SDRAM_Init:

xr0 = SYSCON_MP_WID64 | SYSCON_MEM_WID64 |
     SYSCON_MSH_PIPE2 | SYSCON_MSH_WT0 | SYSCON_MSH_IDLE |
     SYSCON_MS1PIPE1 | SYSCON_MS1_WT0 | SYSCON_MS1_IDLE |
     SYSCON_MS0_SLOW | SYSCON_MS0_WT3 | SYSCON_MS0_IDLE ::
SYSCON = xr0;;

xr0 = SDRCON_INIT | SDRCON_RAS2PC5 | SDRCON_PC2RAS2 |
     SDRCON_REF3700 | SDRCON_PG256 | SDRCON_CLAT2 |
     SDRCON_ENBL ::
SDRCON = xr0;;
```
SDRAM Interface
9 LINK PORTS

The ADSP-TS201 TigerSHARC processor has four full-duplex link ports. Link ports provide an optional communication channel. It is intended to be used for point-to-point communications between TigerSHARC processors in a system, but can be used to interface to any other device that is designed to work in the same protocol.

The link ports on the ADSP-TS201 TigerSHARC processor use low voltage differential signalling (LVDS) circuitry. Each link port has both a receive and transmit channel and is capable of full-duplex operation.

ADSP-TS201’s link ports are similar to those on previous TigerSHARC processors in the way the links are accessed by the core and the DMA, but the link ports differ in their physical implementation.

Data is driven and latched on both the rising and falling edge of the link clock.

Transfer control at the core occurs through interrupts or polling. Transfer control at the DMA occurs through the dedicated DMA channels for transmit and receive.

DMA chaining is supported, and all link ports can be used for booting.

ADSP-TS201 TigerSHARC processor link ports are not compatible with previous TigerSHARC processor or SHARC DSP link ports.
Link Architecture

As shown in Figure 9-1, the link ports are bus slaves on the SOC bus. It is important to note that link port register accesses must cross a number of clock domains when reading or writing register data.

Figure 9-1. Link Ports and SOC Bus Connections
Link Ports

Corresponding response for link port controls may vary with SOC bus loading. For more information on clock domains, see “SOC Interface” on page 3-1.

The link port has two parts—transmitter and receiver. The transmit channel has a double buffer and the receive channel has a triple buffer, as shown in Figure 9-2 on page 9-3. The \texttt{LBUFTXx} and \texttt{LBUFRxx} registers are 28-bit, memory-mapped \texttt{Ureg} registers. The shift registers are not accessible by software.

![Link Port Architecture](image-url)

Figure 9-2. Link Port Architecture
Link Architecture

Link I/O Pins

Table 9-1 describes the I/O pins related to the link ports. There is a set of link pins for each link port. The ‘x’ in the signal name indicates the link port—0, 1, 2, or 3.

Table 9-1. Pin Definitions – Link Ports

<table>
<thead>
<tr>
<th>Signal</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>LxDATO3–0P</td>
<td>Link Ports 3–0 Data 3–0 Transmit LVDS P</td>
</tr>
<tr>
<td>LxDATO3–0N</td>
<td>Link Ports 3–0 Data 3–0 Transmit LVDS N</td>
</tr>
<tr>
<td>LxCLKOUTP</td>
<td>Link Ports 3–0 Transmit Clock LVDS P</td>
</tr>
<tr>
<td>LxCLKOUTN</td>
<td>Link Ports 3–0 Transmit Clock LVDS N</td>
</tr>
<tr>
<td>LxACKI</td>
<td>Link Ports 3–0 Transmitter Acknowledge Input. Using this signal, the receiver indicates to the transmitter that it may continue the transmission</td>
</tr>
<tr>
<td>LxBCMPO</td>
<td>Link Ports 3–0 Block Completion. When the transmission is executed using DMA, this signal indicates to the receiver that the transmitted block is completed. At reset, the L1BCMPO, L2BCMPO, and L3BCMPO pins are strap pins. Prior to port initialization, the link port drives LxBCMPO high (deasserted), indicating to the receiver TigerSHARC processor that the port is connected.</td>
</tr>
<tr>
<td>LxDATI3–0P</td>
<td>Link Ports 3–0 Data 3–0 Receive LVDS P</td>
</tr>
<tr>
<td>LxDATI3–0N</td>
<td>Link Ports 3–0 Data 3–0 Receive LVDS N</td>
</tr>
<tr>
<td>LxCLKINP</td>
<td>Link Ports 3–0 Receive Clock LVDS P</td>
</tr>
<tr>
<td>LxCLKINN</td>
<td>Link Ports 3–0 Receive Clock LVDS N</td>
</tr>
<tr>
<td>LxACKO</td>
<td>Link Ports 3–0 Receiver Acknowledge Output. Using this signal, the receiver indicates to the transmitter that it may continue the transmission.</td>
</tr>
<tr>
<td>LxBCMPI</td>
<td>Link Ports 3–0 Block Completion. When the reception is executed using DMA, this signal indicates to the receiver that the receive block is completed. When reset is deasserted, a high (deasserted) value on LxBCMPI indicates that the port is connected.</td>
</tr>
</tbody>
</table>
Each link port has two independent channels that can operate simultaneously. The transmit channel transfers data to a second device, and the receive channel gets data from a second device. Each channel communicates using up to four data bits, using the $LxCLKOUTP/N$, $LxACKI$, $LxCLKINP/N$, and $LxACKO$ signals to control the data transfer. $LxBCMPI$ and $LxBCMPO$ signals are used to signal that the current data block transmission has been completed. Link ports should be connected as shown in Figure 9-3 or Figure 9-4.

**Figure 9-3. Link Configuration (4-Bit Mode)**

**Figure 9-4. Link Configuration (1-Bit Mode)**

The ADSP-TS201 processor’s link ports are clocked LVDS high-speed data ports. LVDS is a standard for differential signaling between distant elements at higher speeds than single-ended techniques. This technique allows data transmission between two ADSP-TS201 TigerSHARC proces-
Link Architecture

LVDS, or between a TigerSHARC processor and an FPGA. LVDS provides higher frequency, higher noise immunity, lower power consumption and less electro-magnetic interference.

LVDS signalling requires differential termination. Figure 9-5 shows an ADSP-TS201 link port transmitting to a close ADSP-TS201 device. The internal 100 Ω termination resistors (RT) on the receiver side provide the required termination. The traces on the board must be matched in order to keep the same delays for all data pins and clock. The routing of the single-ended signals (LxACKI, LxACKO, LxBCMPI, and LxBCMPO) is not as critical, but delays should be kept close to those of the differential signals.

Figure 9-5. ADSP-TS201S To ADSP-TS201S Configuration
Link Ports

Link Transmit and Receive Data

Data transfer is performed by writing to the transmit buffer and reading from the receive buffer. All the data written to the transmit buffer is copied to the shift register once it is empty, and then transmitted. The receiver enables data movement in, only when the receive shift register is empty, or when there is space in the receive buffers to accept the data from shift register when receive of this quad-word is completed. After the whole quad word has been received, the receiver moves the data from the shift register to the receive buffer when it is free. The link control moves the data into the receive buffer as its first priority. If the receive buffer is full, it copies the data into the temporary receive buffer, and keeps it there until the receive buffer is free. The receiver controls the data flow by deasserting LxACK0. Up to two quad words can be shifted in after the LxACK0 deassertion.

Table 9-2. Link Port TX and RX Buffer Registers

<table>
<thead>
<tr>
<th>Register (Quad Words)</th>
<th>Link Ports 0–3 Registers</th>
</tr>
</thead>
<tbody>
<tr>
<td>LBUFTX0</td>
<td>Link # 0 transmit register</td>
</tr>
<tr>
<td>LBUFRX0</td>
<td>Link # 0 receive register</td>
</tr>
<tr>
<td>LBUFTX1</td>
<td>Link # 1 transmit register</td>
</tr>
<tr>
<td>LBUFRX1</td>
<td>Link # 1 receive register</td>
</tr>
<tr>
<td>LBUFTX2</td>
<td>Link # 2 transmit register</td>
</tr>
<tr>
<td>LBUFRX2</td>
<td>Link # 2 receive register</td>
</tr>
<tr>
<td>LBUFTX3</td>
<td>Link # 3 transmit register</td>
</tr>
<tr>
<td>LBUFRX3</td>
<td>Link # 3 receive register</td>
</tr>
</tbody>
</table>
## Link Architecture

### Table 9-3. Link Port Control and Status Registers

<table>
<thead>
<tr>
<th>Register (Normal Word)</th>
<th>Link Ports 0–3 Control and Status Registers</th>
</tr>
</thead>
<tbody>
<tr>
<td>LRCTL0</td>
<td>Link # 0 receive control register</td>
</tr>
<tr>
<td>LRCTL1</td>
<td>Link # 1 receive control register</td>
</tr>
<tr>
<td>LRCTL2</td>
<td>Link # 2 receive control register</td>
</tr>
<tr>
<td>LRCTL3</td>
<td>Link # 3 receive control register</td>
</tr>
<tr>
<td>LTCTL0</td>
<td>Link # 0 transmit control register</td>
</tr>
<tr>
<td>LTCTL1</td>
<td>Link # 1 transmit control register</td>
</tr>
<tr>
<td>LTCTL2</td>
<td>Link # 2 transmit control register</td>
</tr>
<tr>
<td>LTCTL3</td>
<td>Link # 3 transmit control register</td>
</tr>
<tr>
<td>LRSTAT0</td>
<td>Link # 0 receive status register</td>
</tr>
<tr>
<td>LRSTAT1</td>
<td>Link # 1 receive status register</td>
</tr>
<tr>
<td>LRSTAT2</td>
<td>Link # 2 receive status register</td>
</tr>
<tr>
<td>LRSTAT3</td>
<td>Link # 3 receive status register</td>
</tr>
<tr>
<td>LTSTAT0</td>
<td>Link # 0 transmit status register</td>
</tr>
<tr>
<td>LTSTAT1</td>
<td>Link # 1 transmit status register</td>
</tr>
<tr>
<td>LTSTAT2</td>
<td>Link # 2 transmit status register</td>
</tr>
<tr>
<td>LTSTAT3</td>
<td>Link # 3 transmit status register</td>
</tr>
<tr>
<td>LRSTATC0</td>
<td>Link # 0 receive status clear register</td>
</tr>
<tr>
<td>LRSTATC1</td>
<td>Link # 1 receive status clear register</td>
</tr>
<tr>
<td>LRSTATC2</td>
<td>Link # 2 receive status clear register</td>
</tr>
<tr>
<td>LRSTATC3</td>
<td>Link # 3 receive status clear register</td>
</tr>
<tr>
<td>LTSTATC0</td>
<td>Link # 0 transmit status clear register</td>
</tr>
<tr>
<td>LTSTATC1</td>
<td>Link # 1 transmit status clear register</td>
</tr>
<tr>
<td>LTSTATC2</td>
<td>Link # 2 transmit status clear register</td>
</tr>
<tr>
<td>LTSTATC3</td>
<td>Link # 3 transmit status clear register</td>
</tr>
</tbody>
</table>
Link DMA

Each link port is associated with two DMA channels. One channel is used for transmitting data while the other is used for receiving data. The two DMA channels can interface with internal memory, external memory, or other link port buffers.

The link port issues a service request to the transmit DMA channel when the $LBUF TXx$ register is empty and the DMA channel is enabled. The link port issues a DMA request to the receive link DMA channel when it receives a data quad-word—that is, when the $LBURXx$ register is full and the DMA channel is enabled.

When a link DMA channel (transmit or receive) becomes active and the transmit buffer is empty (for transmit channel) or the receive buffer is full (for receive channel), a DMA request is immediately issued.

The receive DMA can also be used for pass-through transfers by writing to the transmit register of another channel. The DMA also takes care to transfer data only when the transmit register of the target link is free.

Link Block Completion

In some applications, the receiver may not know exactly how many quad-words it will receive from the transmitter. The block completion feature allows the transmitter to indicate to the receiver that the block transfer is complete when all of the data for that block has been transmitted.

When the DMA writes the last quad-word to the transmit buffer of the transmitting link port, it issues a buffer complete indication along with it. The link port issues $LBXBCMPO$ active when it transmits this quad-word and the $TBCMPE$ bit in $LTCTLx$ is set. See “Link Port Control Registers” on page 9-21.
Link Architecture

When $\text{LxBCMPI}$ is sampled active, the receiving link port transfers this information to the DMA together with its DMA request. The TCB word counters are cleared by the DMA and the DMA sequence completes.

Link Interrupts

The link ports have dedicated interrupts for controlling data flow when performing link port receive transfers using the processor core (as opposed to DMA). The link port receive interrupt is asserted only if the DMA channel for a link port has not been initialized. A receive link port asserts the link receive interrupt when a quad-word received by the link port is waiting in the $\text{LBUF}RXx$ register. The link port receive interrupt is level-sensitive and therefore if the DMA channel associated with the link port becomes active after the link port interrupt has been asserted, the link port receive interrupt will be deasserted and a DMA request will be issued.

Link Reset Initialization and Boot

Link port booting is a slave boot mode; the TigerSHARC processor expects boot code to be placed internally. After reset, all four receive link port DMA channels are initialized to transfer a 256-word block (boot kernel) to internal memory addresses 0 through 255 and set to issue an interrupt at the end of the block (similar to an external port DMA). The corresponding DMA interrupts are set to address zero. For more information, see “Processor Booting Methods” on page 11-2.

The DMA channel of any port not being used for boot must be cleared and reinitialized. If the DMA channel is not deactivated, the link interrupt mechanism does not work. The link may be reinitialized as well.

For link port boot, place a 500$\Omega$ resistor between the $\text{BMS}$ pin and $V_{DD\_IO}$ supply.

Applying reset ($\text{RST}\_IN$=0) or disabling the link ports through the link port control registers flushes the link port buffers.
It is important to be aware of the sequence of events for link port booting. The link port boot sequence following a reset is:

1. Link port booting is always enabled. The hardware reset value for the LRCTLx register enables the link port receive (REN=1).

2. The processor detects that external boot (EBOOT) strap pin (boot memory select (BMS) pin at runtime) is high, and waits for an external processor or link port boot to begin.

   Because link port receive is always enabled at reset, link port booting does not depend on the EBOOT strap pin value.

3. When the device booting the processor through the link port is ready, the device de-asserts (=1) the LxBCMPT pin of the link port to initialize the port. The processor responds by executing the link receive DMA and receiving the 256 word boot kernel.

   This link port initialization using the LxBCMPT pin for a hardware handshake only occurs the first time the port is used after reset. Note that it may require up to 2500 SOCCLK clock cycles from LxBCMPT pin assertion to receiver initialization. After this initialization, normal usage of the LxBCMPT pin applies.

### Link Alternate (Software) Initialization

The first time a link port receiver is used after reset, the port requires an initialization handshake. Most systems perform this handshake by de-asserting (=1) the LxBCMPT pin of the link port to initialize the port. See “Link Reset Initialization and Boot” on page 9-10. When a device (for example, an FPGA or host) is connected to a link port receiver, but the device cannot generate an initialization handshake on the LxBCMPT pin, the system can use alternate (software) initialization of the link port.
Alternate initialization of the link port uses the \texttt{REN} (receive enable), \texttt{RINIF} (receive initialization, forced), and \texttt{RINV} (receive initialization, value) bits in the link port’s LRCTLx register. The usage of these bits for initialization is follows:

- If \texttt{REN}=1, the link port receive is enabled, and link initialization forced is disabled.
- If \texttt{REN}=0 and \texttt{RINIF}=0, the link port receive is disabled, and link initialization forced disabled.
- If \texttt{REN}=0, \texttt{RINIF}=1, and \texttt{RINV}=0, the link port receive is disabled, and link initialization is forced to a value of 0. This software state is the same as the LxBCMPT pin being held low (=0).
- If \texttt{REN}=0, \texttt{RINIF}=1, and \texttt{RINV}=1, the link port receive is disabled, and link port initialization is forced to a value of 1. This software state is the same as the LxBCMPT pin being held high (=1).

To perform alternate initialization, the processor does the following:

1. Write \texttt{REN}=0, \texttt{RINIF}=1, and \texttt{RINV}=1 (initialize link port receive)
2. Write \texttt{REN}=1 (link port receive enable)

The link port communicates through a 1- or 4-bit data bus per direction (RX or TX), using three control signals. These pins are listed in Table 9-1 on page 9-4. In 1-bit mode, the LxDATIOP/N and LxDATOOP/N pins are used for link port transfers.

Each link port has two independent channels that can operate simultaneously. The transmit channel transfers data to an external device, the receive channel gets data from an external device. Each channel communicates using up to four data bits, using the LxCLKOUTP/N, LxACKI,
**Link Ports**

$LxCLKINP/N$ and $LxACKO$ signals to control the data transfer. $LxBCMPT$ and $LxBCMPU$ signals are used to signal that the current buffer transmission has been completed.

There are some general rules that apply to the timing in the link port protocol (shown in Figure 9-6 through Figure 9-14). Understanding these rules makes it easier to recognize patterns in these diagrams. The general rules are:

- First data (1-bit or 4-bit) is transferred on the rising edge of the link clock ($LxCLKOUTP$ for discussion purposes). This is always true.
- Last data (1-bit or 4-bit) is transferred on the falling edge of the link clock ($LxCLKOUTP$ for discussion purposes). This is always true.
- $LxCLKOUTP$ is driven low when the link is idle.

The minimum granularity of the transfer is a quad word. A quad word can be transferred in 16 clock cycles when the four data bits are enabled or 64 clock cycles when only one data bit is enabled. Figure 9-6 and Figure 9-7 show these two cases.

![Figure 9-6. 4-bit Data Enabled](image-url)
Link Port Communication Protocol

The transmission is initiated when LxACKI is sampled high, meaning that the receive buffer is vacant. As shown in Figure 9-8 and Figure 9-9, the first data element is driven valid before first LxCLKOUTP rising edge and the last data element is transmitted before last clock falling edge. LxCLKOUTP is driven low when the link is idle.

Figure 9-8. Link “a” Transmits a Single Quad Word to Link “b”

Figure 9-10 illustrates that Link “b” is not ready to receive more data.

The completion of a transfer and the beginning of a new one change according to the RVERE (Enable Verification) bit in the LRCTLx register and the TVERE bit in the LTCTLx register. If the TVERE bit is set, the checksum
byte is sent after the last byte in quad-word. The checksum byte is always followed by a “dummy” byte. Sending the checksum (Vx) and “dummy” (Xx) bytes lasts two cycles for a four-bit and 8 cycles for a one-bit data bus. Figure 9-11 illustrates that Link b is not ready to receive more data. In this case, the TVERE and TVERE bits are set (one-bit data).

Messages between the different elements in the system usually have a variable length. In some cases data blocks have a variable length too. This length is not always available to the receiving device and cannot be transmitted by the transmitting device. The transmitting link indicates to the receive link that the block has been completed using the LxBCMP0 output signal, which is connected to the LxBCMPT input of receiver. When the receiver recognizes this indication, it indicates to the DMA channel that the block is complete. As a result of this indication, the DMA channel

Figure 9-9. Link “a” Transmits Back-To-Back Quad Words to Link “b”

Figure 9-10. Link “a” Transmits to Link “b” (RX Fills)
Link Port Communication Protocol

The $L_{xBCMPO}$ signal indicates block completion by changing to low after first $L_{xCLKOUT}$ rising edge in the last quad-word of the block. It is set high (inactive) before the last $L_{xCLKOUT}$ falling edge of the same quad-word. See the ADSP-TS201 TigerSHARC Embedded Processor Data Sheet for further details. It is inactive if the $TBCMPE$ bit is cleared in the $LTCTLx$ register or while link transactions are executed by the processor core.

Figure 9-12 illustrates how Link “a” signals Link “b” that the current buffer has been completed.

Figure 9-11. Link “a” Transmits to Link “b” With Verification (RX Fills)
In Figure 9-13 Link “a” signals Link “b” that the current buffer has been completed. A new quad word follows the completed block.

**Link Port Transmission Delays**

The links should be able to overcome delays between the source and destination. The delays of the different link signals ($L_xCLKOUT$ and $L_xDAT3-0$) are matched to tolerances specified in the data sheet. There are two delay restrictions in the system:

- The delay of $L_xCLKOUT$ of transmitter to $L_xCLKIN$ of the receiver plus the delay of $L_xACKO$ of receiver to $L_xACKI$ of transmitter should be less than one-half quad-word transfer period.

- The difference between the trace delay of $LxBCMPO$ and the data and clock trace delay is a maximum one-third of a quad-word transfer period.

The first restriction is that the gating factor for determining distance of operation is the round-trip propagation delay $L_xCLKOUT$ of transmitter to $L_xCLKIN$ of the receiver plus the delay of $L_xACKO$ of receiver to $L_xACKI$ of transmitter. If the transmitter sends data and the receiver signals that it is
Link Port Transmission Delays

not ready to accept more data via LxACKO, the transmitter must sample LxACKI before it sends the next transmission of data. If LxACKI is not sampled in time, data is lost by the receiver.

For example, while using 4-bit mode transmission, 16 link clock cycles are required to transfer one quad-word (64 cycles are required using 1-bit mode). So, 8 link clock cycles represents a one-half quad-word transfer period using 4-bit mode transmission, and 32 link clock cycles represents a one-half quad-word transfer period using 1-bit mode transmission.

Figure 9-14. Link “a” Transmits to Link “b” With Delay
Link Port Error Detection Mechanisms

The link ports support on-the-fly error detection. When the link port detects an error condition, it operates according to the following:

1. If a time out error is detected, the TTER (Transmit Timeout Error) bit in LTSTATx or RTER (Receive Timeout Error) bit in LRSTATx is set accordingly. If the time out errors are enabled in the LRCTLx or LTCTLx registers, a hardware error interrupt is asserted. (See “Hardware Error Interrupt” on page 6-22.)

2. If a data overrun is detected on the receiver the ROVER bit in LRSTATx is set accordingly. If the receive overrun error is enabled in LRCTLx, a hardware error interrupt is asserted. (See “Hardware Error Interrupt” on page 6-22.)

3. If verification is enabled in LRCTLx and an incorrect checksum is detected, the RCSER bit in LRSTATx is set accordingly, and a hardware error interrupt is asserted. (See “Hardware Error Interrupt” on page 6-22.)

4. If an active value is written to LRCTLx or LTCTLx and the value in this control register is active, the register over write is prevented, the RWER bit is set in LRSTATx or the TWER bit is set in LTSTATx accordingly, and a hardware error interrupt is asserted. (See “Hardware Error Interrupt” on page 6-22.)

The error indication fields of LRSTATx and LTSTATx can be cleared by reading from the clear registers LRSTATCx and LTSTATCx respectively.
Link Port Error Detection Mechanisms

Link Transmitter Timeout

If the transmitter’s \( L_{xACKI} \) is deasserted for a time period of 2048 clock cycles and the transmitter has data to transmit, a timeout error occurs, and the \( TTER \) bit is set in \( LTSTATx \). \( L_{xACKI} \) has a weak pull down resistor that holds it deasserted in case it is not being driven.

Unlike previous TigerSHARC processors, the ADSP-TS201 does not need or provide an optional connectivity check for error detection because the transmit timeout and pull-down on \( L_{xACKI} \) are sufficient for detecting that a receiver is not connected.

Link Receiver Timeout

If the receiver has its receive buffer full but there is not any read access to it for 2048 clock cycles, a time-out error occurs, and the \( RTER \) bit is set in \( LRSTATx \).

Link Verification Error

The transmitter creates and sends a special verification byte on every transmitted quad word (if it is enabled). This byte is a checksum calculated from all data bytes that have been transmitted. It is sent at the end of every 16-byte (quad word) transmission. The receiver compares the transmitted verification byte to the local verification byte created during data reception. If the two bytes differ, the \( RCSER \) bit is set in the \( LSTATx \) register and a hardware interrupt is issued.

The checksum algorithm is:

\[
\text{Checksum} = \text{Least-Significant-Byte of } (B0 + B1 + \ldots + B15)
\]

where \( B0, B1 \), and so on are byte1, byte2 …
Link Ports

Link Transmit/Receive Write Error

Writing an active value into transmit or receive control register is legal only when the corresponding register is inactive. Writing an active value to an active control register does not succeed and causes a write error.

Link Port Control Registers

The ADSP-TS201 processor’s link control register is split into two control registers—receive link control register (\texttt{LRCTLx}) and transmit control register (\texttt{LTCTLx}). These registers define the flavor of the link protocol that is used in a specific application. These registers are not to be changed during link operation.

Writing to a control register an active value is allowed only when the register has an inactive value (\texttt{LREN} in \texttt{LRCTL} or \texttt{LTEN} in \texttt{LTCTL} are cleared). In order to change the setup while link is running, an inactive value has to be
written to stop operation of the link, and then the new active value should be written. Ignoring this rule shall cause a hardware error interrupt, and the new value will not be written.

Figure 9-15. LRCTLx Low (Lower) Register Bit Descriptions

1 The reset value of the RDSIZE bit depends on the value of the LINK_DWIDTH strap pin at reset. If LINK_DWIDTH=0 at reset, RDSIZE=0 (1 bit data).
2 The upper 16 bits (31–16) of LRCTLx are reserved (=0)
## Table 9-4. LRCTLx Register

<table>
<thead>
<tr>
<th>Bit</th>
<th>Field</th>
<th>Definition</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>REN</td>
<td>Link Receive Enable. When cleared, the receive channel goes to idle state. LxACKO is driven low. On reset, the REN set.</td>
<td>1</td>
</tr>
<tr>
<td>1</td>
<td>RVERE</td>
<td>Verification Enable. Enables checksum verification for received data. On reset, this bit is cleared.</td>
<td>0</td>
</tr>
<tr>
<td>2</td>
<td>RTOE</td>
<td>Time-Out Enable. Enables interrupt on timeout check for received data. On reset, this bit is cleared.</td>
<td>0</td>
</tr>
<tr>
<td>3</td>
<td>RBCMPE</td>
<td>Block Completion Enable. Enables the receiver to signal the DMA that the whole block has been received. On reset, this bit is cleared.</td>
<td>0</td>
</tr>
<tr>
<td>4</td>
<td>RDSIZE</td>
<td>Data Size(^1): 0: 1 bit 1: 4 bits For LRCTLx, on reset this bit gets its value of the strap “LINK_DWIDTH”.</td>
<td>0,1</td>
</tr>
<tr>
<td>5</td>
<td>ROVRE</td>
<td>Over-Run Enable</td>
<td>0</td>
</tr>
<tr>
<td>6</td>
<td>RINIF</td>
<td>Rx initialized state forced by software enable</td>
<td>0</td>
</tr>
<tr>
<td>7</td>
<td>RINIV</td>
<td>Rx initialized state forced value enable</td>
<td>0</td>
</tr>
<tr>
<td>31–8</td>
<td>Reserved</td>
<td>N/A</td>
<td>0</td>
</tr>
</tbody>
</table>

\(^1\) The reset value of the RDSIZE bit in LRCTLx depends on the setting for the LINK_DWIDTH strap pin.
Link Port Control Registers

LTEN Transmit Enable
When cleared, the transmit channel goes to idle state, a transmission in progress is stopped and the transmit buffer is invalidated and LxCLKOUT goes low. On reset, LTEN bit is cleared.

TVERE Verification Enable
Enables Checksum byte (and dummy byte) following transmit quad words. On reset, this bit is cleared.

TTOE Transmit Time Out check IRQ Enable
Enables interrupt on timeout check for transmitted data. On reset, this bit is cleared.

TBCMPE Transmit Block Completion
Enables the transmitter to transmit the word counter zero signal from DMA on LxBCMP0. On reset, this bit is cleared.

TDSIZE Data Size
0 = 1 bit, 1 = 4 bit. On reset, this bit is cleared.

SPD Transfer Speed.
Defines the LxCLKOUT rate as a division of CCLK
000: divide by 1
001: divide by 1.5
010: divide by 2
100: divide by 4
011: (RESERVED)
101: (RESERVED)
110: (RESERVED)
111: (RESERVED)

RESERVED (DLLV3–0)
RESERVED

Figure 9-16. LTCTLx Low (Lower) Register Bit Descriptions¹

¹ The upper 16 bits (31–16) of LTCTLx are reserved (=0)
## Link Ports

Table 9-5. LTCTLx Register

<table>
<thead>
<tr>
<th>Bit</th>
<th>Field</th>
<th>Definition</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>LTEN</td>
<td>Link Transmit Enable. When cleared, the transmit channel goes to idle state, a transmission in progress is stopped and the transmit buffer is invalidated and LxCLKOUT goes low. On reset, LTEN bit is cleared.</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>TVERE</td>
<td>Verification Enable. Enables Checksum byte (and dummy byte) following transmit quad words. On reset, this bit is cleared.</td>
<td>0</td>
</tr>
<tr>
<td>2</td>
<td>TTOE</td>
<td>Time-Out Enable. Enables interrupt on timeout check for transmitted data. On reset, this bit is cleared.</td>
<td>0</td>
</tr>
<tr>
<td>3</td>
<td>TBCMPE</td>
<td>Block Comp. Enable. Enables the transmitter to transmit the word counter zero signal from DMA on LxBCMPO. On reset, this bit is cleared.</td>
<td>0</td>
</tr>
<tr>
<td>4</td>
<td>TDSIZE</td>
<td>Data Size. 0 = 1 bit data, 1 = 4 bit data. On reset, this bit is cleared.</td>
<td>0</td>
</tr>
<tr>
<td>5–7</td>
<td>SPD2–0</td>
<td>Transfer Speed</td>
<td>0</td>
</tr>
<tr>
<td>8–10</td>
<td>Reserved</td>
<td>DLLV3–0</td>
<td>0</td>
</tr>
<tr>
<td>11–12</td>
<td>Reserved</td>
<td>N/A</td>
<td>0</td>
</tr>
</tbody>
</table>

1 See “Link Port Control Registers” on page 9-21.
The link status register is split into two status registers – receive link status register \((LRSTAT_x)\) and transmit status register \((LTSTAT_x)\). These registers show the link runtime status. These registers are read-only registers.

Figure 9-17. LRSTATx Low (Lower) Register Bit Descriptions

---

1 The upper 16 bits (31–16) of LRSTATx are reserved (=0)
### Table 9-6. LRSTATx Register

<table>
<thead>
<tr>
<th>Bit</th>
<th>Field</th>
<th>Definition</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>1–0</td>
<td>RSTAT1–0</td>
<td>Receive Status. 00: buffer empty 01: buffer valid (can still receive) 11: buffer full and data valid 10: buffer not available because of synchronization problem — data not ready (in transient temporary state)</td>
<td>0</td>
</tr>
<tr>
<td>2</td>
<td>RTER</td>
<td>Time-Out Error</td>
<td>0</td>
</tr>
<tr>
<td>3</td>
<td>RWER</td>
<td>Write Error, an active value was written to control register when previous value of control register was active.</td>
<td>0</td>
</tr>
<tr>
<td>4</td>
<td>RCSER</td>
<td>Check Sum Error</td>
<td>0</td>
</tr>
<tr>
<td>5</td>
<td>ROVER</td>
<td>Over-Run Error</td>
<td>0</td>
</tr>
<tr>
<td>6</td>
<td>RINIT</td>
<td>Receiver initialization status</td>
<td>0</td>
</tr>
<tr>
<td>31–7</td>
<td>Reserved</td>
<td>N/A</td>
<td>0</td>
</tr>
</tbody>
</table>

Transmitter Vacant
When set, the transmitter is vacant—data for transmit can be written to transmit buffer.
Note: After enabling the transmitter (LTEN bit set in LTCTLx) it takes up to 800 SOC-CLK cycles for this bit to assert.

Transmit is empty.
When set, transmit is empty and all data that has been written to the transmit buffer has been transmitted.
Note: This should be tested before disabling the channel.

Transmit timeout error indication.

Transmit write error—an active value was written to control register when previous value of control register was active

The upper 16 bits (31–16) of LTSTATx are reserved (=0)

Figure 9-18. LTSTATx Low (Lower) Register Bit Descriptions

1
Table 9-7. LTSTATx Register

<table>
<thead>
<tr>
<th>Bit</th>
<th>Field</th>
<th>Definition</th>
<th>Default</th>
</tr>
</thead>
<tbody>
<tr>
<td>0</td>
<td>TVACANT</td>
<td>Transmit Vacant. When set, the transmitter is vacant—data for transmit can be written to transmit buffer. Note: After enabling the transmitter (LTEN bit set in LTCTLx) it takes up to 800 SOCCLK cycles for this bit to assert.</td>
<td>0</td>
</tr>
<tr>
<td>1</td>
<td>TEMPTY</td>
<td>Transmit Empty. When set, transmit is empty and all data that has been written to the transmit buffer has been transmitted. Note: This should be tested before disabling the channel.</td>
<td>1</td>
</tr>
<tr>
<td>2</td>
<td>TTER</td>
<td>Time-Out Error</td>
<td>0</td>
</tr>
<tr>
<td>3</td>
<td>TWER</td>
<td>Write Error. – an active value was written to control register when previous value of control register was active</td>
<td>0</td>
</tr>
<tr>
<td>31–4</td>
<td>Reserved</td>
<td>N/A</td>
<td>0</td>
</tr>
</tbody>
</table>

1 See “Link Port Status Registers” on page 9-26.
Link Port Status Registers
10 JTAG PORT AND TEST/DEBUG INTERFACE

The functionality described in this chapter is useful for both software debugging and services usually found in operating system (OS) kernels. These features are implemented in the TigerSHARC processor hardware to allow debuggers and operating systems to build the more complex aspects of their features. The functionality is grouped into levels. These levels build upon each other to allow greater control and visibility of the TigerSHARC processor for the debugger or kernel.

The debugging approach is based on the idea of protected execution (user mode) and unprotected execution (supervisor mode). In user mode, some of the system resources can be made accessible. However, when those resources must be managed by an OS kernel, they are restricted. When user code tries to access a protected resource, an exception occurs and the access is aborted.

The debugger can control the TigerSHARC processor in one of two ways:

- Using a debug monitor (running in supervisor mode)
- Using an In-circuit emulator (ICE)

In supervisor mode, some system resources are allocated to the monitor. (For example, memory, communication channel-to-host, interrupt, flag bits, and so on.)

The ICE, by comparison, uses a reserved communication channel (the JTAG test access port) to control the TigerSHARC processor. This way the ICE can be non-intrusive to the system, as well as control systems that cannot communicate with a host system.
As shown in Figure 10-1, the JTAG port is a bus slave on the SOC bus. It is important to note that debug register accesses must cross a number of clock domains when reading or writing register data. Corresponding response for emulator software exceptions may vary with SOC bus loading. Because the emulator interrupt (an interrupt input to the interrupt unit) has a direct connection to the program sequencer, response to latched emulator interrupts is not affected by SOC bus loading. For more information on clock domains, see “SOC Interface” on page 3-1.

Since many of the functions needed by the debug monitor and the ICE overlap, most of the debug functions in the TigerSHARC processor are shared between the two methods. There are some features that need to be optionally restricted by the ICE so that the debugger can be used in supervisor mode.

Debug capabilities are available either through the ICE or through the supervisor mode. As already mentioned, the ICE mode is accessed through the JTAG port. The supervisor mode is entered via an interrupt, an exception, or a specific software watchpoint. When the TigerSHARC processor is in supervisor or emulation mode, it has complete access to all of the TigerSHARC processor’s resources.
JTAG Port and Test/Debug Interface

Figure 10-1. JTAG Port and SOC Bus Connections
Operating Modes

Table 10-1 details debug concepts and capabilities.

Table 10-1. Debug Concepts

<table>
<thead>
<tr>
<th>Debug Tools</th>
<th>Debug Feature</th>
</tr>
</thead>
<tbody>
<tr>
<td>Watchpoints</td>
<td>Allows a user to specify particular watchpoint address ranges and conditions that halt the processor when those conditions are met. The user can then examine the state of the processor, restore that state, and continue instruction execution. Software breakpoints and single step are implemented using watchpoint as well.</td>
</tr>
<tr>
<td>Multiprocessing Capability</td>
<td>Allows a user to single step and set breakpoints in a multiprocessor system.</td>
</tr>
<tr>
<td>Trace History</td>
<td>Allows a trace of the program counter to be recreated. An on-chip trace buffer (FIFO) stores the last eight branches (all discontinuous values of the program counter) taken by the program sequencer, allowing the recent program path to be recreated.</td>
</tr>
<tr>
<td>Cycle Count</td>
<td>Provides the basic functionality for all code profiling functions.</td>
</tr>
<tr>
<td>Performance Monitoring</td>
<td>Allows internal resources to be monitored unobtrusively. The debugger can specify which internal resource(s) should be monitored.</td>
</tr>
<tr>
<td>Access to Protected Registers</td>
<td>Provides access to protected registers. Protected registers are only accessible in supervisor mode or ICE. They cannot be accessed by user mode and are protected from accidental modifications that could cause system damage.</td>
</tr>
<tr>
<td>Watchpoint Counters</td>
<td>Allows the user to search for the nth occurrence of an event before halting.</td>
</tr>
<tr>
<td>Exception</td>
<td>Aborts execution of the next and the following piped instructions. For more information, see &quot;Handling Exceptions&quot; on page 6-15.</td>
</tr>
</tbody>
</table>

Operating Modes

The TigerSHARC processor operates in one of three modes—emulator, supervisor, and user. In user and supervisor modes, all instructions are executed normally. In user mode, however, the register access is limited. In emulation mode, none of the control flow instructions (excluding if
JTAG Port and Test/Debug Interface

True, RTI (np);) may be used—control flow instructions may cause problems and it is illegal to use them in emulation. Operating modes are discussed in “Operation Modes” on page 1-18.

Debug Resources

The processor provides a number of debug resources, including:

- "Special Instructions" on page 10-5
- "Watchpoints" on page 10-5
- "Instruction Address Trace Buffer (TBUF)" on page 10-8

Special Instructions

The TigerSHARC processor supports special instructions (traps) which are used to aid system debugging. These instructions are required to implement both software breakpoints (in debuggers) and operating system calls (for OS kernels).

Watchpoints

Address watchpoints allow the user to examine the state of the processor whenever a set of user-defined memory access conditions occurs.

There are three watchpoint sets that can operate in parallel. At each watchpoint, the user can define conditions for bus access cycles and the operation the TigerSHARC processor should execute when these occur.

Programming – Control and Address Pointer Registers

Each watchpoint has two address registers—WPIL and WPIH. See “Watchpoint Address Pointer (WPxL/WPxH) Registers” on page 2-89 for more information. When the watchpoint is programmed for a single address,
only the low address register (WPIL) is used. If the watchpoint is searching for an address range, the WPIL holds the low address, and the WPIH holds the high address.

It is important to set the Address Pointer registers first (before the WPiCTL register) which define the actual watchpoint operation. The address registers of a watchpoint cannot be changed when the watchpoint is active.

A watchpoint operation is defined by its Control register—WPiCTL (where i is 0, 1, or 2 for different watchpoints). Figure 2-46 on page 2-86, Figure 2-47 on page 2-87, and Figure 2-48 on page 2-88 show the configuration of the WPiCTL registers.

Watchpoint Operation

The following information must be programmed into the Watchpoint registers before a search can begin:

- Address or address range
- Count

When the control register is set to a non-idle operation, the count field is set to the initial count. After the count field is reset, the watchpoint hardware monitors the internal buses for a transaction address matching what is specified in the watchpoint address registers.

Whenever there is a match, the counter is decremented.

After the counter reaches zero, an exception is initiated—either a software exception or an emulator trap according to Bits3–2 in the EXtype field in WPiCTL. At this point, the Status register indicates 'watchpoint count expired'. To begin another search, the user must rewrite the control register.
JTAG Port and Test/Debug Interface

The three watchpoints are associated with different buses. *Watchpoint 0* is used to monitor instruction fetches on the I-bus. Watchpoint 0 supports a single step option (SSTP) which is used by emulation to perform single step operations. *Watchpoint 1* is used to monitor data accesses over the J- and K-buses. Watchpoint 1 supports detection of accesses by either J, K, or both. Watchpoint 1 also supports detection of read accesses only, write accesses only, or both. *Watchpoint 2* is used to monitor data accesses over the S-bus. Watchpoint 2 supports detection of read accesses only, write accesses only, or both.

**Watchpoint Status (WPiSTAT)**

The *WPiSTAT* register indicates the current status of the search. It holds the current count of the search and the mode the watchpoint is executing. When the counter expires, the watchpoint status Bits17–16 change to 11.

![Figure 10-2. WPxSTAT (Upper) Register Bit Descriptions](image)

The lower 16 bits of WPxSTAT (not shown) are the watchpoint current count.

**Table 10-2. WPiSTAT Register Bit Descriptions**

<table>
<thead>
<tr>
<th>Bits</th>
<th>Name</th>
<th>Description</th>
</tr>
</thead>
</table>
| 15–0  | VALUE  | Watchpoint Value  
Determines the current value of the watchpoint counter. |
| 17–16 | EX    | Execution Bit  
Watchpoint is executing  
00 – Watchpoint disabled  
01 – Searching for match  
11 – Watchpoint count expired |
| 31–18 | Reserved | Reserved |
Performance Monitors

Instruction Address Trace Buffer (TBUF)

Since the program counter is not accessible in real time off chip when executing instructions from internal memory, a trace buffer has been provided to assist code debugging.

The trace buffer is an eight register list that stores a history of the last four to eight branches taken by the program sequencer, allowing the user to fully recreate the path the program took for the last eight branches.

The branches are written into the eight Trace Buffer registers in a circular manner. The first is written into TRCB0, the next into TRCB1, and so on until TRCB7. The ninth branch is written back into TRCB0, and so on. In order to ascertain the last register, the user must refer to the Trace Buffer pointer register, which points to the last written entry.

The trace buffer always holds the PC of the branch instruction line. In case of a computed branch, it also stores the target PC of the branch on the next entry. To distinguish between the jump PC and the target PC, the TRCBMASK should be used. Every bit in the TRCBMASK is associated with the corresponding TRCBi register. If it is cleared, the TRCBi holds the PC of the jump instruction line. If set, the TRCBi holds the target PC of the jump.

Performance Monitors

The performance monitors are targeted to optimize the functioning of a working application. The monitors provide indications of how many times an event occurs during a specific run. These monitors are constructed of three registers—cycle counter, performance monitor mask, and performance monitor counter. The cycle counter and performance monitor should be initialized to zero by the user before a specific run. The ratio between the total count and the performance monitor counter gives the required indication.
When the performance monitor counter expires, it issues an exception.

**Cycle Counter (CCNT1–0)**

The Cycle Counter (CCNT1–0) counts cycles while the program is executing. All cycles are counted. The cycle counter is a 64-bit counter that increments with every cycle, including executions in both user and supervisor modes. All hold, wait state, aborted instructions, and other delay cycles are also counted.

When the TigerSHARC processor enters emulator mode, the cycle counter halts, since it is irrelevant.

**Performance Monitor Mask – PRFM**

The Performance Monitor Mask is a 32-bit register that defines the events that are monitored by the performance counters. Every bit in the Performance Monitor refers to an event that reflects the application performance (for example, the number of stall cycles in the system). See “Performance Monitor Mask (PRFM) Register” on page 2-89.

When only one bit in the PRFM register is set, the feature indicated by this bit is counted while the TigerSHARC processor is running in user and supervisor mode. The count is halted when the TigerSHARC processor is in emulation mode. For certain events (for example, compute block instructions or bus transactions), the counter may compute more than one transaction/instruction every cycle, according to the event occurrence.

When more than one bit is set in the mask, the count is according to the SUM bit (Bit 31). If the SUM bit is clear, and more than one bit in the Performance Mask is set, PRFCNT counts every cycle where one of the events occurs. The counter, however, does not sum the events but ORs them. If the SUM bit is set, the events are summed. In this case, on a cycle where the compute block (X or Y) executes two instructions, the counter is incremented by two, and on bus transactions (Bits14–12) the counter could be incremented by three.
Performance Monitors

Where the \texttt{SUM} bit is set, there are restrictions on the different events that can be combined. The set bits must be a subset of one of the following groups.

- Bits11–0 – granted or non granted bus requests
- Bits20–12 – transactions per bus
- Bits29–21 – instructions

The Performance Monitor Mask is disabled by clearing all bits in the mask.

Performance Monitor Counter

The Performance Monitor counts the events according to the programmed mask. It counts the events while the TigerSHARC processor is in user or supervisor mode.
JTAG Port and Test/Debug Interface

JTAG Functionality

The TigerSHARC processor supports the IEEE Standard 1149.1 Test Access Port.

JTAG Port I/O Pins

Table 10-3 describes those TigerSHARC processor pins used by JTAG emulation hardware. Underlined or overbarred pin names indicate negative true signals.

Table 10-3. JTAG and Emulation I/O Pins

<table>
<thead>
<tr>
<th>Signal</th>
<th>Type</th>
<th>Description</th>
</tr>
</thead>
</table>
| TRST   | Input Asynchronous | Test Reset (JTAG)  
Resets the test state machine. The TRST signal must be asserted after power up to ensure proper JTAG operation. The TRST signal has a 40 kΩ internal pull-up resistor. |
| TCK    | Input         | Test Clock (JTAG)  
Provides an asynchronous clock for JTAG boundary scan. |
| TDI    | Input         | Test Data Input (JTAG)  
A serial data input of the boundary scan path. This signal has a 40 kΩ internal pull-up resistor. |
| TDO    | Output        | Test Data Output (JTAG)  
A serial data output of the boundary scan path. |
| TMS    | Input         | Test Mode Select (JTAG)  
Controls the test state machine. This signal has a 40 kΩ internal pull-up resistor. |
| EMU    | Output        | Emulation  
Connected to the ADSP-TS201 DSP EZ-ICE target board connector only. |
JTAG Functionality

As an extension to the JTAG standard, the TMS and EMU pins have these additional debugging functions:

- The TMS pin (when TEME bit in the EMUCTL register is set) functions as an emulation exception pin. An emulation exception is issued on every rising edge of the TMS signal. This function is in addition to TMS functionality in JTAG.

- The EMU output is an open drain signal (for wired OR) driven only if the EMUOE bit in the EMUCTL register is set. The EMU pin gives the following indications:
  - On watchpoint event no exception (EXtype field in WPiCTL) is 00, the EMU is pulled down for 20 CCLK cycles.
  - Whenever the TigerSHARC processor is in emulation mode and ready for a new instruction line driven into the EMUIR register, the EMU pin is driven low until such an instruction is inserted. For more information, see "Emulator Mode" on page 1-20.
JTAG Port and Test/Debug Interface

JTAG Instruction Register

The JTAG Instruction register is five bits long and allows the Test Access Port (TAP) controller to select any of the scannable data registers. The LSB is the first to be shifted out through the TDO. The encoding of this Instruction register follows the TigerSHARC processor implementation.

Table 10-4. JTAG Port Access Instruction Decode

<table>
<thead>
<tr>
<th>IF IR4–0 Value is...</th>
<th>Corresponding Instruction is...</th>
<th>To...</th>
</tr>
</thead>
<tbody>
<tr>
<td>00000</td>
<td>EXTEST</td>
<td>Execute EXTEST</td>
</tr>
<tr>
<td>10000</td>
<td>Sample/Preload</td>
<td>Execute sample and preload</td>
</tr>
<tr>
<td>01000</td>
<td>EMUIR</td>
<td>Scan in EMUIR register</td>
</tr>
<tr>
<td>10100</td>
<td>EMUDAT</td>
<td>Scan EMUDAT register</td>
</tr>
<tr>
<td>01110</td>
<td>IDCODE</td>
<td>Scan ID code register</td>
</tr>
<tr>
<td>11110</td>
<td>SAMPLEPC</td>
<td>Sample PC from which the TigerSHARC processor is running</td>
</tr>
<tr>
<td>10001</td>
<td>EMUTRAP</td>
<td>Execute an emulation trap</td>
</tr>
<tr>
<td>10011</td>
<td>OSPID</td>
<td>Scan OSPID register</td>
</tr>
<tr>
<td>00100</td>
<td>EMUCTL</td>
<td>Scan EMUCTL register</td>
</tr>
<tr>
<td>01100</td>
<td>EMUSTAT</td>
<td>Scan EMUSTAT register</td>
</tr>
<tr>
<td>11111</td>
<td>BYPASS</td>
<td>Bypass</td>
</tr>
</tbody>
</table>

Data Registers

The Data registers are selected via the Instruction register. Once a particular Data register’s value is written into the Instruction register, and the TAP state is changed to SHIFT-DR, the particular data going in and out of the TigerSHARC processor depends on the definition of the Data register selected. See the IEEE 1149.1 specification for more details.
When registers are scanned out of the device, the LSB is the first bit to go out of the TigerSHARC processor.

The data registers are:

- **Bypass**
  
  A IEEE 1149.1 required register, this is a one (1) bit register.

- **Boundary**
  
  Registers used by multiple JTAG instructions. All three of the JTAG instructions that use the boundary are required by the IEEE 1149.1 specification.

- **EMUIR**
  
  The JTAG instruction loads 48 bits, of which the 32 LSBs are loaded into the EMUIR register. The EMUIR is a 128-bit register. In emulation mode the register is loaded four times in a sequence (with the same JTAG instruction) or is loaded until one word has the EX bit set (then the rest of the EMUIR slots are considered NOPs); and each time the data is loaded into the next word in EMUIR—Bits31–0 are loaded first, next Bits63–32 are loaded, and so on.

  When the four EMUIR loads have completed (or the instruction line was completed with the EX bit set), the instructions that were loaded to the JTAG are fed into the sequencer and executed.

  When a quad-word is loaded into EMUIR, it should include one full instruction line. The instruction line must not continue to the next quad-word. In this case the last slots have to be filled with “no operation” (NOP) instructions. Except for the restriction of not
crossing the quad-word boundary, this is similar to the assembler coding described in “Instruction Set” in the ADSP-TS201 TigerSHARC Processor Programming Reference.

Writing to EMUIR when the TigerSHARC processor is not in emulator mode is an illegal operation.

- EMUDAT

This is a 48-bit register whose 32 MSBs are used to scan the Ureg of the same name. The emulator uses this register to read and set all other registers in the TigerSHARC processor.

The emulator also uses this register to access memory. There must be an instruction that uses the IALUs to generate the address for the memory access whose target or source is EMUDAT. When the emulator executes a series of memory accesses, the TigerSHARC processor should insert hold cycles to allow all of the accesses to complete (just as it does normally).
JTAG Functionality
11 SYSTEM DESIGN

This chapter provides information that is useful for developing ADSP-TS201 TigerSHARC processor based systems and applications. There are descriptions of how to use the TigerSHARC processor in single processor implementations, in multiple TigerSHARC processor designs, and in testing scenarios. Hardware, software, and system design information are included.

When engineers are designing a system that includes the ADSP-TS201 TigerSHARC processor, they need a number of documentation resources including manuals, data sheets, and engineer-to-engineer notes. Many of these documents are identified in the reference sections:

- “Booting References” on page 11-8
- “Hardware Design References” on page 11-17
- “Thermal Design References” on page 11-27
- “Other Design and Test References” on page 11-28
- “Recommended Reading References” on page 11-31

Included in both Analog Devices manuals and data sheet are references of source materials, such as Engineer-to-Engineer Notes or Application Notes, available on the Internet. The documents listed here are considered to be essential to any design involving the ADSP-TS201 TigerSHARC processor and are maintained in the form of Engineer-to-Engineer notes due to the dynamic nature of this information and the capability to update these documents more easily whenever it may become necessary.
Processor Booting Methods

The application and engineer-to-engineer notes from Analog Devices that are mentioned in this chapter are available at:

http://www.analog.com/dsp/appnotes

Processor Booting Methods

This section explains the functional operation booting methods and provides a high-level overview of available boot loader kernels (booting software) for the ADSP-TS20x TigerSHARC processors.

This section includes information on boot kernels for ADSP-TS201S and ADSP-TS202S processors. Boot kernels for ADSP-TS203S processors form a subset of the discussed functionality because this processor has only two link ports and a 32-bit external bus. Except for these restrictions, the information in this section applies to all ADSP-TS20x processors.

It is important to note that the information in this section, while considered complete and accurate when printed, may be updated as new information becomes available. Please use the information in any versions of ADSP-TS20x TigerSHARC Processor Boot Loader Kernels Operation (EE-200) engineer-to-engineer note that are published after the publication date of this book.

After reset, the ADSP-TS20xS has four boot options for beginning operation: EPROM boot, host boot, link port boot, and no boot. Systems select among these options using strap mode pins. After the boot mode is selected, the processor boot loads (if in a master mode) or is boot loaded (if in a slave mode) with 256 words of boot data. Typically, this boot data contains the boot kernel for the selected mode. After this initial load is complete, the boot kernel executes, loading the developer’s program into the processor and starting execution of that program.
Using Boot Modes and Boot Loader Kernels

A loader kernel is a program executed by the processor that is appended to user application code by the elfloader utility (elfloader.exe) of the VisualDSP++® development tools. The processor executes the loader kernel at boot time, allowing the processor to initialize its internal and external memory sections defined in the application code.

The loader kernel is a self-modifying program that is transferred into the processor’s internal memory. The ADSP-TS20x family of processors supports three booting methods: EPROM booting (via the external port), host booting (via an external host processor or another ADSP-TS20x processor), and link booting (via the processor’s link ports). VisualDSP++ includes three distinct loader kernels that support each of the processor’s booting modes. Additionally, there are several no-boot modes, which do not require kernels.

To understand the booting process for each of the boot modes in further detail, please refer to the ADSP-TS20x TigerSHARC Processor Boot Loader Kernels Operation (EE-200) engineer-to-engineer note.

Executing a Processor Boot Operation

The booting mode is selected by the processor’s BMS pin. While the processor is held in reset, the BMS pin is an active input. If BMS is sampled low a certain number of SCLK cycles after reset, EPROM boot mode is selected; a number of SCLK cycles after this, the BMS pin becomes an output and serves as the EPROM chip select. If BMS is sampled high instead, the ADSP-TS20x processor is in an idle state, waiting for a host boot or a link port boot to occur. The exact timing for sampling BMS boot strap and following driving of BMS is provided in the processor’s data sheet.
Processor Booting Methods

At reset, a weak internal pull-down resistor is on the BMS pin. Depending upon the external line loading on this pin, this pull-down resistor may not be sufficient. You may need to add an external pull-down resistor to select EPROM booting mode. If host or link boot is desired, BMS must be held high during and after reset and may be tied directly to VDD_IO, provided it is never used as a chip select.

Each booting method is described in detail in the following sections.

**EPROM Boot**

Booting the processor from an EPROM is a master boot mode—a boot mode in which the TigerSHARC processor starts and controls the external data fetch process.

The ADSP-TS20xS processor defaults to EPROM booting depending on the value of the BMS strap pin. When the processor is configured to boot from EPROM, BMS is active during the boot sequence and should be connected to the chip select signal of the EPROM. For additional information refer to the BMS strap pin section.

When EPROM boot mode is selected, the ADSP-TS20x processor initializes its external port DMA channel 0 to transfer 256 32-bit words of code from the boot EPROM into memory block 0 locations 0x00-0xFF of the ADSP-TS20x processor. The corresponding interrupt vector (for DMA channel 0) is initialized to internal memory address 0x00. Upon completion of the DMA, the ADSP-TS20x processor continues program execution from address location 0x00. These 256 words of code act as a boot loader to initialize the rest of the ADSP-TS20x processor’s memory. The VisualDSP++ development tools provide a default boot loader kernel source file (TS201_prom.asm). For more information on this boot kernel, see the *ADSP-TS20x TigerSHARC Processor Boot Loader Kernels Operation* (EE-200) engineer-to-engineer note.
Host Boot

Booting the processor from a host is a *slave boot mode*—a boot mode in which the TigerSHARC processor expects an external device to start and control the placement of code in the processor’s memory.

The ADSP-TS20xS processor supports booting from an external master (host or another ADSP-TS20xS). Any master on the cluster bus can boot the ADSP-TS20xS through writes to its internal memory or through autoDMA.

To configure the processor for host boot, place a pullup resistor between BMS and $V_{DD, IO}$. For resistor value information, see the *ADSP-TS201S TigerSHARC Embedded Processor Data Sheet*.

When a host or link boot mode is selected, the ADSP-TS20x processor enters an idle state after reset, waiting for the external host processor or link port to boot it. Host booting uses the ADSP-TS20x processor’s AUTODMA registers (either channel), both of which are initialized to transfer 256 words of code to block 0, locations 0x00-0xFF of the ADSP-TS20x processor’s internal memory. The corresponding interrupt vector is initialized to point to address 0x00. Thus, upon completion of the DMA, the ADSP-TS20x processor continues execution at memory location 0x00. These 256 words of code act as a boot loader to initialize the rest of the ADSP-TS20x processor’s memory. The VisualDSP++ development tools supplies a default boot loader, named *TS201_host.asm*. For more information on this boot kernel, see the *ADSP-TS20x TigerSHARC Processor Boot Loader Kernels Operation (EE-200)* engineer-to-engineer note.

An example code of a ADSP-TS20x processor acting as a master, host booting another ADSP-TS20x processor is provided with the examples for the ADSP-TS201S EZ-KIT Lite™ development system.
Link Port Boot

Booting the processor from a link port is a \textit{slave boot mode}\textemdash a boot mode in which the TigerSHARC processor expects an external device to start and control the placement of code in the processor’s memory.

All four receive link port DMA channels are initialized after reset to transfer a 256-word block to internal memory addresses 0 through 255, and to issue an interrupt at the end of the block (similar to an external port DMA). The corresponding DMA interrupts are set to address zero. For additional information refer to the \texttt{BMS} and \texttt{TMROE} strap pin sections.

To configure the processor for Link Port boot place a pullup resistor between \texttt{BMS} and \texttt{VDD\_IO}. For resistor value information, see the \textit{ADSP-TS201S TigerSHARC Embedded Processor Data Sheet}.

When a host or link boot mode is selected, the ADSP-TS20x processor enters an idle state after reset, waiting for the host or link port to boot it. A link boot can use any of the ADSP-TS20x processor’s link ports, all of whose DMAs are initialized to transfer 256 words of code to ADSP-TS20x processor’s memory block 0, locations 0x00-0xFF. The corresponding DMA interrupt vectors are initialized to 0. Thus, upon completion of the link DMA, the ADSP-TS20x processor continues execution at location 0x00. These 256 words of code act as a boot loader to initialize the rest of ADSP-TS20x processor’s memory. Analog Devices supplies a default boot loader, \texttt{TS201\_link.asm}, with the VisualDSP++ tools set. For more information on this boot kernel, see the \textit{ADSP-TS20x TigerSHARC Processor Boot Loader Kernels Operation (EE-200)} engineer-to-engineer note.

No Boot

Starting the processor in no boot is a \textit{master boot mode}\textemdash a boot mode in which the TigerSHARC processor starts and controls the external data fetch process. In no boot mode, the TigerSHARC processor starts from an \texttt{IRQ} vector (externally or internally) fetching data.
The ADSP-TS20xS processor begins execution from the memory address selected with one of the IRQ3–0 interrupt signals. Using the ‘no boot’ option, the ADSP-TS20xS starts running from memory when one of the interrupts is asserted. For additional information refer to the BMS and BM strap pin sections.

To configure the processor for No boot (boot from memory address) place a pullup resistor from BM to VDD_IO and place a pullup resistor from BMS to VDD_IO. For resistor value information, see the ADSP-TS201S TigerSHARC Embedded Processor Data Sheet.

When a host or link boot mode is selected, the ADSP-TS20x processor enters an idle state after reset, waiting for the host or link port to boot it. It does not have to be booted by the host or a link port. If external interrupts IRQ3–0 are enabled (selected at reset by the IRQEN strap pin), they can be used to force code execution according to the default interrupt vectors, as shown in Table 11-1.

In this scenario, another device, such as a host processor, initializes the appropriate memory with the appropriate code and then forces the corresponding interrupt.

Table 11-1. IRQ3–0 Default Interrupt Vectors

<table>
<thead>
<tr>
<th>Interrupt</th>
<th>Address</th>
</tr>
</thead>
<tbody>
<tr>
<td>IRQ0</td>
<td>0x30000000 (MS0)</td>
</tr>
<tr>
<td>IRQ1</td>
<td>0x38000000 (MS1)</td>
</tr>
<tr>
<td>IRQ2</td>
<td>0x80000000 (MSH)</td>
</tr>
<tr>
<td>IRQ3</td>
<td>0x00000000 (Internal Memory)</td>
</tr>
</tbody>
</table>

Another no-boot method of starting up the ADSP-TS20x processor is for a host to initialize a memory buffer with code and then force that code execution via a vector interrupt.
Many intricate details must be addressed when the chip is started up. The standard supplied boot loader kernels take care of those details, so system start up based on those loaders is highly recommended.

Booting References

The following documents from Analog Devices provide information that is helpful for developing a system boot process.

1. ADSP-TS20x TigerSHARC Processor Boot Loader Kernels Operation (EE-200), Analog Devices, Inc.

2. Considerations for porting code from the ADSP-TS101S TigerSHARC processor to the ADSP-TS201S TigerSHARC processor (EE-205), Analog Devices, Inc.


5. ADSP-TS201S TigerSHARC Embedded Processor Data Sheet. Analog Devices, Inc.

Hardware System Design Guidelines

This section discusses specific hardware issues when implementing a system design, which incorporates any of the ADSP-TS20xS TigerSHARC processors. This document is provided as an aid to hardware engineers for
designing systems with silicon revision 1.0 and higher. All guidelines provided in this section apply to the ADSP-TS201S, ADSP-TS202S, and ADSP-TS203S TigerSHARC processors.

It is important to note that the information in this section, while considered complete and accurate when printed, may be updated as new information becomes available. Please use the information in any versions of *ADSP-TS20xS TigerSHARC System Design Guidelines* (EE-179) that are published after the publication date of this book.

It is important to run PCB signal integrity analysis for all signals in a single or multiprocessor ADSP-TS20xS based systems.

**Power Supplies**

The ADSP-TS20xS processor has four power supply domains $V_{DD}$ (Internal), $V_{DD_A}$ (Analog PLL), $V_{DD_IO}$ (External I/O) and a $V_{DD_DRAM}$ (DRAM) domain. The $V_{DD_A}$ supply is a filtered version of the $V_{DD}$ supply.
Hardware System Design Guidelines

The ADSP-TS20xS processor requires bypass capacitors on each supply. In many cases it is difficult to place lots of supply bypass capacitors close to the package pins, especially on the bottom side of the PCB board. It is recommended that PCB designers prioritize decoupling capacitor placement in the following order:

- Low-ESR/low-ESL capacitors are recommended for proper bypassing. In some cases, performing SPICE analyses of the power supply filtering characteristics may be necessary.

- Enough “bulk” capacitors must be used to prevent power supply ripple that exceeds maximum or minimum power supply tolerances caused by current transients in the system. Several parallel electrolytic and/or tantalum capacitors are preferred in order to minimize ESR and to provide sufficient capacitance.

For power supply decoupling specific to each supply, please see the most current versions of ADSP-TS20xS TigerSHARC System Design Guidelines (EE-179).

System Clock (SCLK) Pin

The SCLK design and distribution are very important for reliable system operation. In addition to the SCLK considerations in the following sections, see the ADSP-TS20xS TigerSHARC Embedded Processor Data Sheet and ADSP-TS20xS TigerSHARC System Design Guidelines (EE-179) for additional guidance.

Considerations for SCLK Design

Careful analysis is required when choosing components for generating, buffering and distributing the SCLK signals on a PCB. Refer to the ADSP-TS20xS data sheet specification for SCLK input jitter requirements.
Single-stage or dual-stage clock tree designs are typically used to create a clock distribution network. Figure 11-1 shows a couple of examples of these types of designs.

![Clock Generation Examples](image)

In most instances single stage clock designs provide lower jitter specifications and tighter duty-cycle control than dual or multi-stage clock designs. It is very important to simulate all designs, however dual and multi-stage designs require special attention when analyzing total jitter (OSC jitter & BUF jitter) and duty cycle impact. Designers should review manufacturer data sheets and application notes before choosing oscillators, crystals and clock driver components to ensure they meet the jitter and duty cycle requirements for the SCLK of the ADSP-TS20xS.

Other factors to consider: When selecting components, the output-to-output skew between various clock buffer outputs should be as small as possible to ensure high speed operation of the external bus interface. Make sure the output rise and fall times of clock drivers are symmetrical. Review power supply grid and supply decoupling for all clock generation components. Signal integrity analysis should be run on all clock signals to ensure no external coupling and they meet or exceed the SCLK specifications.

**Considerations for SCLK Distribution**

In single and multiprocessor designs careful clock design and distribution is required to ensure proper and full-speed internal and external operation.
Hardware System Design Guidelines

Listed below are some guidelines for clock distribution.

- PCB connections should be point-to-point from the clock buffer output to all clock inputs. Trace lengths should be matched to minimize skew.
- Capacitance on all clock signals should be matched.
- Minimize the number of PCB vias.
- Maintain same number of vias on each clock signal.
- Do not run clock signals close to other signals on same layer.
- Do not run any signals directly above or below the clock signals.
- Use a high quality low jitter clock source for generating the clock reference.
- Use a low-jitter clock buffer driver (see jitter specification in the data sheet).
- Use a low output-to-output skew clock buffer driver.
• All clock signals from the clock buffer outputs to the SCLK inputs should be carefully reviewed.

• A single multiple-output clock buffer should be used to drive the clock signals to all devices including DSPs, FPGAs, ASICs and Memories (see Figure 11-2). Using multiple clock buffer chips increases the clock-to-clock skew between clock signals and is not recommended (see Figure 11-2 on page 11-13).

Figure 11-2. Recommended Clock Distribution Method
Hardware System Design Guidelines

Figure 11-3. Not Recommended Clock Distribution Method
Boundary Scan and Emulator Pins

The ADSP-TS20xS has six pins associated with the boundary scan and emulator interface. The pins, EMU, TCK, TDI, TDO, TMS, and TRST should be connected to a boundary scan pod connector if the ADSP-TS20xS emulator is used.

The Analog Devices family of emulators are tools which aid developers when testing and debugging a hardware and software system. Analog Devices has supplied an IEEE 1149.1 compliant Joint Test Action Group (JTAG) Test Access Port (TAP) on each JTAG TigerSHARC processor. The emulator uses the TAP to access the internals of the TigerSHARC processor, allowing the developer to load code, set breakpoints, observe variables, observe memory, examine registers, and so on.

For details on designing the JTAG scan path on your target system, refer to ADSP-TS201 TigerSHARC Embedded Processor Data Sheet and Analog Devices Engineer-to-Engineer Note EE-68 Analog Devices JTAG Emulation Technical Reference (2.4), available on the Analog Devices Internet site: http://www.analog.com.

External Ports Pins

In a single-processor system, the ID2-0 pins of the single processor must be set to “000”. In a multi-processor system, the processor IDs must be uniquely assigned starting from “000” up to “111”; a single TigerSHARC...
Hardware System Design Guidelines

can gluelessly support up to 8 DSPs. For both single and multiple processor topologies, it is imperative to include processor ID2-0 = “000" in the system, since this processor supports the following features upon reset:

- Has active internal pull-ups or pull-downs on certain external signals when ID2-0 = “000" (DSP ID 0). See ADSP-TS201S TigerSHARC Embedded Processor Data Sheet for details.

- Is the default bus master, and can therefore provide active bus arbitration signals to an external host processor.

- Has an on-chip SDRAM controller, which provides an MRS sequence to any external SDRAM present in the system.

If there is an external host on the cluster bus and common data are shared between the host and the TigerSHARC processor(s), the endianess on both sides must be matched to each other. Note, the TigerSHARC processor is only little endian and does not support big endian.

The TigerSHARC processor’s addressing is word-oriented (32-bit). Some host processors’ addressing is byte-oriented. Therefore, for connecting to these processors the least-significant bit of the TigerSHARC processor’s address bus should be connected to the third least-significant bit of the host processor’s address bus, regardless if a 32-bit or 64-bit bus width is specified.

The address and data busses may float for several cycles during bus-mastership transitions between a TigerSHARC processor and a host. Floating in this case means that these inputs are not driven by any source. The ADSP-TS20xS contains internal pull-up resistors to ensure busses don’t float under these conditions.
System Design

Link Ports Pins

The ADSP-TS201S and ADSP-TS202S contain four full duplex Link port whereas the (ADSP-TS203S) contains only two full duplex Link Ports. Each link port’s receive and transmit sections operate independently and may or may not be used or connected to other link partners. If link ports are used in the system, all required link port pins must be connected between link partners. It is important to consider link port LVDS signal sensitivity in PCB design. For important guidance on PCB design for link port LVDS signals, see *ADSP-TS20xS TigerSHARC System Design Guidelines* (EE-179).

Hardware Design References

The following documents from Analog Devices provide information that is helpful for system design.

1. ADSP-TS201S TigerSHARC Embedded Processor Data Sheet, Analog Devices, Inc.


5. Thermal Relief Design for the ADSP-TS201S TigerSHARC Processor (EE-182), Analog Devices, Inc.


7. ADSP-TS201S TigerSHARC Processor Hardware Reference, Analog Devices, Inc.
Processor Thermal Relief Methods

This section discusses thermal relief design considerations for Analog Devices ADSP-TS20xS TigerSHARC processors. This information assists PCB and hardware system designers by providing thermal data as well as heat sink recommendations to allow for proper design of their thermal relief system.

⚠️ It is important to note that the information in this section, while considered complete and accurate when printed, may be updated as new information becomes available. Please use the information in any versions of Thermal Relief Design for the ADSP-TS201S TigerSHARC Processor (EE-182) that are published after the publication date of this book.

This section discusses the following topics:

- Thermal overview
- Thermal calculations
- Heat sink basics
- Heat sinks: pin fins vs. rectangular-fins
- Heat sink recommendations
System Design

- PCB design for thermal dissipation
- Thermal simulations
- Alternate thermal relief solutions
- Terminology

Thermal Overview

Proper thermal management is required to ensure that the processor operates within the temperature specifications provided in the *ADSP-TS201S TigerSHARC Embedded Processor Data Sheet*. Operating within the specified temperature range ensures proper processor operation and reliability.

The overall power estimation can also be used to estimate a thermal relief budget for the processor. Equation 11-1 gives a value for the total core thermal weighted average power. Note that Equation 11-1 yields the total core thermal weighted average power consumption for a single ADSP-TS20xS in a given system. Guard-banding this value is recommended for a thermal relief design that allows the system to operate within specified thermal parameters, even under worst-case conditions.

Equation 11-1. Total Core Thermal Weighted Average Power Calculation

\[ P_{\text{Thermal}} = P_{\text{DD (average)}} + P_{\text{DD\_IO (average)}} + P_{\text{DD\_DRAM (average)}} \]

For more information on power consumption for the ADSP-TS20x processor, please refer to the *Estimating Power for ADSP-TS201S TigerSHARC Processors* (EE-170) engineer-to-engineer note, which can be found on the Analog Devices Web site, [http://www.analog.com](http://www.analog.com).
Processor Thermal Relief Methods

After thermal calculations have been completed, if it is determined that a heat sink is necessary in the system, use a heat sink with a minimum footprint equal to the size of the processor package.

Figure 11-4 is a simple model of a thermal system, showing the components of the processor package. This model shows all of the associated components present in a thermal system. Note that there are two possible avenues for thermal heat dissipation: the primary heat dissipation path (i.e., the path with the least thermal resistance) is via the “top” of the processor package (through the thermal path denoted by $\theta_{JC}$), and the secondary heat dissipation path is through the “bottom” of the processor package, via the package balls (through the thermal path denoted by $\theta_{JB}$) to the PCB.

The maximal thermal energy of the processor can be transferred when the thermal resistance from each component in the system is minimized. Thus, the thermal energy generated by the processor can be dissipated to the cooler ambient air of the system (or through the PCB by the use of thermal vias and an internal or external thermal plane).

![Figure 11-4. Thermal System Model Example](image)

The values for $\theta_{JA}$, $\theta_{JB}$, and $\theta_{JC}$ are provided in the “Thermal Characteristics” section of the ADSP-TS201S TigerSHARC Embedded Processor Data Sheet.
Thermal Calculations

To calculate the thermal performance of a system, the first parameter that should be known at the time of performing thermal calculations is the maximum ambient air temperature, \( T_{\text{AMBIENT}} \), of the system. The second parameter that should be known is the value of the processor’s thermal power consumption \( (P_{\text{THERMAL}}) \). The third parameter is the junction-to-ambient thermal resistance, \( \theta_{JA} \). These three system parameters are required to calculate the maximum junction temperature, as shown in Equation 11-2.

Equation 11-2. Processor Junction Temperature Calculation

\[
T_{\text{JUNCTION}} = (P_{\text{THERMAL}} \times \theta_{JA}) + T_{\text{AMBIENT}}
\]

From the result of Equation 11-2, the calculated value for \( T_{\text{JUNCTION}} \) can be used to solve for the calculated value for the processor’s case temperature, \( T_{\text{CASE}} \), using Equation 11-3. The result of Equation 11-3 determines whether a heat sink is required to allow the ADSP-TS20x to operate within the thermal operating conditions specified in the ADSP-TS20xS data sheet. If the calculated value for \( T_{\text{CASE}} \) exceeds the maximum specified case temperature for the device (from the ADSP-TS20xS data sheet), a heat sink is required.
Equation 11-3. Heat Sink Requirement Calculation

\[ T_{CASE\ (max)} = T_{JUNCTION} - (P_{THERMAL} \times \theta_{JC}) \]

If a heat sink is required for the processor, an appropriate heat sink with the proper thermal performance characteristics should be chosen. The following two parameters for the heat sink must be known: the sink-to-ambient (\(\theta_{SA}\)) thermal resistance, and the thermal resistance of the thermal interface material (\(\theta_{CS}\)), which resides between the processor’s case (i.e. processor package) and the bottom surface of the heat sink.

Knowing these two thermal resistance parameters of the desired heat sink, it is now possible to calculate the case temperature (\(T_{CASE}\)) of the processor with the heat sink attached by using Equation 11-4.

Equation 11-4. Derived Heat Sink Requirement Calculation

\[ T_{CASE\ (max)} < T_{AMBIENT} + (P_{THERMAL} \times \theta_{SA}) + (P_{THERMAL} \times \theta_{CS}) \]

If the resultant value from Equation 11-4 exceeds the maximum value for \(T_{CASE\ (MAX)}\) (from the ADSP-TS20xS data sheet), a heat sink with better thermal characteristics is required.

Equation 11-4 yields a conservative estimate for the value for \(T_{CASE}\). This is because there are other paths in the system to sink the thermal energy (for example, through the PCB). A more comprehensive model of the system to include these additional paths can be used when performing the thermal calculations for the processor. (The value for \(\theta_{JB}\) is provided in the data sheet for the ADSP-TS20xS processor.)
For a specific application, the heat sink’s thermal resistance values can be obtained from the particular heat sink vendor.

**Heat Sink Basics**

A heat sink is characterized by its thermal resistance, which describes the flow of heat from the heat sink to the ambient air for a given rise in the heat sink temperature.

Thermal resistance is measured in units of °C/W. Heat sink to local ambient thermal resistance ($\theta_{SA}$) is a measure of the thermal resistance from the bottom of the heat sink to the local ambient air. Lowering the thermal resistance between the processor and the ambient air increases the thermal solution’s efficiency.

Thermal resistance is dependent upon the following four parameters:

1. Heat sink material
2. Thermal conductivity of the heat sink
3. Geometry of the heat sink
4. Air velocity through the fins of the heat sink

**Pin Fins versus Rectangular Fins**

Although rectangular-fin heat sinks have been around longer, pin-fin heat sinks perform better than rectangular-fin heat sinks, especially in environments that provide little or no airflow in the system. (See comparison in Figure 11-5.) Due to the omni-directional structure of pin-fin heat sinks, air can penetrate and exit the heat sink at every possible angle, providing more efficiency. The round shape of the “pin-fins” creates turbulence within the heat sink; this turbulence breaks the stagnant air boundary layers around the pins, enhancing the heat sink’s thermal performance. In
addition, the round pin structure exposes a large percentage of the surface area to incoming airflow without presenting an extreme pressure resistance to the incoming airflow.

Heat sinks of many different sizes are available from the listed manufacturers. Following is a list of recommended heat sink manufacturers and specific heat sinks that exhibit required thermal relief performance. Visit the Web sites listed below for more information.

- *Cool Shield Inc.*, http://www.coolshieldinc.com
- *AAVID Thermalloy*, http://www.aavidthermalloy.com

**PCB Design for Thermal Dissipation**

*Figure 11-6*, a thermal model of the BGA_ED package, shows that more thermal energy is dissipated through the top of the package since the die is upside-down in the package. Due to system design constraints, there may be situations where sufficient clearance to install a heat sink on the device
System Design

may not be available (or there may be enough room only for a smaller heat sink that may not have sufficient heat dissipation characteristics) to allow for thermal power dissipation to escape through this interface.

In this situation, thermal energy can be dissipated from the solder balls of the BGA_ED package to a thermal (i.e. heat sinking) plane of the PCB. Thermal vias in the PCB can be used in conjunction with a heat sinking plane (i.e., a copper layer or some other type of thermally conductive material) of sufficient area to allow thermal transfer to a heat sink or some other means of thermal relief.

Figure 11-6. ADSP-TS20xS BGA-ED Thermal Model

In this situation, thermal energy can be dissipated from the solder balls of the BGA_ED package to a thermal (i.e. heat sinking) plane of the PCB. Thermal vias in the PCB can be used in conjunction with a heat sinking plane (i.e., a copper layer or some other type of thermally conductive material) of sufficient area to allow thermal transfer to a heat sink or some other means of thermal relief.

Thermal Simulations

Due to the high-performance of modern DSP-based systems, proper thermal management is critical for desired performance and operation. Performing thermal simulations on a given system is one method of ensuring proper system performance.

Correct processor performance is not guaranteed when $T_{CASE}$ is exceeded; please ensure that the operating value for $T_{CASE}$ is within the range specified in the ADSP-TS20xS data sheet.
Processor Thermal Relief Methods

Below is a listing of vendors that provide thermal simulation software. These companies can also provide thermal simulation assistance.

- Maya (http://www.mayahtt.com)
- Flotherm (http://www.flowtherm.com)
- ThermoAnalytics, Inc. (http://www.thermoanalytics.com)
- Harvard Thermal Inc. (http://www.harvardthermal.com)

Thermal Design Terminology

- $P_{DD}$: The total power consumed on the $V_{DD}$ voltage domain by the TigerSHARC core. This is an average value, not an instantaneous or peak value.
- $P_{DD,IO}$: The total power consumed on the $V_{DD,IO}$ voltage domain by the Link Ports and Cluster Bus of the TigerSHARC processor. This value is system dependent, and is an average value, (not an instantaneous or peak value).
- $P_{DD,DRAM}$: The total power consumed by the internal DRAM of the processor. This is an average value, not an instantaneous or peak value.
- $P_{THERMAL}$: The total average thermal power consumed by the processor.
- Heat transfer coefficient: theta ($\theta$), given in °C/W.
- Thermal resistance: A measure of the flow of heat from one medium to another.
- Thermal equilibrium: System state when the electrical power dissipated in the device is equal to the heat flow out of the device.
System Design

- $T_{\text{AMBIENT}}$: The temperature of the local air surrounding the processor in the system.
- $T_{\text{CASE}}$: The case temperature of the processor.
- $T_{\text{JUNCTION}}$: The processor junction temperature.
- $T_{\text{SINK}}$: The temperature of the heat sink.
- $\theta_{\text{CA}}$: The thermal resistance between the case of the processor and the ambient air.
- $\theta_{\text{JA}}$: The thermal resistance between the junction of the processor and the ambient air.
- $\theta_{\text{JB}}$: The thermal resistance between the junction and the balls of the package.
- $\theta_{\text{JC}}$: The thermal resistance between the junction and the case of the processor.
- $\theta_{\text{SA}}$: The thermal resistance between the heat sink and the ambient air. This is also sometimes known as the thermal resistance of the thermal interface material (applied between the heat sink and the processor package), or $\theta_{\text{TIM}}$.

Thermal Design References

The following documents from Analog Devices provide information that is helpful for developing processor thermal management.

1. Thermal Relief Design for the ADSP-TS201S TigerSHARC Processor (EE-182), Analog Devices, Inc.
Other Design and Test References

3. ADSP-TS201S TigerSHARC Embedded Processor Data Sheet, Analog Devices, Inc.


5. ADSP-TS201S TigerSHARC Processor Hardware Reference, Analog Devices, Inc.


Other Design and Test References

When designing TigerSHARC processor systems, a number of resources are available, including:

- “ADSP-TS201 Processor EZ-KIT Lite” on page 11-28
- “ADSP-TS201S MP System Simulation and Analysis Cluster Bus Topology – Signal Integrity and PCB Design Considerations” on page 11-29
- “IBIS Models” on page 11-29
- “FPGA Link Port Support” on page 11-30

ADSP-TS201 Processor EZ-KIT Lite

The ADSP-TS201 processor EZ-KIT Lite provides developers with a cost-effective method for initial evaluation of the ADSP-TS201 TigerSHARC processor Family. The EZ-KIT Lite includes two TigerSHARC processors on the desktop evaluation board and fundamental debugging software to facilitate architecture evaluations via a
USB-based, PC-hosted tool set. With this EZ-KIT Lite, users can learn more about Analog Devices TigerSHARC processor hardware and software development and prototype applications.

**ADSP-TS201S MP System Simulation and Analysis**

**Cluster Bus Topology – Signal Integrity and PCB Design Considerations**

This technical article (available at [http://www.analog.com](http://www.analog.com)) describes the PCB implementation of the Analog Devices ADSP-TS201S TigerSHARC processors in clustered systems with detailed signal integrity & timing analysis of cluster bus communication.

**IBIS Models**

I/O Buffer Information Specification (IBIS) models provide transistor characteristics for output drivers for use in circuit modeling.

It is recommended that users perform simulation tests to ensure proper signal integrity. For these purposes, Analog Devices supplies IBIS models. TigerSHARC processor IBIS models are available on the Analog Devices Web site [http://www.analog.com](http://www.analog.com).

For more information access:

- The Analog Devices Web site at [http://www.analog.com](http://www.analog.com) and search for modeling information by entering keyword “IBIS”, in the web site search feature.

Other Design and Test References

FPGA Link Port Support

There are links and articles (available at http://www.analog.com) that identify implementations and third party providers of FPGA support for ADSP-TS201S TigerSHARC processors.
Recommended Reading References

The text High-Speed Digital Design: A Handbook of Black Magic is recommended for further reading. This book is a technical reference that covers the problems encountered in state-of-the-art, high frequency digital circuit design and is an excellent source of information and practical ideas.

Topics covered in the book include:

- High-Speed Properties of Logic Gates
- Measurement Techniques
- Transmission Lines
- Ground Planes and Layer Stacking
- Terminations
- Vias
- Power Systems
- Connectors
- Ribbon Cables
- Clock Distribution
- Clock Oscillators

INDEX

Symbols
.D unit, See IALU
.L unit, See ALU
.M unit, See multiplier
.S unit, See shifter
θ_{CA} (case-to-ambient thermal resistance), 11-27
θ_{JA} (junction-to-ambient thermal resistance), 11-27
θ_{JB} (junction-to-base thermal resistance), 11-27
θ_{JC} (junction-to-case thermal resistance), 11-27
θ_{SA} (sink-to-ambient thermal resistance), 11-27

Numerics
16-bit count, DMA, 7-18
16-bit modify, DMA, 7-18
2DDMA (two dimension DMA) bit, 2-64, 7-19, 7-21
  enabling/disabling, 7-47
32-bit bus, 8-13, 8-17
  and SDRAM, 8-60, 8-65, 8-66, 8-67
64-bit bus, 8-13, 8-17
  and SDRAM, 8-60, 8-61, 8-62, 8-63

A
A10 (address 10) pin, SDRAM, 8-6
AC (ALU carry) bit, 2-21
  See also ALU status
accelerator, See CLU (communications logic unit)

accesses
  bus arbitration, 3-3
  accesses, bus arbitration, 8-11, 8-12
  accesses, sequential, 8-18
  access time, 1-22
ACK (memory acknowledge) pin, 1-6, 8-3, 8-6, 8-16, 8-23, 8-24
  and back off, 8-49
  description, 8-4
  slow protocol, 8-27
  wait cycle insertion, 8-23
acknowledge (ACK) pin, See ACK (memory acknowledge) pin
ACT (activate) command, 8-86
  pin state, 8-85
active status, DMA, 7-23
ACT-to-PRE delay (tRAS), 8-51
addition, reverse carry, 1-14
additional literature, xxiv
ADDR (DMA address) bits, 2-62
ADDR31–0 (address bus) pins, 1-6, 2-1, 8-1, 8-6
  and SDRAM, 8-57, 8-58
description, 8-4
address (ADDR31–0) pins, See ADDR31–0 (address bus) pins
INDEX

addresses
branch target, 1-17
from register, 1-16
immediate, 1-16
indirect, 1-14
jump, 1-17
relative, 1-17
addressing
See also bit reversed addressing and
   circular buffer addressing
circular buffer, 1-14
direct, 1-16
indirect, 1-16
modulo, 1-14
address modify, DMA, 7-5, 7-18
address pointer registers, 10-5
address range restrictions, DMA, 7-30
address space, 2-1
   external memory bank space, 2-3, 2-4 to
      2-5
   external multiprocessor space, 2-3, 2-6 to
      2-7, 2-60
   host address space, 2-4
   internal address space, 2-3, 2-7
AI (ALU invalid) bit, 2-21
   See also ALU status
AIS (ALU invalid, sticky) bit, 2-20
   See also ALU status
algorithms
digital filters, 1-14
Fourier transforms, 1-14
path search, 1-12
turbo code, 1-12
Viterbi coding, 1-12
alignment, external bus data, 8-7
alignment of words, 1-11
alignment restrictions, DMA, 7-29
alternate initialization, link port, 9-12
ALU (arithmetic logic unit), 1-2, 1-4, 1-33
   registers, 2-20
   status, 1-33
ambient air temperature, 11-21
AN (ALU negative) bit, 2-21
   See also ALU status
AOS (ALU overflow, sticky) bit, 2-20
   See also ALU status
arbitration
   between buses, 1-22, 1-28, 3-3, 8-11,
      8-12
   DMA priority, 3-6, 7-22
   SOC bus, 3-3
   SOC interface priority, 3-6, 8-12
   system priority, 8-37
architecture
   DMA unit, 7-4
   external port, 8-5
   features, 1-7
   link port, 9-2
   SOC interface, 3-2
   Static Superscalar, 1-8
   system, 1-5
asynchronous devices, slow protocol, 8-27
asynchronous protocol, 2-4
audience, intended, xxi
AUS (ALU underflow, sticky) bit, 2-20
   See also ALU status
autoDMA, 2-60, 2-61, 7-10, 7-27
   booting, 11-5
   channels, 1-27, 7-3, 7-37
   error condition, 6-22
   external to internal DMA transfers, 7-54
   hardware error interrupt, 7-33
   operation, 7-9, 7-13
   setup restrictions, 7-29
   two dimension, 7-11
autoDMA registers, 7-14, 7-16, 7-32, 7-34
  accesses, 7-3
  channel operation, 7-57
  control, 7-32
  DMA priority, 7-37
  register group, 2-12, 2-60
  usage, 7-10
AUTODMAx, y (autoDMA data registers)
  registers
  address, 2-60
auto setup, DMA, 7-41
AV (ALU overflow) bit, 2-21
  See also ALU status
AVS (ALU overflow, sticky) bit, 2-20
  See also ALU status
AZ (ALU zero) bit, 2-21
  See also ALU status

B
back off (BOFF) pin, See BOFF pin
bank activate (ACT) command, 8-84
banks of memory, 2-2
  bus pipeline depth, 8-17
  idle insertion, 8-19
  slow device protocol, 8-26
  wait cycles, 8-23
BF (block floating-point) bit, 2-21
  See also shifter status
BFOTMO (bit FIFO temp) register, 2-21
BFOTMP (bit FIFO temp) register, 1-13
bit field manipulation, 1-13
bit FIFO, 1-13
bit operations, 1-13
bit reverse addressing, 1-14
bit set, clear, toggle, and test, 1-13
bit wise barrel shifter, See shifter
block completion
  interrupt, 7-21
  link port, 9-9
blocks of data length, link port, 9-15
blocks of memory, 2-2
  broadcast, 2-36
BM (bus master) pin, 1-6, 6-21, 8-6
  description, 8-35
  usage, 11-7
BMAX (bus fairness count) bits, 2-80
BMAX (bus fairness count) register, 2-80, 8-44
  address, 2-72
  and system priority, 8-38
BMAXC (bus fairness count current count) bits, 2-80
BMAXC (bus fairness count current count) register, 2-80
BMS (boot memory select) pin, 1-6, 1-22, 8-6
  description, 8-4
  EPROM boot, 8-33
  usage, 9-11, 11-3, 11-4, 11-5, 11-6, 11-7
BNKxIDLE (memory bank x idle insert) bit, 2-75
BNKxPIPE (memory bank x pipe depth select) bits, 2-75
BNKxSLOW (memory bank slow protocol select) bits, 2-75
BNKxWAIT (memory bank x wait select) bits, 2-75
BOFF (back off) pin, 1-6, 2-4, 8-6, 8-35, 8-48 to 8-49
boot EPROM
  connections, 8-6
DMA, 7-22

INDEX
INDEX

booting, 6-3
boot loading, 1-22
EPROM boot with (BMS) pin, 6-3
host boot, 1-31
interrupts, 6-21
link boot, 6-3, 9-10, 9-11
methods, 11-2
mode, 11-3
options, 11-2
references, 11-8
boot loader kernels, 11-3, 11-8
BOOTMDE (boot mode) bit, 2-79
boot memory select (BMS) pin, See BMS
(boot memory select) pin
boundary scan, 11-15
BR7–0 (bus request) pins, 1-6, 8-6, 8-36,
8-44
and bus arbitration, 8-38
and priority access, 8-40, 8-42, 8-43
and system priority, 8-37
description, 8-35
branch
penalty, 1-17
prediction, 1-17, 6-24
target address, 1-17
branch target buffer (BTB), See BTB
broadcast
access, 2-19
distribution, 2-3
DMA channel error interrupt, 7-23
read access, 2-11, 6-22, 6-23
write, system control registers, 8-2
write access, 2-3
write access, 2-11
broadcast to all memory blocks, 2-36
BRST (burst access) pin, 1-6, 1-28, 8-6,
8-16
burst pipelined transactions, 8-18
description, 8-5
host processor, 8-48
BSTOP (burst stop) command, 8-86, 8-89,
8-90
BTB (branch target buffer), 1-2, 1-4, 1-9,
1-15, 1-17, 1-21, 2-32
BTBEN (BTB enable) bit, 2-32
BTBLK (BTB lock) bit, 2-32
BTBMOD (BTB mode) bits, 2-32
BTBPR (BTB predictor) bit, 2-90
buffering/distributing the clock, 11-10
burst (BRST) pin, See BRST pin
burst pipeline transaction, 8-18
burst read transfer timing, 8-21, 8-25,
8-87, 8-88
burst stop (BSTOP) command, See
BSTOP command
burst write transfer timing, 8-26, 8-92
SDRAM, 8-75
bus, 1-23
arbitration, 1-28, 1-30, 3-3
clock domains, 3-2
clock rate, 1-24
cluster, 1-9, 1-24
data, 1-11
DMA, 1-26
I-, J-, K-, and S-buses, 1-23
SOC bus, 3-2
width, 1-4, 1-5
bus arbitration, 1-30, 7-36, 8-1, 8-3, 8-5,
8-11, 8-12, 8-36, 8-37, 8-48 to 8-49
protocol, 8-38 to 8-39
SOC priority, 3-6
system priority, 8-37
bus control/status registers, 2-60, 2-80
buses
cluster bus, 3-2
DMA operations, 7-33
processor core buses, 3-1
SOC bus, 3-2
system bus, 1-29
INDEX

ADSP-TS201 TigerSHARC Processor Hardware Reference
INDEX

CIX (column index) register, 7-47
CJMP (computed jump) instruction, 1-17
CJMP (computed jump) register, 2-27
CKE (SDRAM clock enable) pin, 8-6
CLKMULT (SCLKRATx pin input) bits, 2-79
clear bit, 1-13
clock distribution, 11-10, 11-12, 11-13
clock domains, 1-17, 3-2
DMA unit, 7-2
extended core, 1-25
external port, 8-8
flags, 5-1
interrupts, 6-3
link port, 9-2
timer, 4-8
clock duty cycle requirements, 11-11
clock generation components, 11-11
clock input, system, 8-6
clock jitter requirements, 11-10
clock rate, bus, 1-24
CLU (communications logic unit), 1-2, 1-4, 1-12, 1-33
registers, 2-22
status, 1-33
cluster bus, 1-9, 1-24, 3-2
bus master DMA, 7-3
priority, 3-6, 8-12
transfers, 7-8
cluster multiprocessing, 1-30
CMCTL (communication control) register, 2-22
CNT-X (DMA X count) bits, 2-62
CNT-Y (DMA Y count) bits, 2-62
column address select (CAS) pin, See CAS pin
column count (COL), 7-46
column index (CIX) register, 7-45
columns, two dimension DMA, 7-45
communication protocol, link port, 9-12
completion status, DMA, 7-23
complex numbers, 1-12
COM port, McBSP, See link port
compute block, 1-4, 1-9, 1-10, 1-22
data registers, 2-14, 2-20
multiplier registers, 2-21
pipeline, 1-16
registers, unmapped, 2-19
RF groups, 2-14
shifter registers, 2-21
status, 2-20
status registers, 2-20
unmapped registers, 2-19
compute block register groups, 2-12, 2-14
access restrictions, 1-19
X registers, 2-15
XY (broadcast access) registers, 2-19
XY (merged access) registers, 2-17
Y registers, 2-16
YX (merged access) registers, 2-18
context save/restore, 6-12
nested interrupts, 6-14
context switching, 1-17
core clock (CCLK), 1-17, 3-1
core priority access (CPA) pin, See CPA (core priority access) pin
counters, free-running, See timers
CPA (core priority access) pin, 1-6, 8-6, 8-40 to 8-43
See also DPA pin, 8-36
and system priority, 8-38
description, 8-36
crossbar configuration, 1-30
INDEX

cross channel DMA
examples, 7-73
restrictions, 7-40
CS (SDRAM chip select) pin, 8-6
customer support, xxv
cycle counter, 1-21, 10-4
cycle counter (CCNT1–0) registers, 2-92, 10-9

data
DAB (data alignment buffer), 1-2
data (Dreg) registers, 1-24
data, immediate, 1-16
DATA63–0 (data bus) pins, 1-6, 8-1, 8-6
description, 8-4
EPROM boot, 8-33
data access, 1-24
data alignment
32-bit bus, 8-64
64-bit bus, 8-60
FLASH memory data, 8-34
SDRAM connections, 8-60
data alignment buffer (DAB), 6-23
data block length, link port, 9-15
data bus (DATA63–0) pins, See DATA63–0 pins
data flow, 1-22
data latching/driving, link port, 9-1
data overrun error, link port, 9-19
data packing, 1-29
data registers, 10-13
data sheets, 11-1
data structures, 1-14
data throughput rates, 8-78

data transfers
cluster bus, 7-8
DMA operations, 7-1, 7-7, 7-8, 7-33, 7-45
external-to-external memory (flyby), 7-60, 7-62
external-to-internal memory, 7-54
flyby mode transfers, 7-13
internal-to-external memory, 7-57
internal-to-internal memory, 7-3
multi-word transfers, 8-18
normal, long, and quad word, 8-17
readiness, 7-9
read wait cycle insertion, 8-23, 8-31
terminology, 7-14
write wait cycle insertion, 8-24
data transfer timing
burst read timing, 8-21, 8-25, 8-87, 8-88
burst write timing, 8-26, 8-92
CPA/DPA/HBR timing, 8-39
EPROM access timing, 8-33
FLASH write timing, 8-34
link port timing, 9-1, 9-13
pipeline depth and timing, 8-22
pipelined protocol timing, 8-14, 8-16, 8-20
SDRAM timing, 8-75
slow protocol timing, 8-14, 8-28, 8-29, 8-31
data types, 1-11, 1-12
DBG (debug) interrupt, 6-17
DBGE (debug emulation) register, 6-14
address, 2-27
exceptions, 6-15
usage, 6-10
DBGEN (debug enable) bit, 2-29
DCDx (DMA channel dest TCB) registers, 7-6, 7-17
chaining example, 7-44
<table>
<thead>
<tr>
<th>Index Item</th>
<th>Page Range</th>
</tr>
</thead>
<tbody>
<tr>
<td>DCDx, DI (ext port DMA dest TCB index) registers</td>
<td>2-56</td>
</tr>
<tr>
<td>DCDx, DP (ext port DMA dest chain ptr) registers</td>
<td>2-56</td>
</tr>
<tr>
<td>DCDx, DX (ext port DMA dest X dim) registers</td>
<td>2-56</td>
</tr>
<tr>
<td>DCDx, DY (ext port DMA dest Y dim) registers</td>
<td>2-56</td>
</tr>
<tr>
<td>DCNT (DMA control) register</td>
<td>2-51, 7-26, 7-43</td>
</tr>
<tr>
<td>DCNTCL (DMA control clear) register</td>
<td>7-27</td>
</tr>
<tr>
<td>DCNTST (DMA control set) register</td>
<td>7-27</td>
</tr>
<tr>
<td>DCsx (DMA channel src TCB) registers</td>
<td>7-6, 7-17</td>
</tr>
<tr>
<td>DCsx, DI (ext port DMA src TCB index) registers</td>
<td>2-56</td>
</tr>
<tr>
<td>DCsx, DP (ext port DMA src chain ptr) registers</td>
<td>2-56</td>
</tr>
<tr>
<td>DCsx, DX (ext port DMA src X dim) registers</td>
<td>2-56</td>
</tr>
<tr>
<td>DCsx, DY (ext port DMA src Y dim) registers</td>
<td>2-56</td>
</tr>
<tr>
<td>DCx (DMA channel TCB) registers</td>
<td>7-17</td>
</tr>
<tr>
<td>DCx, DI (link port RX DMA TCB index) registers</td>
<td>2-58</td>
</tr>
<tr>
<td>DCx, DI (link port TX DMA TCB index) registers</td>
<td>2-57</td>
</tr>
<tr>
<td>DCx, DP (link port RX DMA chain ptr) registers</td>
<td>2-58</td>
</tr>
<tr>
<td>DCx, DP (link port TX DMA chain ptr) registers</td>
<td>2-57</td>
</tr>
<tr>
<td>DCx, DX (link port RX DMA X dim) registers</td>
<td>2-58</td>
</tr>
<tr>
<td>DCx, DX (link port TX DMA X dim) registers</td>
<td>2-57</td>
</tr>
<tr>
<td>DCx, DY (link port TX DMA Y dim) registers</td>
<td>2-57</td>
</tr>
<tr>
<td>Deadlock</td>
<td>8-48 to 8-49</td>
</tr>
<tr>
<td>Debug</td>
<td></td>
</tr>
<tr>
<td>Concepts</td>
<td>10-4</td>
</tr>
<tr>
<td>Functionality</td>
<td>10-1</td>
</tr>
<tr>
<td>Registers</td>
<td>2-84</td>
</tr>
<tr>
<td>Resources</td>
<td>10-5</td>
</tr>
<tr>
<td>Debug (DBG) interrupt</td>
<td>6-17</td>
</tr>
<tr>
<td>Debug registers group</td>
<td>2-12, 2-84</td>
</tr>
<tr>
<td>Access restriction</td>
<td>1-21</td>
</tr>
<tr>
<td>Delay lines</td>
<td>1-14</td>
</tr>
<tr>
<td>Dependency</td>
<td></td>
</tr>
<tr>
<td>Check</td>
<td>1-16</td>
</tr>
<tr>
<td>DESpread instruction</td>
<td>1-12</td>
</tr>
<tr>
<td>Destination address</td>
<td>7-5</td>
</tr>
<tr>
<td>Development enhancements</td>
<td>1-7</td>
</tr>
<tr>
<td>Device type</td>
<td>7-22</td>
</tr>
<tr>
<td>Digital filters</td>
<td>1-14</td>
</tr>
<tr>
<td>Direct addressing</td>
<td>1-14</td>
</tr>
<tr>
<td>Direct calls</td>
<td>1-16</td>
</tr>
<tr>
<td>Direct jumps</td>
<td>1-16</td>
</tr>
<tr>
<td>Direct memory access (DMA)</td>
<td></td>
</tr>
<tr>
<td>See DMA</td>
<td></td>
</tr>
<tr>
<td>Distributing/buffering the clock</td>
<td>11-10</td>
</tr>
<tr>
<td>Dlx (DMA index) registers</td>
<td>2-61, 2-62, 7-5, 7-18</td>
</tr>
<tr>
<td>Chaining example</td>
<td>7-44</td>
</tr>
<tr>
<td>Operation</td>
<td>7-16</td>
</tr>
<tr>
<td>Two dimension DMA operation</td>
<td>7-45</td>
</tr>
<tr>
<td>Usage</td>
<td>7-34</td>
</tr>
<tr>
<td>Dlx (ext port DMA dest TCB index) registers</td>
<td>2-56</td>
</tr>
<tr>
<td>Dlx (ext port DMA src TCB index) registers</td>
<td>2-56</td>
</tr>
<tr>
<td>Dlx (link port RX DMA TCB index) registers</td>
<td>2-58</td>
</tr>
<tr>
<td>Dlx (link port TX DMA TCB index) registers</td>
<td>2-57</td>
</tr>
<tr>
<td>Dlx (link port TX DMA TCB index) registers</td>
<td>2-57</td>
</tr>
</tbody>
</table>
INDEX

DMA, 1-21, 1-23, 1-27, 2-60, 6-3, 6-22, 7-1, 7-33 to 7-34, 7-41
active status, 7-23
aligned boundaries, 7-6
architecture, 7-12 to 7-13
block complete interrupts, 7-15
boot DMA processes, 7-72
completion status, 7-23
count values, 7-6
data transfer readiness, 7-9
device type, 7-22
disable/enable, 7-47
DMA interrupt, 1-26, 6-3, 6-20, 7-40
enable/disabled, 7-22
ending a sequence, 7-52
error status, 7-23
examples, 7-73
high priority, 3-6, 7-39, 8-12
inactive/active TCB loading, 7-17
internal/external memory setup, 7-36
internal memory arbitration, 7-1
interrupts, 7-17, 7-21, 7-23, 7-49
memory addressing, 7-34, 7-34 to 7-36
operand length, 7-6, 7-21
pausing a sequence, 7-26, 7-52
priority select, 7-22
programming, 7-15 to 7-30
request enable, 7-21
resuming a sequence, 7-52
rows and columns, 7-45
SDRAM accesses, 8-1
semaphores, 7-63
SOC bus, 3-3, 7-1
starting a sequence, 7-15, 7-17
suspending a sequence, 7-52
terminology, 7-14
throughput, 7-71 to 7-72
transfer complete, 7-49
transfer types, 7-3, 7-8, 7-33
transfer word count, 7-18

DMA (direct memory access), See DMA
DMA, autoDMA, 2-60, 2-61, 7-9, 7-13,
7-14, 7-16, 7-27
channel priority, 7-37
control, 7-32
errors, 6-22
external to internal, 7-54
operation, 7-34
restrictions, 7-29
two dimension, 7-11
DMA, chained, 7-6, 7-11, 7-14, 7-17,
7-19, 7-31, 7-33, 7-35, 7-40, 7-40 to
7-43, 7-50, 7-51
chain insertion, 7-43
DMA complete interrupt, 7-42
enable, 7-20
enable/disables, 7-42
epletion, 7-21
example, 7-44
insertion, 7-12
operations, 7-3
priority, 7-37
set up and start, 7-43
DMA, external port, 1-28, 2-56, 7-32, 7-52
to 7-65, 8-3, 8-9, 8-11
cluster bus transfers, 7-8 to 7-9
EPROM access, 8-34
EPROM boot, 8-33
flyby mode, 7-13, 7-32
flyby transactions, 8-74 to 8-76
handshake mode, 7-64
host processor DMA, 7-10
DMA, link port, 1-32, 7-10, 7-31, 7-66 to
7-69, 9-9
channels, 9-9
link input and IFIFO DMA TCB, 2-58
link port service interrupts, 6-20
DMA, two dimension, 7-11, 7-19, 7-21, 7-27, 7-31, 7-36
   enable, 7-21
   examples, 7-48
   transfer steps, 7-46
DMA13–0 (DMA channel complete) interrupts, 6-18
DMA address
   alignment restrictions, 7-29
   destination, 7-5
   increment, 7-5
   modify, 7-18
   range restrictions, 7-30
DMA chained, 1-27, 2-63
   chain pointer, 7-20
DMA chaining enable (CHEN) bit, 7-41
DMA chaining pointer (DP) register, 2-63
DMA channel complete (DMA13–0) interrupts, 6-18
DMA channels, 1-26, 2-56, 2-58, 6-20, 7-1, 7-8, 7-16 to 7-23, 7-34, 7-36 to 7-40
   autoDMA, 7-32
   booting, 11-4, 11-5
   chaining, 7-3
   control, 7-16
   cross channel chaining, 7-20, 7-40
   external memory, 7-16
   external port, 1-29, 7-54
   hardware error interrupt, 7-23
   idle state, 2-53
   initialization, 7-4, 7-17
   link port, 7-34, 9-16
   pausing to update a TCB, 7-42
   priority, 7-36, 7-37, 7-38
   priority and chain loading, 7-43
   setup restrictions, 7-28, 7-29
   single and cross chained DMA, 7-12
   TCB registers, 7-17
   uninitialized channels, 7-32
DMA control (DCNT) register, See
   DCNT register
DMA control and chaining pointer (DPx) registers, See DPx registers
DMA control clear (DCNTCL) register, See DCNTCL register
DMA controller, 1-4, 1-26, 7-1, 7-8 to 7-11, 7-31 to 7-52
   data flow, 7-4
   features, 7-8
   internal registers, 7-45
   operations, 7-31
DMA control set (DCNTST) register, See
   DCNTST register
DMA data transfer, 7-7
DMA index (DIx) register, See DIx registers
DMA priority access (DPA) pin, See DPA (DMA priority access) pin
DMAR3–0 (DMA request) pins, 1-6, 1-27, 3-2, 7-4, 7-13, 7-32, 8-6
   data readiness, 7-9
   DMA priority, 7-38
   enable, 7-21
   handshake mode, 7-64
   operation, 7-9, 7-13
DMA registers, 2-51, 7-15 to 7-30
   access restrictions, 2-7
   control and status, 7-23, 7-26 to 7-30
   DMA registers group, 2-12, 2-51
   receiver TCB, 7-8
   restrictions, 7-27 to 7-30
   TCB registers, 7-15, 7-16
DMA request (DMAR3–0) pins, See
   DMAR3–0 pins
DMA request (DRQ) bit, setup restrictions, 7-29
DMA sequence, 7-41, 7-51 to 7-52
   set up and start, 7-51
INDEX

DMA status (DSTAT) register, 7-23, 7-23 to 7-25
DMA transfer control block (TCB), See TCB
DMA type field (TY) bits, 7-17
DMAx_P (DMA channel pause) bits, 2-52
DMA X dim count/modify (DXx) registers, 2-61, 7-5
operation, 7-16
two dimension DMA operation, 7-45
DMA Y dim count/modify (DYx) registers, 2-61, 7-6
operation, 7-16
two dimension DMA operation, 7-45
documentation resources, 11-1
double (Rmd) registers, 1-10, 1-11
DPA (DMA priority access) pin, 1-6, 8-6, 8-42 to 8-43
See also CPA pin, 8-6
and bus arbitration, 8-39
and system priority
description, 8-36
DPx (DMA chain pointer) registers, 2-63, 2-64, 7-5, 7-6, 7-19
bit descriptions, 7-20
chaining example, 7-44
operation
usage, 7-43
DPx (ext port DMA dest chain ptr) registers, 2-56
DPx (ext port DMA src chain ptr) registers, 2-56
DPx (link port RX DMA chain ptr) registers, 2-58
DPx (link port TX DMA chain ptr) registers, 2-57
DRAM, embedded
See embedded DRAM
Dreg (data) registers, 1-24, 2-20
DRQ (DMA request enable) bit, 2-64, 7-19, 7-21, 7-29
DS2–0 (drive strength) pins, 1-6, 8-6
DSP architecture, 1-8, 1-33
DSTAT (DMA status) register, 2-51, 7-19, 7-21
2-53
DUAL (DMA channel) register, 7-55, 7-56
dual register accesses, 2-10
dual TCB channel, 7-55, 7-56
dual words, 1-15
dummy byte, link port, 9-15
D unit, See IALU
duty cycle requirements, clock, 11-11
DXx (DMA X dim count/modify) registers, 2-61, 2-62, 7-5, 7-18, 7-19
chaining example, 7-44
restrictions, 7-27
row/column calculation, 7-46
two dimension DMA operation, 7-45
usage, 7-35
DXx (ext port DMA dest X dim) registers, 2-56
DXx (ext port DMA src X dim) registers, 2-56
DXx (link port RX DMA X dim) registers, 2-58
DXx (link port TX DMA X dim) registers, 2-57
DYx (DMA Y dim count/modify) registers, 2-61, 2-62, 7-6, 7-19
chaining example, 7-44
restrictions, 7-27
row/column calculation, 7-46
two dimension DMA operation, 7-45
usage, 7-35
DYx (ext port DMA dest Y dim) registers, 2-56
DYx (ext port DMA src Y dim) registers, 2-56
DYx (link port TX DMA Y dim) registers, 2-57
### INDEX

**E**
- EBIU (external bus interface unit), 2-12, 3-2
- EBOOT (EPROM boot strap) pin, 8-32
  - usage, 9-11
- edge sensitive interrupts, 6-3, 6-5, 6-10, 6-19, 6-20, 6-21
- embedded DRAM, 1-4, 1-33
- EMCAUSE (emulator cause) bit, 2-33
- EMREN (SDRAM extended mode register) bit, 2-77
- EMU (emulator) pin, 8-6, 10-11
- EMUCTL (emulator control) register, 1-20
- EMUIR (emulator instruction) register, 1-21
- EMUL (emulator interrupt, pointer) bit, 2-32
- emulation, 2-33
  - debug, 6-10, 6-24
  - EMUIR (emulator instruction) register, 6-15
  - emulation technical reference, 11-17
  - EMU pin, 10-11
  - exception, 1-20, 6-10, 6-15
- emulator
  - mode, 1-18, 1-20 to 1-21, 6-10, 6-24, 10-4
  - trap, 1-20, 6-17, 6-24, 10-6
- emulator trap (EMUTRAP) instruction, 1-20
- EMUMOD (emulator mode) bit, 2-83
- EMUSTAT (emulator status) register, 2-83
- EMUTRAP (emulator trap) instruction, 1-20
- endian matching, 11-16
- engineer-to-engineer notes, 11-1
- EPROM, DMA access to, 7-22
- EPROM booting, 1-22, 11-2, 11-4
  - (BMS) pin, 6-3
- EPROM interface, 8-3, 8-32, 8-32 to 8-34
  - error detection mechanisms, link port, 9-19
  - error interrupt, DMA, 7-40
  - error status, DMA, 7-23
  - ESL capacitors, 11-10
  - ESR capacitors, 11-10
  - estimating power, 11-17, 11-27
- EX (watch point execution state) bits, 2-89
- EXCAUSE (exception cause) bit, 2-33
- exception (software interrupt), 2-40, 10-4
  - arithmetic, 1-26
  - software, 2-20
- execution
  - rate, 1-16
  - time, 1-23
- extended core, 3-2
- extended word, 1-11
- external bus, 1-28, 7-8 to 7-9
  - and host interface, 1-28
  - control signals, 8-16
  - features, 8-2
  - normal, long, or quad word transactions, 8-7
- external bus interface unit (EBIU), 8-2
  - See also external port
- external I/O device to external memory (flyby), 7-60
- external memory, 1-29, 1-31, 2-3, 2-4 to 2-5
  - bank space, 2-4
  - bus pipeline depth, 8-17
  - bus width, 8-13, 8-15
external memory DMA, 1-26, 1-27, 7-3, 7-7, 7-9
adressing, 7-53
chained, 7-41
channels, 7-1, 7-16
flyby mode transfers, 7-13
link port transfers, 7-10
memory to link TCB, 7-68, 7-69
setup, 7-28, 7-36, 7-55, 7-59, 7-61, 7-63
throughput, 7-71
to external I/O device (flyby), 7-62
to external memory, 7-60, 7-62
to internal memory, 7-54
transfer complete, 7-49
type select, 7-22
external port, 1-3, 1-5, 1-9, 1-22, 1-28 to 1-31, 7-14, 7-68, 7-71, 8-7, 8-9, 8-10
to 8-49
burst transfer timing, 8-87, 8-88, 8-92
bus width, 8-2, 8-60
connections, 8-6
CPA/DPA/HBR arbitration timing, 8-39
data alignment, 8-60, 8-64
EPROM access timing, 8-33
external bus interface unit (EBIU), 7-65
FIFOs, 1-28
FLASH memory data, 8-34
flyby mode, 7-65
high priority transfers, 8-12
IFIFO (input FIFO), 7-14, 7-68, 7-71, 8-7, 8-9, 8-10
OFIFO (output FIFO), 7-49, 7-62, 7-68, 7-72, 8-8, 8-9
pipelined protocol, 8-15
reset operations, 8-45
sequential accesses, 8-18
slow protocol, 8-26 to 8-29
external port, (continued)
SOC bus priority, 3-3
transfers to link port, 1-32
Ureg accesses, 8-9
external port bus interface unit (EBIU), 3-2
external port DMA, 7-52
channel priority, 7-37
channels, 1-27
to temporary control, 7-32
to transfer types, 7-54
transfer types, 7-54
external port registers group, 2-12, 2-56
EXTYPE (watch point on count expiration type) bits, 2-86, 2-87, 2-88
EZ-KIT Lite, 11-28
F
FFT algorithms, 1-14
fixed-point data formats, 1-10
flag (FLAG3–0) pins, See FLAG3–0 pins
FLAG3–0 (flag I/O) pins, 1-6, 1-25, 3-2, 5-1, 5-10
flag control (FLAGREG) register, 5-1
flag input (FLAGx_IN) conditions, 5-1
flag input/output enable (FLAGx_EN) bits, 5-1
flag output select (FLAGx_OUT) bits, 5-1
flag pin input value (FLGx) bits, 5-1
FLAGREG (flag control) register, 2-27, 2-28, 5-1
flags, 5-1 to 5-11
clock domains, 5-1
handshake protocol, 5-2
programming, 5-3
SOC bus, 5-1
FLAGx_EN (flag enable) bits, 2-28, 5-1
FLAGx_IN (flag input) conditions, 5-1
INDEX

FLAGx_OUT (flag output select) bits, 2-28, 5-1
FLASH memory data preparation, 8-34
FLASH memory interface, 8-3, 8-34
FLGx (FLAG pin input value) bits, 2-32, 5-1
floating-point data format, 1-10
flyby mode, 7-60, 7-62, 7-65
DMA, 7-13
DMA type select, 7-22
operations, 7-32
transactions, 7-9, 7-60 to 7-63, 8-3, 8-74, 8-74 to 8-76
formats, 1-12
data word, 1-11
fixed-point, 1-10
floating-point, 1-5, 1-10
Fourier transforms, 1-14
FPGA support, 11-30
free-running counters, See timers
full-duplex operation, link port, 9-1

G
GIE (global interrupt enable) bit, 2-29, 6-6
interrupt service routines, 6-9
nesting, 2-32
global interrupt enable (GIE) bit, 6-6
interrupt service routines, 6-9
global memory, 1-28, 1-30, 2-2
connections, 8-6
global static flags, See SFREG register
GR (granted bus request) bits, 2-91
GSCFx (IALU static flag) bits, 2-30

H
handling interrupts, 6-13, 6-14
handshake mode, 7-64
handshake protocol, flags, 5-2
hardware error (HWERR) interrupt, 6-17, 6-22
DMA channel, 7-23
link port write error, 9-22
operations, 6-22
hardware interrupt, 1-20, 6-2, 6-8, 6-10,
6-17, 6-22
interrupt flow, 6-12
operation, 6-11
operation flow, 6-11, 6-12
HBG (host bus grant) pin, 1-6, 1-31, 8-6
description, 8-35
HBR (host bus request) pin, 1-6, 1-31, 8-6
description, 8-35
HDQM (high word SDRAM data mask) pin, 1-6, 8-6
description, 8-56
operations, 8-79
heat dissipation, 11-20, 11-25
heat sinks
See also thermal relief design, 11-22
rectangular vs. pin fin, 11-23
requirement, 11-22
heat transfer coefficient, 11-26
high priority DMA, 7-22, 7-37
bus arbitration, 8-12
chain insertion, 7-43
chain loading, 7-43
high priority external port, 8-12
high priority SOC interface, 8-12
high word SDRAM data mask (HDQM) pin, 8-58
high word SDRAM data mask (LDQM) pin, 8-1
host bus grant (HBG) pin, 1-31, 8-36
and back off, 8-48
and bus arbitration, 8-39
and bus fairness, 8-44
INDEX

host bus request (HBR) pin, 1-31, 8-36, 8-43
  and bus arbitration, 8-39
  and system priority, 8-38
  transfer timing, 8-41, 8-42
HOSTIDLE (host idle insert) bits, 2-75
HOSTPIPE (host pipe depth select) bits, 2-74
host processor, 1-28, 1-30
  address bus connections, 11-16
  address space, 2-4
autoDMA operations, 7-10
booting, 1-22, 1-31, 11-2, 11-5
  bus pipeline depth, 8-17
  bus width, 8-13, 8-15
  connections, 8-6
deadlock recovery, 8-49
DMA, 7-3
flags, 5-1
  interface, 1-31, 8-3, 8-46 to 8-48
pipeline protocol, 8-1
  slow protocol, 8-26, 8-29
HOSTSLOW (host slow protocol select) bit, 2-74
HOSTWAIT (host wait select) bits, 2-75
HSTMSTR (host master) bit, 2-79
HSTWIDTH (host bus width select) bit, 2-74
HWERR (hardware error) interrupt, 6-17

I
I (instruction) bus, 1-2, 1-23
IAB (instruction alignment buffer), 1-2, 1-4
IALU (integer ALU), 1-2, 1-4, 1-14, 1-22, 1-33
  execution latency, 1-15
  IALU registers, 2-22, 6-13
  immediate extensions, 1-16
IALU (integer ALU), (continued)
  registers, 1-14, 2-22
  RF groups, 2-22
  status, 1-33, 2-26
IALU registers, 2-22
  access restrictions, 1-19
IALU registers group, 2-12, 2-22
J-IALU registers, 2-23, 2-25
K-IALU registers, 2-24, 2-25
IBIS (I/O buffer information specification) models, 11-29
I-bus, 3-1
  arbitration priority, 8-11
  SOC interface, 3-4
ID2–0 (processor ID) pins, 1-6, 2-2, 2-6, 8-5, 8-6
  an memory accesses, 8-9
  description, 8-35
  usage, 11-15
IDCODE (silicon version and processor model) register, 2-83
IDLE (idle cycle insertion) bit, 8-14, 8-19
IDLE (state) bit, 2-33
  idle cycle insertion, 8-19, 8-20, 8-34
  idle cycles, external port, 8-2
  idle state, timers, 4-2
IDLE state/instruction, 1-20
IEEE 754/854 standard exceptions, 1-32
  See also data alignment and data types
IFIFO (input FIFO), 2-58, 8-7
IFIFO (input FIFO), external port, 7-14, 8-7, 8-9
data flow, 8-10
  threshold, 3-6, 8-12
IFIFO (input FIFO), SOC maximum delay, 3-6
  threshold, 3-6
ILATH/L (interrupt latch) registers, 2-46, 2-48, 6-5, 6-7, 6-8, 6-9, 6-10, 6-11, 6-19
address, 2-43
DMA interrupts, 7-50
timer, 4-3
IMASKH/L (interrupt mask) registers, 2-47, 2-48, 6-5, 6-8, 6-9, 6-11, 6-17
address, 2-43
DMA interrupts, 7-50
reset value, 2-45
timer, 4-4
immediate address, 1-16
immediate data, 1-16
immediate extension, 1-15, 1-16
indirect addressing, 1-14
indirect jump, 1-16
INIT (SDRAM initialization sequence) bit, 2-77
initialization, link port, 9-11
INRESET (emulator output enable) bit, 2-83
instruction
address trace buffer (TBUF), 10-8
combination constraints, 1-16
dispatch/decode, See sequencer line, 1-16
slot restrictions, 1-16
instruction parallelism, 1-16, 1-33, 2-10, 2-11
instruction pipeline
exceptions, 6-15
interrupt operation, 6-9
interrupts, 6-1, 6-2
RETI register, 6-6
SOC interface, 3-6
instructions
bit field manipulation, 1-13
bit manipulation, 1-13
ccontrol flow, 1-15
immediate extension, 1-15, 1-16
INT (DMA complete interrupt enable) bit, 2-64, 7-19, 7-21
chained DMA, 7-42
INT_BUSLOCK (bus lock interrupt) bit, 2-48
INT_DMA13–0 (DMA interrupt) bits, 2-49
INT_HWERR (hardware error interrupt) bit, 2-48
INT_IRQ3–0 (IRQ3–0 interrupt pin) bits, 2-49
INT_LINK3–0 (link interrupt) bits, 2-49
INT_TIMERxH (timer high priority interrupt) bits, 2-48
INT_TIMERxL (timer low priority interrupt) bits, 2-49
INT_VIRPT (vector interrupt) bit, 2-48
INTCTL (interrupt control) register, 2-45, 4-2, 6-2, 6-5, 6-5
address, 2-43
timer, 4-1
integer arithmetic logic unit, See IALU
internal address space, 2-7
internal memory, 1-30, 1-31, 2-2, 2-3, 2-6, 2-7, 2-60, 2-61
bus arbitration, 7-1, 7-36
EPROM boot, 8-32
internal memory DMA, 1-26, 1-27, 7-3, 7-33
addressing, 7-53
chained, 7-41
internal-to-internal transfers, 7-3
link port transfers, 7-10
internal memory DMA, \textit{(continued)}
memory to link TCB, 7-68, 7-69
setup, 7-36
setup restrictions, 7-28
TCB, 7-56
TCB setup, 7-57, 7-58
throughput, 7-71
to external memory, 7-57
transfer complete, 7-49
transfers, 7-7, 7-8
type select, 7-22
internal memory space, 2-2
internal transfer, 1-23
interrupt, 1-26, 6-1, 6-1 to 6-32
autoDMA error interrupt, 7-33
clear pending, 6-8, 6-9
DMA, 6-3
exception, 2-29, 2-32, 6-2, 6-10, 6-15 to 6-17, 6-23 to 6-24
exception (software interrupt), 2-27
global enable/disable, 6-6
handling, 6-13, 6-14
hardware interrupt, 1-20
hardware interrupt, clear pending, 6-8
instruction pipeline, 6-1, 6-2, 6-9
latching, 2-46, 6-7
latency, 1-26
level/edged sensitivity, 6-5
link, 2-41, 9-10, 9-25
link port service request, 6-8, 6-20
masking, 6-5, 6-8
nesting, 6-13
priority masking, 6-6
program sequencer, 6-15
recursion, 6-13
sensitivity, edge or level, 6-2
software interrupt, 2-29, 2-32
software interrupt (exception), 2-27, 6-2, 6-10, 6-15 to 6-17, 6-23 to 6-24
interrupt, \textit{(continued)}
timer, 6-20
timer 0 expired interrupt, 4-1
timer expired interrupt, 1-24
timer expired interrupts, 4-3
types, 6-1 to 6-3
unmasking, 6-5
vector, 2-41 to 2-44, 6-21
interrupt, hardware, 6-10
interrupt bit, 6-10
interrupt control (INTCTL) register, 6-5
timer, 4-1
interrupt enable strap (IRQEN) pin, 6-21
interrupt input pin (IRQ3–0) interrupts, 6-18
interrupt latch (ILATH/L) registers, 6-5
timer, 4-3
interrupt mask, 6-1
interrupt mask (IMASKH/L) registers, 2-47, 6-5
timer, 4-4
interrupt pins, See IRQ3–0 (interrupt request) pins
interrupt priority mask (PMASKH/L) registers, 2-47
interrupt register groups, 2-12, 2-41, 2-41 to 2-44
interrupt registers, 2-41, 6-1, 6-6, 6-10, 6-11, 6-13, 6-21
RETI register, 10-5
interrupt registers group, 6-1
interrupts, DMA, 6-20
channel hardware error, 7-23
complete interrupt, 7-15, 7-42, 7-49
derror interrupt, 7-40
examples, 7-73
interrupt enable, 7-21
link port service request, 7-30
TCB load, 7-17
interrupts, hardware, 6-2, 6-10, 6-17, 6-22
interrupt flow, 6-12
operation, 6-11
interrupt service routine (ISR), 6-8
  GIE bit, 6-9
  operation, 6-9
  operation flow, 6-11
  restrictions, 6-15
interrupt service routine, See ISR (interrupt service routine)
interrupt vector, 1-31, 6-1, 6-3, 6-21
  data flow, 6-4
  register, 6-1, 6-3
interrupt vector table (IVT), 2-41 to 2-44,
  6-1, 6-3, 6-9
  register groups, 2-41
I/O buffer information specification (IBIS)
  models, 11-29
I/O device
  SOC IFIFO transactions, 3-7
  TCB setup (flyby), 7-60, 7-62
I/O enable (IOEN) pin, 1-6, 7-13, 7-32, 8-6, 8-74
I/O enable (IOEN) pin, 7-32
I/O programming
  flags, 5-3
  SOC interface, 3-8
  timer, 4-4
IORD (I/O read) pin, 1-6, 7-13, 7-32, 8-6, 8-74
I/O read (IORD) pin, 7-32
IOWR (I/O write) pin, 1-6, 7-13, 7-32, 8-6, 8-74
I/O write (IOWR) pin, 7-32
IRFREE (emulator instruction register free) bit, 2-83
IRF_EDGE (interrupt edge/level sensitivity select) bits, 2-45
IRQ3–0 (external interrupt) bits, 2-49
IRQ3–0 (external interrupt) pins, 1-6,
  1-26, 2-42, 2-45, 2-47, 3-2, 6-1, 6-2,
  6-3, 6-20, 8-6, 11-7
  clear pending, 6-8
  interrupts, 6-18
  sensitivity, 6-5
  timing, 6-32
IRQEN (interrupt enable strap) pin, 2-45,
  2-47, 6-21, 11-7
ISL (instruction line status) bit, 2-90
ISR (interrupt service routine), 6-3, 6-6,
  6-8 to 6-15
  GIE bit, 6-9
  nested ISRs, 6-13 to 6-14
  operation, 6-9, 6-11
  operation mode, 1-20
  restrictions, 6-15
  RETI register update, 6-6
  RTI/RDS instructions, 6-9
IVBUSLK (bus lock interrupt vector)
  register, 2-43, 6-21
IVDMAx (DMA complete interrupt vector) registers
  address, 2-41
  description, 6-20
IVEN (exception enable on invalid floating-point) bit, 2-20, 6-23
IVHW (hardware error interrupt vector)
  register
  description, 6-22
IVIRQx (external input interrupt vector) registers
  address, 2-42
  description, 6-20
IVKERNEL (VDK kernel interrupt vector)
  register
  address, 2-41
IVLINKx (link port service request interrupt vector) registers, 2-41
  description, 6-20
INDEX

IVSW (software interrupt vector) register, 6-15
address, 2-27
description, 6-23
IVTIMERxH/LP (timer high/low priority interrupt vector) registers
description, 6-19
IVTIMERxH/LP (timer interrupt vector) registers, 2-41, 2-43

J
J (J-IALU) bus, 1-2, 1-23
J31/JSTAT register, 2-26
J-bus, 3-1
arbitration priority, 8-11
SOC interface, 3-4
JC (J-IALU carry) bit, 2-26
See also IALU status
jitter requirements, clock, 11-10
JK (watch point J- or K-bus access monitoring) bits, 2-87
JN (J-IALU negative) bit, 2-26
See also IALU status
JSTAT (J-IALU status) register, 2-26
bit descriptions, 2-26
JTAG (joint test action group) port, 1-3, 1-21, 1-32, 2-92, 6-24, 10-11
access instruction decode, 10-13
connections, 8-6
controller, 1-21
functionality, 10-11
instruction, 1-20
instruction register, 10-13
pins, 10-11
port reset, 10-11
JTAG scan path design, 11-15
JUMP instruction latency, 1-17

junction-to-ambient thermal resistance ($\theta_{JA}$), 11-21, 11-27
junction-to-base thermal resistance ($\theta_{JB}$), 11-27
junction-to-case thermal resistance ($\theta_{JC}$), 11-27
JV (J-IALU overflow) bit, 2-26
See also IALU status
JZ (J-IALU zero) bit, 2-26
See also IALU status

K
K (K-IALU) bus, 1-2, 1-23
K-bus, 3-1
arbitration priority, 8-11
SOC interface, 3-4
KC (K-IALU carry) bit, 2-26
See also IALU status
KERNEL (VisualDSP kernel software) interrupt, 6-19
kernels, boot loader, 11-3
kernel software, 6-19
KN (K-IALU negative) bit, 2-26
See also IALU status
KV (K-IALU overflow) bit, 2-26
See also IALU status
KZ (K-IALU zero) bit, 2-26
See also IALU status

L
latency
IALU (integer ALU), 1-15
interrupt, 1-26
jump, 1-17
LBuFRXx (link port receive buffer) registers, 2-65, 9-3, 9-7
LBuFTXx (link port transmit buffer) registers, 2-65, 9-7
LCx (loop counter) register, 2-27
INDEX

LDQM (low word SDRAM data mask) pin, 1-6, 8-6
description, 8-56
description, 8-11
operations, 8-79
LEN (DMA operand length) bits, 2-64, 7-19, 7-21
restrictions, 7-27
usage, 7-35
usage restrictions, 7-29
length of data blocks, link port, 9-15
level sensitive interrupts, 6-3, 6-5, 6-19, 6-20
clearing/deasserting, 6-9
clear pending, 6-8
LINK_DWIDTH (link port data width strap) pin, 2-68
LINK3–0 (link port service request) interrupts, 6-19
link booting mode, 11-6
link clock, data latching, 9-1
link port, 1-3, 1-4, 1-5, 1-9, 1-21, 1-22, 1-32, 7-66 to 7-69
4-bit mode configuration example, 9-5
architecture, 9-2 to 9-12
block completion, 9-9
communication protocol, 9-12
configuration, 9-13
connections, 8-6
data blocks length, 9-15
data latching/driving, 9-1
data transfer timing, 9-13
description, 9-1
DMA, 9-9
error detection, 9-19
full-duplex operation, 9-1
initialization, 9-10, 9-11
interrupt, 2-41, 9-10
I/O pins, 9-4
pass-through transfers, 9-9
link port, (continued)
protocol, 9-12
receive channel, 9-3
receive data, 9-7
receiver configuration, 9-6
service request, 9-9
service requests, 7-30
SOC interface connections, 9-2
termination resistors (RT), 9-6
transfer complete, 9-14
transmission delays, 9-17
transmission start, 9-14
transmit buffer, 7-49
transmit channel, 9-3
transmit data, 9-7
transmitter configuration, 9-6
link port boot, 1-22, 9-10, 9-11, 11-2
link port clock (LxCLKIN/OUT) pins, 1-18, 3-2
link port DMA, 1-27, 7-3, 7-31, 7-66
channels, 1-27, 7-1, 7-28
control, 7-31
examples, 7-73
receive priority, 7-38
TCB setup, 7-67, 7-69, 7-70
to internal/external memory, 7-66
to internal/external memory TCB, 7-67
to link TCB, 7-70
transfers, 7-10
transfer types, 7-66
transmit priority, 7-38
link port receive buffer (LBUFRXx) registers, 2-65
link port receive control (LRCTLx) registers, 2-66
INDEX

link port registers, 2-65
  LRSTATx registers, 9-26
  LTSTATx registers, 9-26
  receive DMA registers group, 2-58
  registers group, 2-12, 2-65
  transmit DMA registers group, 2-57
link port service request (LINK3–0) interrupts, 6-8, 6-19, 6-20
link port service request interrupt, 1-26, 6-20
clear pending, 6-8
link port transmit buffer (LBUFTXx) registers, 2-65, 9-3
link port transmit control (LTCTLx) registers, 2-66
link transmit status (LTSTATx) register, 2-70, 9-26
load, 1-14
  CACMDx registers, restrictions, 2-36
  CCAIRx registers, restrictions, 2-36
loader kernels, boot, 11-3
load register, 1-23, 2-9
long word, 1-11
long word transactions, 8-7, 8-17
long word transfers, 7-7
low voltage differential signalling (LVDS), 9-1
low word SDRAM data mask (LDQM) pin, 8-1, 8-57
LRCTLx (link port receive control) registers, 2-66, 2-68, 9-8, 9-21
LREN (link receive enable) bit usage, 9-21
LRSTATx (link port receive status) registers, 9-8, 9-26
LTCTLx (link port transmit control) registers, 2-66, 2-69, 9-8, 9-21
LTEN (link transmit enable) bit, 9-24
usage, 9-21
LTSTATx (link port transmit status) registers, 2-71, 9-8, 9-26
L unit, See ALU
LVDS (low voltage differential signalling), 9-1, 11-17
LxACKI (link port transmit acknowledge input) pins, 1-6, 8-6, 9-4, 9-14
LxACKO (link port receiver acknowledge output) pins, 1-6, 8-6, 9-4
LxBCMPI (link port block complete input) pins, 1-6, 8-6, 9-4, 9-10
LxBCMPO (link port block complete output) pins, 1-6, 8-6, 9-4
delay restriction, 9-17
usage, 9-9, 9-16
LxCLKIN/OUT (link port clock) pins, 1-6, 1-18, 3-2, 8-6, 9-4
delay restriction, 9-17
LxDATI3–0/P/N (link ports data in) pins, 1-6, 8-6, 9-4
LxDATO3–0/P/N (link port data out) pins, 1-6, 8-6, 9-4

M
manual
  audience, xxi
  contents, xxii
  conventions, xxxiii
  new in this edition, xxiv
  related documents, xxviii
  manuals, related, 11-1
  masking interrupts, 6-1, 6-8
  master, bus, 8-10
  master boot mode, 11-4, 11-6
  master mode, 11-2
  masters, SOC bus, 3-1
INDEX

memory
blocks and banks, 2-2
bus arbitration, 3-3, 8-11, 8-12
connections to global, 8-6
external, 1-29
flyby mode transfers, 7-13
global, 1-28, 1-30
internal, external, multiprocessor space, 2-2
internal/external DMA, 7-3
internal-to-internal transfers, 7-3
map, 2-1
organization, 1-5
shared, 1-28
throughput, 1-22
memory, external, 1-28
memory, internal, 1-8, 1-22
memory access wait states, 1-29
memory banks, 8-19
See also external memory, 8-19
slow device protocol, 8-26
wait cycles, 8-23
memory blocks, 1-3, 1-8, 1-22
See also internal memory, 1-8
broadcast, 2-36
memory-mapped I/O, 1-28
memory read (RD) pin, 8-16, 8-27
memory refresh rate
(CASTAT_REFCNTR) bits, 2-39
memory select, 2-5
memory select SDRAM (MSSD) pin, 8-50, 8-57, 8-85, 8-89
memory write high (WRH) pin, 8-16, 8-27
memory write low (WRL) pin, 8-16, 8-27
MEMWIDTH (memory bus width select) bit, 2-74
merged access, 2-11, 2-17, 2-18
merged distribution, 2-3
mesh configuration, 1-30
MI (multiplier invalid) bit, 2-21
See also shifter status
micro architecture, 8-5
MIS (multiplier invalid, sticky) bit, 2-20
See also multiplier status
MN (multiplier negative) bit, 2-21
See also multiplier status
mode, 1-17, 1-18 to 1-19, 2-33, 10-4 to 10-5
emulator mode, 1-18, 1-20 to 1-21, 2-33, 6-10, 6-24, 10-4
supervisor, 2-73
supervisor mode, 1-18, 1-20, 2-33, 6-10, 8-13, 10-4
user, 2-73
user mode, 1-18, 1-19, 2-33, 8-13, 10-4
MODE (operation mode) bit, 2-33
mode register set (MRS) command, 8-81
MODULE (unit executing instruction) bits, 2-91
modulo addressing, 1-14
MOD-X (DMA X modify value) bits, 2-63
MOD-Y (DMA Y modify value) bits, 2-63
MOS (multiplier overflow, sticky) bit, 2-20
See also multiplier status
move register, 1-23, 2-9
MPWIDTH (multiprocessor bus width select) bit, 2-74
MR (multiplier results) registers, 2-21
MRS (mode register set) command, 8-81
pin state, 8-83
MRSCOMP (MRS complete) bit, 2-79
MSI-0 (memory bank select) pins, 1-6, 2-2, 2-3, 2-4, 2-5, 8-5, 8-6
MSH (host select) pin, 2-2, 2-3, 2-4, 2-5, 8-5, 8-6
MSSD3-0 (memory select SDRAM) pins, 1-6, 2-2, 2-4, 2-5, 8-6, 8-50
description, 8-55
example, 8-71
INDEX

MU (multiplier underflow) bit, 2-21
See also multiplier status

multiplier, 1-2, 1-4, 1-33
status, 1-33
multiplier registers, 2-21
multiply-accumulator, See multiplier
multiprocessing, 1-30, 1-31, 1-32, 2-3, 2-6, 2-7, 2-60, 8-34 to 8-49, 8-93, 10-4
external port, 8-1
illegal access, 6-23
self read error, 6-22
system, 1-7
multiprocessing bus request, See BR pin, 8-44
multiprocessing interface, 8-34 to 8-49
multiprocessor ID, See ID pin, 2-6
multiprocessor interface
SDRAM operations, 8-78
multiprocessor space, 2-6
DMA usage, 7-3
idle insertion, 8-19
register accesses, 2-8
register access restrictions, 2-9
multiprocessor system, 8-5
bus width, 8-13, 8-15
connections, 8-6
topology design, 11-16
multiprocessor systems
burst pipelined transfers, 8-19
M unit, See multiplier
MUS (multiplier underflow, sticky) bit, 2-20
See also multiplier status
MV (multiplier overflow) bit, 2-21
See also multiplier status
MVS (multiplier overflow, sticky) bit, 2-20
See also multiplier status
MZ (multiplier zero) bit, 2-21
See also multiplier status

N
nesting interrupts, 6-13
context save/restore, 6-14
NGR (non granted bus request) bits, 2-91
NMOD (normal mode) bit, 1-20, 2-29
no boot (run) mode, 11-2, 11-6
normal read or write access, 2-11
normal word
data format, 1-11
transactions, 8-7, 8-17
transfers, 7-7

O
OEN (exception enable on overflow) bit, 2-20, 6-23
OFIFO (output FIFO), external port, 7-14, 7-72, 8-8, 8-9
operand length
DMA, 7-21
setup, 7-27
operand size
external bus, 8-7
operation mode, 1-17, 1-18 to 1-19, 2-33, 10-4 to 10-5
delay emulator mode, 1-18, 1-20 to 1-21, 2-33, 6-10, 6-24, 10-4
supervisor mode, 1-18, 1-20, 2-33, 6-10, 10-4
user mode, 1-18, 1-19, 2-33, 10-4
OPMODE (watch point operation mode) bits, 2-86, 2-87, 2-88
overflow, circular buffer, 1-14

P
packing data, 1-29
INDEX

PAGE (SDRAM page boundary select) bits, 2-77
page boundary selection, 8-70
page/word size, 8-61, 8-62, 8-63, 8-65, 8-66, 8-67
parallelism rules, instruction, 1-33, 2-10
    register groups, 2-11
parallel results (PRx) registers, 2-20
pass-through transfers, link port, 9-9
    path search algorithm, 1-12
PAx (DMA channel pause) bits, 7-26
PC (program counter), 1-2
PDD (power consumed from VDD supply), 11-26
PDD_DRAM (power consumed from VDD_DRAM supply), 11-26
PDD_IO (power consumed from VDD_IO supply), 11-26
pending interrupt, clear, 6-8, 6-9
performance, clock domains and, 1-18
performance monitor, 1-21, 2-89, 2-92, 10-4, 10-8 to 10-10
    performance monitor counter (PRFCNT) register, 2-92, 10-10
    performance monitor mask (PRFM) register, 2-89, 10-9
peripheral, I/O
    SOC IFIFO transactions, 3-7
pins, 10-11
    See also pins by name
    pin state
        ACT command, 8-85
        MRS command, 8-83
        PRE command, 8-84
        Read command, 8-86
        REF command, 8-93, 8-94
        Write command, 8-90
PIPE (SDRAM pipe depth) bits, 2-77
pipeline, 1-9, 1-16, 1-17
    pipeline depth
        data transfer timing, 8-21, 8-22
        host processor, 8-46
    pipelined protocol, 2-4, 8-1, 8-14, 8-15
        basic transaction, 8-16, 8-16 to 8-18
        burst transactions, 8-18
        control signals, 8-16
        data transfer timing, 8-20
        idle cycle insertion, 8-19
        latency, 8-15
        pipeline depth, 8-2, 8-16, 8-17, 8-19, 8-51
        pipelined transactions, 8-18 to 8-22
        wait cycles, 8-23
PMASKH/L (interrupt priority mask) registers, 2-47, 2-48, 6-5, 6-6, 6-9, 6-11
    address, 2-43
    operation example, 6-7
    usage, 6-11
pointer, DMA chain, 7-20
    pointer wraparound, 1-14
POR_IN (power on reset, DRAM) pin, 1-6, 8-6
porting code, 11-8
post-modify addressing, 1-14
    [ += ] post-modify with update operator, 1-14
power supply
    decoupling, 11-10
    domains, 11-9
    grid, 11-11
    ripple, 11-10
PR (DMA priority) bit, 7-19, 7-22
PR (DMA priority) bits, 2-64
PRC2RAS (precharge to RAS delay) bits, 2-77
PRE (precharge) command, 8-51, 8-83, 8-84, 8-86, 8-90
INDEX

precharge (PRE) command, See PRE (precharge) command
precharging SDRAM, 8-83
pre-modify addressing, 1-14
[+] pre-modify no update operator, 1-14
PRE-to-ACTIV (tPRE) delay, 8-51
PRE-to-RAS (tRP) delay, 8-72
PRFCNT (performance monitor counter) register, 2-85, 2-92, 10-10
PRFM (performance monitor mask) register, 2-84, 2-89, 10-9
priority, DMA, 7-22, 7-36
priority, system, 8-37
priority access, 3-3, 8-11, 8-12
priority mask (PMASKH/L) registers, 6-5, 6-6
priority scheme, SOC bus, 3-3
processor booting methods, 11-2
processor core, 3-2
buses, 3-1
SOC OFIFO transactions, 3-5
processor core clock (CCLK), 3-1
processor ID input, 8-6
PROCID (processor ID pins input) bits, 2-79
program fetch, See sequencer
programming control registers, 10-5
programming model, 1-33
program sequencer, 1-33, 10-8
interrupts, 6-15
registers group, 2-12
protected registers, 10-4
protocol, link port, 9-12
protocol, pipelined or asynchronous, 2-4
PRx (parallel results) registers, 2-20
quad word, 1-11, 1-15
transactions, 8-7, 8-17
transfers, 7-7

R
RAS (row address select) pin, 1-6, 8-6
description, 8-55
example, 8-71
RAS-to-PRE (tRAS) delay, 8-72, 8-73
RBCMPE (receive block completion detect) bit, 2-68, 9-22
RCSER (receive checksum error) bit, 9-19, 9-20
RD (memory read) pin, 1-6, 8-6, 8-16
description, 8-4
pipelined transactions, 8-16
slow protocol, 8-27
transaction type, 8-18
RDS (return from subroutine) instruction
usage restriction, 6-9
RDSIZE (receive data size) bit, 2-68, 9-22
READ (read) command, 8-85
pin state, 8-86
READ (watch point read transaction monitoring) bit, 2-87, 2-88
receive channel, link port, 9-3
receive data, link port, 9-7
receiver configuration, link port, 9-6
receiver error detection, link port, 9-20
receiver TCB registers, 7-4, 7-8
receiving link port to link port, 7-69
recursion of interrupts, 6-13
REF (refresh) command, 8-93, 8-94
pin state, 8-93, 8-94
REF (SDRAM refresh rate) bits, 2-77
refresh cache, 2-39
refresh counter (refresh rate) value, 8-70
refresh rate, 8-1, 8-51
refresh rate (CASTAT_REFCNTR) bits, 2-39

Q
quad registers
accesses, 2-10
aligned, 1-10
INDEX

register
See also registers by name
accesses, multiprocessor, 2-8
accesses, single, dual, and quad, 2-10
access restrictions, 2-9
addresses, 2-13
addresses and register groups, 2-13
dependency check, 1-16
for addresses, 1-16
IALU registers, 6-13
interrupt operation, 6-3
load access, 2-9
memory mapped, 2-1
move access, 2-9
register groups, 2-8 to 2-65
single, double, quad, 1-10, 1-11
store access, 2-9
timer registers, 4-1
transfer access, 2-9
Ureg (universal) registers, 1-29, 1-31
register file, 1-4
register groups, 2-13
related documents, xxviii
REN (link port receive enable) bit, 2-68, 9-11, 9-12, 9-22
reserved bits, 2-13
reset, 1-21
RESET (reset) pin, 1-21
reset, interrupt vector registers, 6-3
reset, JTAG, 10-11
reset and boot, 9-11
reset initialization, link port, 9-10
reset input, system, 8-6
restore context, 6-12
restrictions
sequencer instructions, 1-16
result
availability, 1-16
RETI (return from interrupt) instruction, 1-17, 6-6
RET (return from interrupt) register, 6-6, 6-10, 6-11, 6-13, 10-5
address, 2-27
instruction pipeline, 6-6
usage, 6-10, 6-11
RETS (return from subroutine) register, 6-14
address, 2-27
exceptions, 6-15
usage, 6-10
return address, 6-10, 6-14
exceptions, 6-17
return from interrupt (RETI) registers, 6-6, 6-10, 6-11, 6-13
return from interrupt, See RTI instruction
returning from interrupt, 6-14
reverse carry addition, 1-14
RINIF (link port RX initialization forced) bit, 2-68, 9-12, 9-22
RINIV (link port RX initialization value) bit, 2-68, 9-12, 9-22
rotate, 1-13
rotating priority, DMA, 7-38
ROVER (receive overrun error) bit, 9-19
ROVRE (receive overrun error IRQ enable) bit, 2-68, 9-22
row address strobe (RAS) pin, 8-1, 8-57
rows (ROW), 7-46
rows, DMA (2D), 7-45
RST_IN (reset input) pin, 1-6, 8-6
RST_OUT (reset output) pin, 1-6, 8-6
RTER (receive timeout error) bit, 9-19
RTI (return from interrupt) instruction, 1-17, 6-6, 6-13, 6-14 to 6-15
operation, 6-15
usage restriction, 6-9
RTOE (receive time out IRQ enable) bit, 2-68, 9-22
run (no boot), 1-22
runtime initialization, link port, 9-11
INDEX

RVERE (receive enable verification) bit, 2-68, 9-14, 9-22
RWER (receive write error) bit, 9-19

S
S (SOC interface) bus, 1-3, 1-23
save context, 6-12
SBSRAM systems, 8-15
S-bus, 3-1, 8-9
  arbitration priority, 8-11
  SOC interface, 3-4
scalability and multiprocessing, 1-30
SCLK (system clock input) pin, 1-6, 1-17, 5-11, 8-6, 11-10
  usage, 11-11
SCLK_VREF (clock voltage reference) pin, 1-6, 8-6
SCLKRAT2–0 (clock ratio select) pins, 1-6, 1-18, 8-6
SCYCLE (stall cycle status) bit, 2-90
SDA10 (SDRAM address 10) pin, 1-6, 8-6, 8-57, 8-58
  description, 8-56
  example, 8-71
SDCKE (SDRAM clock enable) pin, 1-6, 8-6, 8-58, 8-89
  description, 8-56
  example, 8-71
SDRAM, (continued)
  refresh, 8-1, 8-51, 8-94
  refresh rate, 8-1, 8-51
  system example, 8-71
SDRAM clock enable (SDCKE) pin, 8-1
SDRAM configuration (SDRCON) register, 2-76, 8-2
SDRAM controller, 8-1
  commands, 8-80
  setup (EZ-Kit Lite), 8-95
SDRAM interface, 8-3, 8-50 to 8-95
  DQM operations, 8-79
  multiprocessor operations, 8-78
  powering up after reset, 8-79
  programming example, 8-94
  throughput, 8-77
SDRAM pins
  connections and data alignment, 8-60
  I/O pins, 8-55
  physical connection, 8-56
  SDCKE pin, 8-94
  SDRAM I/O pins, 8-55 to 8-56
  SDRAM physical connection, 8-60 to 8-67
  SDWE pin, 8-85
SDRAM write enable (SDWE) pin, 8-1
SDRCON (SDRAM configuration) register, 2-76, 8-50, 8-58, 8-93
  address, 2-72
  reset operations, 8-79
  usage restrictions, 8-2
SDREN (SDRAM enable) bit, 2-77
SDWE (SDRAM write enable) pin, 1-6, 8-6, 8-57
  description, 8-56
  example, 8-71
  self-refresh (SREF) command, 8-93
  semaphores, 1-30, 7-63, 8-44, 8-49
sequencer, 1-2, 1-4, 1-15, 1-22, 1-33
  immediate extensions, 1-16
  instruction restrictions, 1-16
  pipeline, 1-16
  registers, 2-27 to 2-33
  sequencer control (SQCTL) register, 2-27,
  2-29, 2-31, 6-21
  sequencer registers, 2-27
  access restrictions, 1-19
  registers group, 2-12, 2-27
  sequencer status (SQSTAT) register, 2-28,
  2-31
  sequential accesses, 8-18
  service request, link port, 9-9
  set bits, 1-13
  SFREG (static condition flags) register,
  2-28, 2-30
  shared memory, 1-28, 2-2
  shifter, 1-2, 1-4, 1-13, 1-33
    status, 1-33
  shifter registers, 2-21
  short word, 1-11
  signal integrity, 11-29
  SIMD (single-instruction, multiple-data),
  1-10
  single (Rm) registers, 1-10, 1-11
  single processor system, 1-5
  single register accesses, 2-10
  single words, 1-15
  sink-to-ambient thermal resistance ($\theta_{SA}$),
  11-27
  slave, bus, 8-10
  slave boot mode, 11-5, 11-6
  slave mode, 11-2
  slaves, SOC bus, 3-1
  slow protocol, 8-2, 8-3, 8-14, 8-26, 8-26 to
  8-31
    and back off, 8-49
    data transfer timing, 8-28, 8-29, 8-31
    EPROM access timing

  EPROM access timing, 8-33
  wait cycle insertion, 8-31
  SN (shifter negative) bit, 2-21
    See also shifter status
  SOC bus
    arbitration, 3-3
    clock (SOCCLK), 1-18, 3-1
    DMA unit, 7-1
    external port, 8-7
    flags, 5-1
    interrupts, 6-3, 6-4
    link port, 9-2
    masters/slaves, 3-1, 3-2
    timer, 4-1, 4-2
  SOCCLK (SOC bus clock), 1-18, 3-1
  SOC IFIFO
    maximum delay, 3-6
    threshold, 3-6
    transactions, 3-5, 3-7
  SOC IFIFO (input FIFO), 3-4
  SOC interface, 1-3, 2-13, 3-1
    clock (SOCCLK), 4-3
    clock rate, 1-24
    data flow, 3-2
    do’s and don’ts, 3-9
    external port, 8-10
    FIFOs and buffers, 3-4
    high priority, 3-6, 8-12
    instruction pipeline, 3-7
    operations, 3-5
    programming, 3-8
    SOC bus priority, 3-3
    SOC OBUF (output buffer), 3-4
    transactions, 3-5
  SOC OFIFO (output FIFO), 3-4
    transactions, 3-4, 3-5
  SOC Ureg (universal) registers, 2-2, 2-13,
  3-6
  software exception (SW) interrupt, 6-17
INDEX

software interrupt (exception), 1-26, 2-20, 6-2, 6-10, 6-15 to 6-17, 6-23, 6-24
operation flow, 6-16
SPD (transmit transfer speed) bits, 2-69, 9-24
special instructions, 10-5
split transactions, 3-6
SPVCMD (supervisor command) bits, 2-33
SQCTL (sequencer control) register, 1-20, 2-27, 2-29, 6-15, 6-21
address, 2-27
timer, 4-4
write stall, 2-29
SQSTAT (sequencer status) register, 1-25, 2-28, 2-31, 2-32
address, 2-28
SREF (self-refresh) command, 8-93
SSTP (watch point single step) bit, 2-86
stack, 6-13
See also nesting interrupts
stack pointer, 6-13
stalls, 1-16
static condition flag register (SFREG), 2-28, 2-30
Static Superscalar, 1-8
store/load instructions, 1-23
store register, 1-14, 1-23, 2-9
strap mode pins, boot select, 11-2
SUMEN (sum events count enable) bit, 2-90
S unit, See shifter
Supervisor, 8-13
supervisor mode, 1-20, 2-73, 8-13
supply decoupling, 11-11
support, technical or customer, xxv
SW (exception status) bit, 2-32
SW (software exception) interrupt, 6-17
SW (software interrupt enable) bit, 2-29
synchronous devices, slow protocol, 8-27
synchronous DRAM, See SDRAM
synchronous write transfer timing, 8-75
SYS_REG_WE (system register write enable strap) pin, 2-73, 2-76, 8-2, 8-13
SYSCON (system configuration) register, 2-73, 2-76, 8-19, 8-26
address, 2-72
and SDRAM, 8-56
bus pipeline depth, 8-17
host access, 2-4
host processor, 8-46
memory access protocol, 2-4
programming, 8-13 to 8-15
slow protocol, 8-30
usage restrictions, 8-2
SYSTAT (system status) register, 2-77, 6-22
address, 2-72
bus master ID, 8-45
system
design, 11-1
development enhancements, 1-5
multiprocessor, 1-7
single processor, 1-5
system, SDRAM, 8-71
system architecture, 8-5 to 8-12
system bus, 1-29
system clock (SCLK), 1-17
system clock ratio (SCLKRATx) pins, 1-18
system configuration (SYSCON) register
slow protocol, 8-30
usage restrictions, 8-2
system design guidelines, 11-17
system-on-chip (SOC), See SOC interface or SOC bus
system-on-chip (SOC) interface, 1-4, 3-1
clock rate, 1-24
system priority, 8-37
system register write enable strap  
(SYS_REG_WE) pin, 8-2
SZ (shifter zero) bit, 2-21
See also shifter status

T

target address, branch, 1-17
TBCMPE (transmit block complete enable) bit, 2-69, 9-9, 9-24
TCB, 2-56, 2-58, 6-20, 6-22, 7-4, 7-9, 
7-11, 7-15 to 7-23, 7-32, 7-34, 7-35, 7-55 to 7-63
arbitration, 8-12
description, 7-14
DMA channels, 7-34
DMA type restrictions, 7-30
external to internal transfers, 7-54 to 7-57
flyby mode, 7-65
link port DMA, 7-31, 7-66 to 7-69, 7-70
loading restrictions, 7-17
operand length restrictions, 7-27
operation, 7-12
restrictions, 7-5, 7-29
updating an active, 7-42
TCB, chained DMA, 7-12, 7-41, 7-42
chaining example, 7-44
chain loading, 7-41, 7-42, 7-43, 7-51
TCB, external I/O device to memory (flyby), 7-60, 7-61, 7-62
TCB, external memory
to internal memory, 7-54, 7-55, 7-56, 7-57
to I/O (flyby), 7-63
TCB, high priority DMA, 3-6, 7-37, 7-39, 8-12
TCB, internal memory to external memory, 7-58, 7-59
TCB, receiver, 7-4, 7-8
TCB, transmitter, 7-4, 7-8
TCB registers, 7-4, 7-5, 7-15, 7-16
DMA channel error interrupt, 7-23
DMA channels, 7-17
DMA channel status, 7-23
next TCB, 7-20
quad word relationship, 7-16
receiver, 7-4
transmitter, 7-4, 7-8
two dimension DMA operation, 7-45
usage, 7-15
TCK (test clock) pin, 8-6, 10-11
TDI (test data input) pin, 8-6, 10-11
TDO (test data output) pin, 8-6, 10-11
TDSIZE (transmit data size) bit, 2-69, 9-24
technical support, xxv
TEME bit, 1-20
TEMPTY (transmit buffer empty) bit, 2-71, 9-28
TEN (transmit enable) bit, 2-69
terminating read/write cycles, 8-83
termination resistors (RT), link port, 9-6
test, bit, 1-13
test access port (TAP) controller, 10-13
test clock (TCK) pin, 10-11
test data input (TDI) pin, 10-11
test data output (TDO) pin, 10-11
test mode select (TMS) pin, 10-11
test reset (TRST) pin, 10-11
thermal equilibrium, 11-26
thermal model, 11-24
thermal performance, 11-21
thermal power consumption, 11-21
thermal relief design, 11-17, 11-18, 11-27
thermal resistance, 11-20, 11-23, 11-26
thermal simulations, 11-25
thermal system model, 11-20
θ<sub>CA</sub> (case-to-ambient thermal resistance), 11-27

θ<sub>CA</sub> (case-to-ambient...
INDEX

θJA (junction-to-ambient thermal resistance), 11-27
θJB (junction-to-base thermal resistance), 11-27
θJC (junction-to-case thermal resistance), 11-27
θSA (sink-to-ambient thermal resistance), 11-27
THR3–0 (trellis history) registers, 2-22
time out error, link port, 9-19
timer 0 expired (TMR0E) pin, 1-24, 4-8
timer expired interrupts, 1-24, 4-3
timer high and low priority (TIMERxH/L) interrupts, 6-18, 6-19, 6-20
timer initial value (TMRINxH/L) registers, 4-1
timer run (TMRxRN) bits, 2-45, 4-1
timer running value (TIMERxH/L) registers, 4-1
timers, 1-24, 4-1 to 4-8
clock domains, 4-8
clock domains, 4-8
clock domains, 4-8
clock domains, 4-8
clock domains, 4-8
clock domains, 4-8
clock domains, 4-8
control procedure, 4-4
enable/disable, 4-1
idle state, 4-2
interrupts, 6-20
programming, 4-4
SOC bus, 4-1, 4-2
SOC interface clock (SOCCLK), 4-3
timer registers, 2-50, 4-1
TIMERxH/L (timer high and low priority) interrupts, 6-18
TIMERxH/L (timer running value) registers, 2-50, 4-1
address, 2-44
timing analysis, 11-29
TMR0E (timer 0 expired) pin, 1-6, 1-24, 3-2, 4-4, 4-8, 8-6, 11-6
TMRINxH/L (timer initial value) registers, 2-50, 4-1
TMRxRN (timer run) bits, 2-45, 4-1
TMS (test mode select) pin, 10-11
toggle, bit, 1-13
toroid configuration, 1-30
TR31–0 (trellis) data registers, 2-22
trace buffer (TRCBx) registers, 2-93
trace buffer pointer (TRCBPTR) register, 2-93
trace history, 10-4
transfer, 1-14
CACMDx register restrictions, 2-36
CCAIRx register restrictions, 2-36
transfer complete, DMA, 7-49
transfer complete, link port, 9-14
transfer control block (TCB), See TCB
transfer/move register instructions, 1-23
transfer register, 1-23, 2-9
transfers, normal/long/quad word, 7-7
transfer size (TXS), 7-46
transfer types, DMA, 7-3
transfer word count, DMA, 7-18
transmission delays, link port, 9-17
transmission start, link port, 9-14
transmit channel, link port, 9-3
transmit data, link port, 9-7
transmitter configuration, link port, 9-6
transmitter error detection, link port, 9-20
transmitter TCB registers, 7-4, 7-8
transmitter verification error, link port, 9-20
TRAP (supervisor trap) instruction, 6-23
tRAS, See ACT-to-PRE (tRAS) delay
TRCBEN (trace buffer enable) bit, 2-29
TRCBEXEN (trace buffer exception enable) bit, 2-29
TRCBMASK (trace buffer mask) register, 2-85
TRCBPTR (trace buffer pointer) register, 2-85
tree configuration, 1-30
INDEX

TROV (trellis overflow) bit, 2-21
See also CLU status

tRP, See PRE-to-ACT (tRP) delay

TRSOV (trellis overflow, sticky) bit, 2-21
See also CLU status

TRST (test reset) pin, 8-6, 10-11

TTER (transmit timeout error) bit, 2-71, 9-28
usage, 9-19

TTOE (transmit timeout IRQ enable) bit, 2-69, 9-24

Turbo code algorithm, 1-12

TVACANT (transmit buffer vacant) bit, 2-71, 9-28

TVERE (transmit verification enable) bit, 2-69, 9-24
usage, 9-14

TWER (transmit write error) bit, 2-71, 9-19, 9-28

two dimension DMA, 7-5, 7-6, 7-11, 7-19, 7-21, 7-36, 7-45
examples, 7-48
link port DMA, 7-31
register operations, 7-45
registers, 7-18
restrictions, 7-27
transfer steps, 7-46

TY (DMA type) bits, 2-64, 7-17, 7-19, 7-22
disabling DMA, 7-47
restrictions, 7-27
usage, 7-34
usage restrictions, 7-30

unmapped compute block registers, 2-19
unused bits, 2-13

Ureg (universal) registers, 1-23, 1-29, 2-2, 2-7, 2-13, 2-14, 2-19, 2-20, 2-22, 2-92
definition, 2-8
external access, 8-9
link port, 9-3
register space, 2-8
user mode, 1-19, 2-73, 8-13

V

VDD (internal supply voltage) pins, 11-9

VDD_A (analog PLL supply voltage) pins, 11-9

VDD_DRAM (DRAM supply voltage) pins, 11-9

VDD_IO (external I/O supply voltage) pins, 11-9

vector interrupt (VIRPT) interrupt, 1-26, 6-18

vector interrupt (VIRPT) register, 6-21

VIRPT (vector interrupt) register, 6-18, 6-21
address, 2-43

VisualDSP kernel software (KERNEL) interrupt, 6-19

Viterbi algorithm, 1-12

VREF (voltage reference) pin, 1-6

W

wait cycle insertion, 8-3, 8-23
slow protocol, 8-31
timing, 8-25, 8-26
wait states, memory access, 1-29
watchpoint, 1-20, 2-84, 2-86, 2-89, 10-2, 10-4, 10-5 to 10-8
match exception, 6-23
operation, 10-6

ADSP-TS201 TigerSHARC Processor Hardware Reference I-33
**INDEX**

<table>
<thead>
<tr>
<th>Watchpoint Address (WPxH/L) Registers</th>
<th>2-89</th>
</tr>
</thead>
<tbody>
<tr>
<td>Watchpoint Control (WPxCCTL) Registers</td>
<td>2-86, 10-5 to ??</td>
</tr>
<tr>
<td>Watchpoint Status (WPxSTAT) Registers</td>
<td>2-89, 10-7</td>
</tr>
<tr>
<td>WE (Write Enable) Pin, SDRAM</td>
<td>8-6</td>
</tr>
<tr>
<td>Word (Operand) Length, DMA</td>
<td>7-21</td>
</tr>
<tr>
<td>Word Alignment, External Bus</td>
<td>8-7</td>
</tr>
<tr>
<td>Word/Page Size</td>
<td>8-61, 8-62, 8-63, 8-65, 8-66, 8-67</td>
</tr>
<tr>
<td>Words, Single-, Dual-, or Quad-, 1-15</td>
<td></td>
</tr>
<tr>
<td>Word Sizes and Alignment, 1-11</td>
<td></td>
</tr>
<tr>
<td>Word Widths, 1-29</td>
<td></td>
</tr>
<tr>
<td>WPxCCTL (Watchpoint Control) Registers</td>
<td>2-86</td>
</tr>
<tr>
<td>WPxH/L (Watchpoint Address) Registers</td>
<td>2-89</td>
</tr>
<tr>
<td>Address, 2-84</td>
<td></td>
</tr>
<tr>
<td>WPxSTAT (Watchpoint Status) Registers</td>
<td>2-89</td>
</tr>
<tr>
<td>Address, 2-84</td>
<td></td>
</tr>
<tr>
<td>Bit Descriptions, 10-7</td>
<td></td>
</tr>
<tr>
<td>WRH (Memory Write High) Pin</td>
<td>1-6, 8-6, 8-16</td>
</tr>
<tr>
<td>Description, 8-4</td>
<td></td>
</tr>
<tr>
<td>Pipelined Transactions, 8-16</td>
<td></td>
</tr>
<tr>
<td>Slow Protocol, 8-27</td>
<td></td>
</tr>
<tr>
<td>Transaction Type, 8-18</td>
<td></td>
</tr>
<tr>
<td>WRITE (Watch Point Write Transaction Monitoring) Bit</td>
<td>2-87, 2-88</td>
</tr>
<tr>
<td>Write (Write) Command</td>
<td>8-89, 8-90</td>
</tr>
<tr>
<td>Write Errors, Link Port</td>
<td>9-21</td>
</tr>
<tr>
<td>Write Transaction Wait Cycle Insertion</td>
<td>8-24</td>
</tr>
<tr>
<td>WRL (Memory Write Low) Pin</td>
<td>1-6, 8-6, 8-16</td>
</tr>
<tr>
<td>Description, 8-4</td>
<td></td>
</tr>
<tr>
<td>Pipelined Transactions, 8-16</td>
<td></td>
</tr>
<tr>
<td>Slow Protocol, 8-27</td>
<td></td>
</tr>
<tr>
<td>Transaction Type, 8-18</td>
<td></td>
</tr>
</tbody>
</table>

**X**

| XSCFx (X Compute Block Static Flag) Bits | 2-30 |
| X/YSTAT Compute Block Status Registers | 2-20 |

**Y**

| YSCFx (Y Compute Block Static Flag) Bits | 2-30 |

**Z**

| ZBT SRAM | 8-15 |