ULL PCIe DMA Controller

Unlock High-Speed Data Transfer Between SmartNICs and Host CPU

ULL PCIe DMA Controller

Unlock High-Speed Data Transfer Between SmartNICs and Host CPU​

Reduce Latency and Maximize Throughput with our ULL PCIe DMA Controller

The ULL PCIe DMA Controller is a high-performance, bidirectional data transfer solution. It is designed for seamless communication between FPGAs and host CPUs over PCIe. With a round-trip time as low as 585ns*, this IP core empowers developers to maximize resource utilization and achieve ultra-low latency without compromising performance.

* RTT min latency, PCIe Gen4 x 8 lanes, Xilinx/AMD x3522pv, Intel® Core™ i9-14900KS CPU @5.90GHz, Synchronous AXI-4 Interfaces (512b, 250MHz), Payload counts: 128k

An ultra-low latency data mover engineered for real-time data performance

High speed & flexibility

Obtain exceptional performance for latency-critical core applications. The ULL PCIe DMA Controller is designed using cut-through techniques to optimize latency to move data and control/status messages across the PCIe link.

Seamless FPGA Integration

In the card-to-host direction, packets are transferred through PCIe from FPGA to host buffer memory in separate memory blocks, one or many blocks depending on the packet size. In the host-to-card path, packets are transferred directly to the FPGA memory space through optimized PCIe transactions. 

Kernel-Bypass Optimized for Linux Applications

Designed explicitly for Kernel-bypass Linux environments, our controller leverages a polling and ring buffer DMA architecture to support ultra-low latency needs in high-throughput systems.

High-Level Block Diagram

Experience lightning-fast, bidirectional data transfer between FPGA-based SmartNICs and Host CPU

ULL PCIe DMA Controller

635ns*

Dell/EMC PowerEdge R750
(Intel® Xeon® Gold 5315Y)

* RTT min latency, PCIe Gen4 x 8 lanes, Xilinx/AMD x3522pv, Intel(R) Xeon(R) Gold 5315Y CPU @3.20GHz, Synchronous AXI-4 Interfaces (512b, 250MHz), Payload counts: 128k

Our scalable frameworks evolve with your trading strategies, ensuring long-term performance and viability.

Learn less

learn-less

ULL PCIe DMA Controller

608ns*

Blackcore G3 SPR-M
(Intel® Xeon® w7-2495X)

* RTT min latency, PCIe Gen4 x 8 lanes, Xilinx/AMD x3522pv, Intel® Xeon® w7-2495X CPU @4.80GHz, Synchronous AXI-4 Interfaces (512b, 250MHz), Payload counts: 128k

Profile code to identify bottlenecks, choose efficient data structures, and implement compiler-level optimizations for high-speed performance.
Learn less
learn-less

ULL PCIe DMA Controller

585ns*

Blackcore ICON 3100-RL+
(Intel® Core™ i9-14900KS)

* RTT min latency, PCIe Gen4 x 8 lanes, Xilinx/AMD x3522pv, Intel® Core™ i9-14900KS CPU @5.90GHz, Synchronous AXI-4 Interfaces (512b, 250MHz), Payload counts: 128k

Profile code to identify bottlenecks, choose efficient data structures, and implement compiler-level optimizations for high-speed performance.
Learn less
learn-less
Ultra-Low Latency Trading Solutions

Move Data Faster with our Orthogone's ULL PCIe DMA Controller

Learn more about the specifications and advantages of our PCIe DMA Controller, engineered for latency-sensitive applications like high-frequency trading, networking, and HPC.

Reduce latency in host-FPGA communication with our high-performance PCIe DMA Controller

Delivering ultra-low latency, bidirectional data transfers between the host CPU and FPGA over a high-speed PCIe interface. User-friendly for software developers building low-latency network streaming applications. Easy to integrate with standard PCIe endpoint and OrthogoneULL TCP/IP, UDP/IP Offload Engine. Our PCIe DMA Controller supports both Intel and AMD processors.

General Features

Scalability

Highly parametrizable IP Core (Number of channels, Number of queues per channel, Memory size of each queue, Interfaces AXI-4 Stream bus width and synchronous or asynchronous to PCIe clock)​

Easy to use

Linux kernel module provided with standard delivery packages (rpm, deb), driver created to automatically detect PCIe card for early evaluation solution, library API easily handles buffer creation, transmission and reception

Implementation

Highly efficient FPGA implementation with low resources utilization, and high timing margin on -2 and -3 Xilinx/AMD UltraScale+ FPGA

Integration

Fully integrated with AMD/Xilinx PCIe Endpoint (up to Gen 4 x8), and easy integration with Orthogone TCP/IP and UDP/IP network stacks

Data Mover Tailored for High-Performance Core Applications

Designed primarily for high-frequency trading (HFT), our PCIe Direct Memeory Access (DMA) Controller also delivers ultra-low latency bidirectional data transfer for cutting-edge applications demanding the highest level of performance.

Electronic
Trading Systems

Applications
Security

Storage
Acceleration

Networking Infrastructure

High-Performance Computing (HPC)

Our ecosystem of technological partners

Here are the answers to some frequently asked questions:

We would be happy to develop FPGA solutions that combine our IP cores with modules specifically designed for your applications. This allows us to be very competitive since we have a detailed knowledge of all our IP cores as well as many blocks readily available to develop your solutions. Depending on your requirements, it is also possible for us to provide you with a complete design package allowing you to take ownership of the developed solution. We can also provide technical support and maintenance to give you peace of mind if you want us to make changes or add new features as your product evolves.

Ultra-low gate count & resources and ultra-low latency performances are the key differentiators. Our products also have excellent timing margin and a rich set of standard and advanced features. They are based on a unified architecture where all rates and options are supported using a unique Verilog source code and UVM environment.

All our IP cores are extensively verified in simulations using a Universal Verification Environment (UVM) methodology with a full suite of tests that ensures interoperability with the IEEE 802.3 standard. The products are also ported on FPGA platforms to test performances and verify interoperability with standard network testing appliance equipment.

Interested in Orthogone's products?

Fields marked with a * are mandatory.