async_tx: add the async_tx api

The async_tx api provides methods for describing a chain of asynchronous
bulk memory transfers/transforms with support for inter-transactional
dependencies. It is implemented as a dmaengine client that smooths over
the details of different hardware offload engine implementations. Code
that is written to the api can optimize for asynchronous operation and the
api will fit the chain of operations to the available offload resources.

I imagine that any piece of ADMA hardware would register with the
'async_*' subsystem, and a call to async_X would be routed as
appropriate, or be run in-line. - Neil Brown

async_tx exploits the capabilities of struct dma_async_tx_descriptor to
provide an api of the following general format:

struct dma_async_tx_descriptor *
async_<operation>(..., struct dma_async_tx_descriptor *depend_tx,
		  dma_async_tx_callback cb_fn, void *cb_param)
{
	struct dma_chan *chan = async_tx_find_channel(depend_tx, <operation>);
	struct dma_device *device = chan ? chan->device : NULL;
	int int_en = cb_fn ? 1 : 0;
	struct dma_async_tx_descriptor *tx = device ?
		device->device_prep_dma_<operation>(chan, len, int_en) : NULL;

	if (tx) { /* run <operation> asynchronously */
		...
		tx->tx_set_dest(addr, tx, index);
		...
		tx->tx_set_src(addr, tx, index);
		...
		async_tx_submit(chan, tx, flags, depend_tx, cb_fn, cb_param);
	} else { /* run <operation> synchronously */
		...
		<operation>
		...
		async_tx_sync_epilog(flags, depend_tx, cb_fn, cb_param);
	}

	return tx;
}
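
For illustration only, here is a minimal caller-side sketch (the pages,
completion, and helper names are hypothetical, not part of this patch) of
how code written to the api invokes one of these routines. Whether the copy
lands on an offload engine or falls back to a synchronous memcpy is
invisible to the caller; a NULL return simply means the operation has
already completed synchronously:

#include <linux/mm.h>
#include <linux/completion.h>
#include <linux/async_tx.h>

/* hypothetical completion handler: runs in tasklet context when the copy
 * was offloaded, or is called directly on the synchronous path
 */
static void my_copy_done(void *param)
{
	struct completion *done = param;

	complete(done);
}

static void copy_one_page(struct page *dest, struct page *src,
			  struct completion *done)
{
	struct dma_async_tx_descriptor *tx;

	/* no dependency; ack immediately since nothing chains off this tx */
	tx = async_memcpy(dest, src, 0, 0, PAGE_SIZE, ASYNC_TX_ACK,
			  NULL, my_copy_done, done);

	/* tx == NULL means the copy already ran synchronously; otherwise
	 * poke the engines so the submitted descriptor actually executes
	 */
	if (tx)
		async_tx_issue_pending_all();
}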

async_tx_find_channel() returns a capable channel from its pool. The
channel pool is organized as a per-cpu array of channel pointers. The
async_tx_rebalance() routine is tasked with managing these arrays. In the
uniprocessor case async_tx_rebalance() tries to spread responsibility
evenly over channels of similar capabilities. For example if there are two
copy+xor channels, one will handle copy operations and the other will
handle xor. In the SMP case async_tx_rebalance() attempts to spread the
operations evenly over the cpus, e.g. cpu0 gets copy channel0 and xor
channel0 while cpu1 gets copy channel1 and xor channel1. When a
dependency is specified async_tx_find_channel defaults to keeping the
operation on the same channel. An xor->copy->xor chain will stay on one
channel if it supports both operation types; otherwise the transaction will
transition between a copy and an xor resource.
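
To make the dependency handling concrete, a sketch of a small chain
(buffer pointers, the source count, and the completion are placeholders):
each call names the previous descriptor as depend_tx, and the api decides
whether the chain stays on one channel, hops between channels, or degrades
to synchronous execution:

#include <linux/mm.h>
#include <linux/completion.h>
#include <linux/async_tx.h>

static void chain_done(void *param)
{
	complete(param);
}

/* rebuild a block by xor, copy it out, then signal completion; src_cnt
 * must be at least 2 (async_xor BUG_ONs single-source requests)
 */
static void rebuild_and_copy(struct page *dest, struct page **srcs,
			     int src_cnt, struct page *out,
			     struct completion *done)
{
	struct dma_async_tx_descriptor *tx;

	/* first operation: no dependency; dest is not a source, so ask
	 * the synchronous fallback to zero it before accumulating
	 */
	tx = async_xor(dest, srcs, 0, src_cnt, PAGE_SIZE,
		       ASYNC_TX_XOR_ZERO_DST, NULL, NULL, NULL);

	/* second operation depends on the first; ASYNC_TX_DEP_ACK lets the
	 * api retire the parent descriptor once the link is recorded
	 */
	tx = async_memcpy(out, dest, 0, 0, PAGE_SIZE, ASYNC_TX_DEP_ACK,
			  tx, NULL, NULL);

	/* run a callback only after the whole chain has completed */
	async_trigger_callback(ASYNC_TX_DEP_ACK | ASYNC_TX_ACK, tx,
			       chain_done, done);

	/* nothing is started until the channels are kicked */
	async_tx_issue_pending_all();
}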

Currently the raid5 implementation in the MD raid456 driver has been
converted to the async_tx api. A driver for the offload engines on the
Intel Xscale series of I/O processors, iop-adma, is provided in a later
commit. With the iop-adma driver and async_tx, raid456 is able to offload
copy, xor, and xor-zero-sum operations to hardware engines.

On iop342 tiobench showed higher throughput for sequential writes (20 - 30%
improvement) and sequential reads to a degraded array (40 - 55%
improvement). For the other cases performance was roughly equal, +/- a few
percentage points. On an x86-smp platform the performance of the async_tx
implementation (in synchronous mode) was also within a few percentage points
of the original implementation. According to 'top' on iop342 CPU
utilization drops from ~50% to ~15% during a 'resync' while the speed
according to /proc/mdstat doubles from ~25 MB/s to ~50 MB/s.

The tiobench command line used for testing was: tiobench --size 2048
--block 4096 --block 131072 --dir /mnt/raid --numruns 5
* iop342 had 1GB of memory available

Details:
* if CONFIG_DMA_ENGINE=n the asynchronous path is compiled away by making
async_tx_find_channel a static inline routine that always returns NULL
* when a callback is specified for a given transaction an interrupt will
fire at operation completion time and the callback will occur in a
tasklet. If the channel does not support interrupts then a live
polling wait will be performed
* the api is written as a dmaengine client that requests all available
channels
* In support of dependencies the api implicitly schedules channel-switch
interrupts. The interrupt triggers the cleanup tasklet which causes
pending operations to be scheduled on the next channel
* Xor engines treat an xor destination address differently than a software
xor routine. To the software routine the destination address is an implied
source, whereas engines treat it as a write-only destination. This patch
modifies the xor_blocks routine to take an explicit destination address
to mirror the hardware (see the sketch below).
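
A rough sketch of the calling-convention change (buffer names are
placeholders; based on the crypto/xor.c and raid5.c hunks below). The
destination used to travel as ptr[0] and be counted among the blocks; now
it is passed explicitly and the count covers sources only, while the
software path still accumulates into it:

#include <linux/string.h>
#include <linux/raid/xor.h>

/* dest and srcs[] are placeholder buffers; in raid5 they are the stripe
 * page addresses collected by check_xor()
 */
static void sketch_parity(void *dest, void **srcs, unsigned int src_cnt,
			  unsigned int bytes)
{
	/* old convention: xor_blocks(src_cnt + 1, bytes, ptr) with
	 * ptr[0] == dest, i.e. the destination was an implied source
	 */
	memset(dest, 0, bytes);			/* start from zero parity */
	xor_blocks(src_cnt, bytes, dest, srcs);	/* dest ^= each source */
}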

Changelog:
* fixed a leftover debug print
* don't allow callbacks in async_interrupt_cond
* fixed xor_block changes
* fixed usage of ASYNC_TX_XOR_DROP_DEST
* drop dma mapping methods, suggested by Chris Leech
* printk warning fixups from Andrew Morton
* don't use inline in C files, Adrian Bunk
* select the API when MD is enabled
* BUG_ON xor source counts <= 1
* implicitly handle hardware concerns like channel switching and
interrupts, Neil Brown
* remove the per operation type list, and distribute operation capabilities
evenly amongst the available channels
* simplify async_tx_find_channel to optimize the fast path
* introduce the channel_table_initialized flag to prevent early calls to
the api
* reorganize the code to mimic crypto
* include mm.h as not all archs include it in dma-mapping.h
* make the Kconfig options non-user visible, Adrian Bunk
* move async_tx under crypto since it is meant as 'core' functionality, and
the two may share algorithms in the future
* move large inline functions into c files
* checkpatch.pl fixes
* gpl v2 only correction

Cc: Herbert Xu <herbert@gondor.apana.org.au>
Signed-off-by: Dan Williams <dan.j.williams@intel.com>
Acked-By: NeilBrown <neilb@suse.de>

+1294 -50
+5 -1
crypto/Kconfig
··· 5 5 tristate 6 6 7 7 # 8 + # async_tx api: hardware offloaded memory transfer/transform support 9 + # 10 + source "crypto/async_tx/Kconfig" 11 + 12 + # 8 13 # Cryptographic API Configuration 9 14 # 10 - 11 15 menu "Cryptographic options" 12 16 13 17 config CRYPTO
+1 -1
crypto/Makefile
··· 55 55 # generic algorithms and the async_tx api 56 56 # 57 57 obj-$(CONFIG_XOR_BLOCKS) += xor.o 58 - 58 + obj-$(CONFIG_ASYNC_CORE) += async_tx/
+16
crypto/async_tx/Kconfig
··· 1 + config ASYNC_CORE 2 + tristate 3 + 4 + config ASYNC_MEMCPY 5 + tristate 6 + select ASYNC_CORE 7 + 8 + config ASYNC_XOR 9 + tristate 10 + select ASYNC_CORE 11 + select XOR_BLOCKS 12 + 13 + config ASYNC_MEMSET 14 + tristate 15 + select ASYNC_CORE 16 +
+4
crypto/async_tx/Makefile
··· 1 + obj-$(CONFIG_ASYNC_CORE) += async_tx.o 2 + obj-$(CONFIG_ASYNC_MEMCPY) += async_memcpy.o 3 + obj-$(CONFIG_ASYNC_MEMSET) += async_memset.o 4 + obj-$(CONFIG_ASYNC_XOR) += async_xor.o
+131
crypto/async_tx/async_memcpy.c
··· 1 + /* 2 + * copy offload engine support 3 + * 4 + * Copyright © 2006, Intel Corporation. 5 + * 6 + * Dan Williams <dan.j.williams@intel.com> 7 + * 8 + * with architecture considerations by: 9 + * Neil Brown <neilb@suse.de> 10 + * Jeff Garzik <jeff@garzik.org> 11 + * 12 + * This program is free software; you can redistribute it and/or modify it 13 + * under the terms and conditions of the GNU General Public License, 14 + * version 2, as published by the Free Software Foundation. 15 + * 16 + * This program is distributed in the hope it will be useful, but WITHOUT 17 + * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or 18 + * FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for 19 + * more details. 20 + * 21 + * You should have received a copy of the GNU General Public License along with 22 + * this program; if not, write to the Free Software Foundation, Inc., 23 + * 51 Franklin St - Fifth Floor, Boston, MA 02110-1301 USA. 24 + * 25 + */ 26 + #include <linux/kernel.h> 27 + #include <linux/highmem.h> 28 + #include <linux/mm.h> 29 + #include <linux/dma-mapping.h> 30 + #include <linux/async_tx.h> 31 + 32 + /** 33 + * async_memcpy - attempt to copy memory with a dma engine. 34 + * @dest: destination page 35 + * @src: src page 36 + * @offset: offset in pages to start transaction 37 + * @len: length in bytes 38 + * @flags: ASYNC_TX_ASSUME_COHERENT, ASYNC_TX_ACK, ASYNC_TX_DEP_ACK, 39 + * ASYNC_TX_KMAP_SRC, ASYNC_TX_KMAP_DST 40 + * @depend_tx: memcpy depends on the result of this transaction 41 + * @cb_fn: function to call when the memcpy completes 42 + * @cb_param: parameter to pass to the callback routine 43 + */ 44 + struct dma_async_tx_descriptor * 45 + async_memcpy(struct page *dest, struct page *src, unsigned int dest_offset, 46 + unsigned int src_offset, size_t len, enum async_tx_flags flags, 47 + struct dma_async_tx_descriptor *depend_tx, 48 + dma_async_tx_callback cb_fn, void *cb_param) 49 + { 50 + struct dma_chan *chan = async_tx_find_channel(depend_tx, DMA_MEMCPY); 51 + struct dma_device *device = chan ? chan->device : NULL; 52 + int int_en = cb_fn ? 1 : 0; 53 + struct dma_async_tx_descriptor *tx = device ? 54 + device->device_prep_dma_memcpy(chan, len, 55 + int_en) : NULL; 56 + 57 + if (tx) { /* run the memcpy asynchronously */ 58 + dma_addr_t addr; 59 + enum dma_data_direction dir; 60 + 61 + pr_debug("%s: (async) len: %zu\n", __FUNCTION__, len); 62 + 63 + dir = (flags & ASYNC_TX_ASSUME_COHERENT) ? 64 + DMA_NONE : DMA_FROM_DEVICE; 65 + 66 + addr = dma_map_page(device->dev, dest, dest_offset, len, dir); 67 + tx->tx_set_dest(addr, tx, 0); 68 + 69 + dir = (flags & ASYNC_TX_ASSUME_COHERENT) ? 
70 + DMA_NONE : DMA_TO_DEVICE; 71 + 72 + addr = dma_map_page(device->dev, src, src_offset, len, dir); 73 + tx->tx_set_src(addr, tx, 0); 74 + 75 + async_tx_submit(chan, tx, flags, depend_tx, cb_fn, cb_param); 76 + } else { /* run the memcpy synchronously */ 77 + void *dest_buf, *src_buf; 78 + pr_debug("%s: (sync) len: %zu\n", __FUNCTION__, len); 79 + 80 + /* wait for any prerequisite operations */ 81 + if (depend_tx) { 82 + /* if ack is already set then we cannot be sure 83 + * we are referring to the correct operation 84 + */ 85 + BUG_ON(depend_tx->ack); 86 + if (dma_wait_for_async_tx(depend_tx) == DMA_ERROR) 87 + panic("%s: DMA_ERROR waiting for depend_tx\n", 88 + __FUNCTION__); 89 + } 90 + 91 + if (flags & ASYNC_TX_KMAP_DST) 92 + dest_buf = kmap_atomic(dest, KM_USER0) + dest_offset; 93 + else 94 + dest_buf = page_address(dest) + dest_offset; 95 + 96 + if (flags & ASYNC_TX_KMAP_SRC) 97 + src_buf = kmap_atomic(src, KM_USER0) + src_offset; 98 + else 99 + src_buf = page_address(src) + src_offset; 100 + 101 + memcpy(dest_buf, src_buf, len); 102 + 103 + if (flags & ASYNC_TX_KMAP_DST) 104 + kunmap_atomic(dest_buf, KM_USER0); 105 + 106 + if (flags & ASYNC_TX_KMAP_SRC) 107 + kunmap_atomic(src_buf, KM_USER0); 108 + 109 + async_tx_sync_epilog(flags, depend_tx, cb_fn, cb_param); 110 + } 111 + 112 + return tx; 113 + } 114 + EXPORT_SYMBOL_GPL(async_memcpy); 115 + 116 + static int __init async_memcpy_init(void) 117 + { 118 + return 0; 119 + } 120 + 121 + static void __exit async_memcpy_exit(void) 122 + { 123 + do { } while (0); 124 + } 125 + 126 + module_init(async_memcpy_init); 127 + module_exit(async_memcpy_exit); 128 + 129 + MODULE_AUTHOR("Intel Corporation"); 130 + MODULE_DESCRIPTION("asynchronous memcpy api"); 131 + MODULE_LICENSE("GPL");
+109
crypto/async_tx/async_memset.c
··· 1 + /* 2 + * memory fill offload engine support 3 + * 4 + * Copyright © 2006, Intel Corporation. 5 + * 6 + * Dan Williams <dan.j.williams@intel.com> 7 + * 8 + * with architecture considerations by: 9 + * Neil Brown <neilb@suse.de> 10 + * Jeff Garzik <jeff@garzik.org> 11 + * 12 + * This program is free software; you can redistribute it and/or modify it 13 + * under the terms and conditions of the GNU General Public License, 14 + * version 2, as published by the Free Software Foundation. 15 + * 16 + * This program is distributed in the hope it will be useful, but WITHOUT 17 + * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or 18 + * FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for 19 + * more details. 20 + * 21 + * You should have received a copy of the GNU General Public License along with 22 + * this program; if not, write to the Free Software Foundation, Inc., 23 + * 51 Franklin St - Fifth Floor, Boston, MA 02110-1301 USA. 24 + * 25 + */ 26 + #include <linux/kernel.h> 27 + #include <linux/interrupt.h> 28 + #include <linux/mm.h> 29 + #include <linux/dma-mapping.h> 30 + #include <linux/async_tx.h> 31 + 32 + /** 33 + * async_memset - attempt to fill memory with a dma engine. 34 + * @dest: destination page 35 + * @val: fill value 36 + * @offset: offset in pages to start transaction 37 + * @len: length in bytes 38 + * @flags: ASYNC_TX_ASSUME_COHERENT, ASYNC_TX_ACK, ASYNC_TX_DEP_ACK 39 + * @depend_tx: memset depends on the result of this transaction 40 + * @cb_fn: function to call when the memcpy completes 41 + * @cb_param: parameter to pass to the callback routine 42 + */ 43 + struct dma_async_tx_descriptor * 44 + async_memset(struct page *dest, int val, unsigned int offset, 45 + size_t len, enum async_tx_flags flags, 46 + struct dma_async_tx_descriptor *depend_tx, 47 + dma_async_tx_callback cb_fn, void *cb_param) 48 + { 49 + struct dma_chan *chan = async_tx_find_channel(depend_tx, DMA_MEMSET); 50 + struct dma_device *device = chan ? chan->device : NULL; 51 + int int_en = cb_fn ? 1 : 0; 52 + struct dma_async_tx_descriptor *tx = device ? 53 + device->device_prep_dma_memset(chan, val, len, 54 + int_en) : NULL; 55 + 56 + if (tx) { /* run the memset asynchronously */ 57 + dma_addr_t dma_addr; 58 + enum dma_data_direction dir; 59 + 60 + pr_debug("%s: (async) len: %zu\n", __FUNCTION__, len); 61 + dir = (flags & ASYNC_TX_ASSUME_COHERENT) ? 
62 + DMA_NONE : DMA_FROM_DEVICE; 63 + 64 + dma_addr = dma_map_page(device->dev, dest, offset, len, dir); 65 + tx->tx_set_dest(dma_addr, tx, 0); 66 + 67 + async_tx_submit(chan, tx, flags, depend_tx, cb_fn, cb_param); 68 + } else { /* run the memset synchronously */ 69 + void *dest_buf; 70 + pr_debug("%s: (sync) len: %zu\n", __FUNCTION__, len); 71 + 72 + dest_buf = (void *) (((char *) page_address(dest)) + offset); 73 + 74 + /* wait for any prerequisite operations */ 75 + if (depend_tx) { 76 + /* if ack is already set then we cannot be sure 77 + * we are referring to the correct operation 78 + */ 79 + BUG_ON(depend_tx->ack); 80 + if (dma_wait_for_async_tx(depend_tx) == DMA_ERROR) 81 + panic("%s: DMA_ERROR waiting for depend_tx\n", 82 + __FUNCTION__); 83 + } 84 + 85 + memset(dest_buf, val, len); 86 + 87 + async_tx_sync_epilog(flags, depend_tx, cb_fn, cb_param); 88 + } 89 + 90 + return tx; 91 + } 92 + EXPORT_SYMBOL_GPL(async_memset); 93 + 94 + static int __init async_memset_init(void) 95 + { 96 + return 0; 97 + } 98 + 99 + static void __exit async_memset_exit(void) 100 + { 101 + do { } while (0); 102 + } 103 + 104 + module_init(async_memset_init); 105 + module_exit(async_memset_exit); 106 + 107 + MODULE_AUTHOR("Intel Corporation"); 108 + MODULE_DESCRIPTION("asynchronous memset api"); 109 + MODULE_LICENSE("GPL");
+497
crypto/async_tx/async_tx.c
··· 1 + /* 2 + * core routines for the asynchronous memory transfer/transform api 3 + * 4 + * Copyright © 2006, Intel Corporation. 5 + * 6 + * Dan Williams <dan.j.williams@intel.com> 7 + * 8 + * with architecture considerations by: 9 + * Neil Brown <neilb@suse.de> 10 + * Jeff Garzik <jeff@garzik.org> 11 + * 12 + * This program is free software; you can redistribute it and/or modify it 13 + * under the terms and conditions of the GNU General Public License, 14 + * version 2, as published by the Free Software Foundation. 15 + * 16 + * This program is distributed in the hope it will be useful, but WITHOUT 17 + * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or 18 + * FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for 19 + * more details. 20 + * 21 + * You should have received a copy of the GNU General Public License along with 22 + * this program; if not, write to the Free Software Foundation, Inc., 23 + * 51 Franklin St - Fifth Floor, Boston, MA 02110-1301 USA. 24 + * 25 + */ 26 + #include <linux/kernel.h> 27 + #include <linux/async_tx.h> 28 + 29 + #ifdef CONFIG_DMA_ENGINE 30 + static enum dma_state_client 31 + dma_channel_add_remove(struct dma_client *client, 32 + struct dma_chan *chan, enum dma_state state); 33 + 34 + static struct dma_client async_tx_dma = { 35 + .event_callback = dma_channel_add_remove, 36 + /* .cap_mask == 0 defaults to all channels */ 37 + }; 38 + 39 + /** 40 + * dma_cap_mask_all - enable iteration over all operation types 41 + */ 42 + static dma_cap_mask_t dma_cap_mask_all; 43 + 44 + /** 45 + * chan_ref_percpu - tracks channel allocations per core/opertion 46 + */ 47 + struct chan_ref_percpu { 48 + struct dma_chan_ref *ref; 49 + }; 50 + 51 + static int channel_table_initialized; 52 + static struct chan_ref_percpu *channel_table[DMA_TX_TYPE_END]; 53 + 54 + /** 55 + * async_tx_lock - protect modification of async_tx_master_list and serialize 56 + * rebalance operations 57 + */ 58 + static spinlock_t async_tx_lock; 59 + 60 + static struct list_head 61 + async_tx_master_list = LIST_HEAD_INIT(async_tx_master_list); 62 + 63 + /* async_tx_issue_pending_all - start all transactions on all channels */ 64 + void async_tx_issue_pending_all(void) 65 + { 66 + struct dma_chan_ref *ref; 67 + 68 + rcu_read_lock(); 69 + list_for_each_entry_rcu(ref, &async_tx_master_list, node) 70 + ref->chan->device->device_issue_pending(ref->chan); 71 + rcu_read_unlock(); 72 + } 73 + EXPORT_SYMBOL_GPL(async_tx_issue_pending_all); 74 + 75 + /* dma_wait_for_async_tx - spin wait for a transcation to complete 76 + * @tx: transaction to wait on 77 + */ 78 + enum dma_status 79 + dma_wait_for_async_tx(struct dma_async_tx_descriptor *tx) 80 + { 81 + enum dma_status status; 82 + struct dma_async_tx_descriptor *iter; 83 + 84 + if (!tx) 85 + return DMA_SUCCESS; 86 + 87 + /* poll through the dependency chain, return when tx is complete */ 88 + do { 89 + iter = tx; 90 + while (iter->cookie == -EBUSY) 91 + iter = iter->parent; 92 + 93 + status = dma_sync_wait(iter->chan, iter->cookie); 94 + } while (status == DMA_IN_PROGRESS || (iter != tx)); 95 + 96 + return status; 97 + } 98 + EXPORT_SYMBOL_GPL(dma_wait_for_async_tx); 99 + 100 + /* async_tx_run_dependencies - helper routine for dma drivers to process 101 + * (start) dependent operations on their target channel 102 + * @tx: transaction with dependencies 103 + */ 104 + void 105 + async_tx_run_dependencies(struct dma_async_tx_descriptor *tx) 106 + { 107 + struct dma_async_tx_descriptor *dep_tx, *_dep_tx; 108 + struct 
dma_device *dev; 109 + struct dma_chan *chan; 110 + 111 + list_for_each_entry_safe(dep_tx, _dep_tx, &tx->depend_list, 112 + depend_node) { 113 + chan = dep_tx->chan; 114 + dev = chan->device; 115 + /* we can't depend on ourselves */ 116 + BUG_ON(chan == tx->chan); 117 + list_del(&dep_tx->depend_node); 118 + tx->tx_submit(dep_tx); 119 + 120 + /* we need to poke the engine as client code does not 121 + * know about dependency submission events 122 + */ 123 + dev->device_issue_pending(chan); 124 + } 125 + } 126 + EXPORT_SYMBOL_GPL(async_tx_run_dependencies); 127 + 128 + static void 129 + free_dma_chan_ref(struct rcu_head *rcu) 130 + { 131 + struct dma_chan_ref *ref; 132 + ref = container_of(rcu, struct dma_chan_ref, rcu); 133 + kfree(ref); 134 + } 135 + 136 + static void 137 + init_dma_chan_ref(struct dma_chan_ref *ref, struct dma_chan *chan) 138 + { 139 + INIT_LIST_HEAD(&ref->node); 140 + INIT_RCU_HEAD(&ref->rcu); 141 + ref->chan = chan; 142 + atomic_set(&ref->count, 0); 143 + } 144 + 145 + /** 146 + * get_chan_ref_by_cap - returns the nth channel of the given capability 147 + * defaults to returning the channel with the desired capability and the 148 + * lowest reference count if the index can not be satisfied 149 + * @cap: capability to match 150 + * @index: nth channel desired, passing -1 has the effect of forcing the 151 + * default return value 152 + */ 153 + static struct dma_chan_ref * 154 + get_chan_ref_by_cap(enum dma_transaction_type cap, int index) 155 + { 156 + struct dma_chan_ref *ret_ref = NULL, *min_ref = NULL, *ref; 157 + 158 + rcu_read_lock(); 159 + list_for_each_entry_rcu(ref, &async_tx_master_list, node) 160 + if (dma_has_cap(cap, ref->chan->device->cap_mask)) { 161 + if (!min_ref) 162 + min_ref = ref; 163 + else if (atomic_read(&ref->count) < 164 + atomic_read(&min_ref->count)) 165 + min_ref = ref; 166 + 167 + if (index-- == 0) { 168 + ret_ref = ref; 169 + break; 170 + } 171 + } 172 + rcu_read_unlock(); 173 + 174 + if (!ret_ref) 175 + ret_ref = min_ref; 176 + 177 + if (ret_ref) 178 + atomic_inc(&ret_ref->count); 179 + 180 + return ret_ref; 181 + } 182 + 183 + /** 184 + * async_tx_rebalance - redistribute the available channels, optimize 185 + * for cpu isolation in the SMP case, and opertaion isolation in the 186 + * uniprocessor case 187 + */ 188 + static void async_tx_rebalance(void) 189 + { 190 + int cpu, cap, cpu_idx = 0; 191 + unsigned long flags; 192 + 193 + if (!channel_table_initialized) 194 + return; 195 + 196 + spin_lock_irqsave(&async_tx_lock, flags); 197 + 198 + /* undo the last distribution */ 199 + for_each_dma_cap_mask(cap, dma_cap_mask_all) 200 + for_each_possible_cpu(cpu) { 201 + struct dma_chan_ref *ref = 202 + per_cpu_ptr(channel_table[cap], cpu)->ref; 203 + if (ref) { 204 + atomic_set(&ref->count, 0); 205 + per_cpu_ptr(channel_table[cap], cpu)->ref = 206 + NULL; 207 + } 208 + } 209 + 210 + for_each_dma_cap_mask(cap, dma_cap_mask_all) 211 + for_each_online_cpu(cpu) { 212 + struct dma_chan_ref *new; 213 + if (NR_CPUS > 1) 214 + new = get_chan_ref_by_cap(cap, cpu_idx++); 215 + else 216 + new = get_chan_ref_by_cap(cap, -1); 217 + 218 + per_cpu_ptr(channel_table[cap], cpu)->ref = new; 219 + } 220 + 221 + spin_unlock_irqrestore(&async_tx_lock, flags); 222 + } 223 + 224 + static enum dma_state_client 225 + dma_channel_add_remove(struct dma_client *client, 226 + struct dma_chan *chan, enum dma_state state) 227 + { 228 + unsigned long found, flags; 229 + struct dma_chan_ref *master_ref, *ref; 230 + enum dma_state_client ack = DMA_DUP; /* default: take no action 
*/ 231 + 232 + switch (state) { 233 + case DMA_RESOURCE_AVAILABLE: 234 + found = 0; 235 + rcu_read_lock(); 236 + list_for_each_entry_rcu(ref, &async_tx_master_list, node) 237 + if (ref->chan == chan) { 238 + found = 1; 239 + break; 240 + } 241 + rcu_read_unlock(); 242 + 243 + pr_debug("async_tx: dma resource available [%s]\n", 244 + found ? "old" : "new"); 245 + 246 + if (!found) 247 + ack = DMA_ACK; 248 + else 249 + break; 250 + 251 + /* add the channel to the generic management list */ 252 + master_ref = kmalloc(sizeof(*master_ref), GFP_KERNEL); 253 + if (master_ref) { 254 + /* keep a reference until async_tx is unloaded */ 255 + dma_chan_get(chan); 256 + init_dma_chan_ref(master_ref, chan); 257 + spin_lock_irqsave(&async_tx_lock, flags); 258 + list_add_tail_rcu(&master_ref->node, 259 + &async_tx_master_list); 260 + spin_unlock_irqrestore(&async_tx_lock, 261 + flags); 262 + } else { 263 + printk(KERN_WARNING "async_tx: unable to create" 264 + " new master entry in response to" 265 + " a DMA_RESOURCE_ADDED event" 266 + " (-ENOMEM)\n"); 267 + return 0; 268 + } 269 + 270 + async_tx_rebalance(); 271 + break; 272 + case DMA_RESOURCE_REMOVED: 273 + found = 0; 274 + spin_lock_irqsave(&async_tx_lock, flags); 275 + list_for_each_entry_rcu(ref, &async_tx_master_list, node) 276 + if (ref->chan == chan) { 277 + /* permit backing devices to go away */ 278 + dma_chan_put(ref->chan); 279 + list_del_rcu(&ref->node); 280 + call_rcu(&ref->rcu, free_dma_chan_ref); 281 + found = 1; 282 + break; 283 + } 284 + spin_unlock_irqrestore(&async_tx_lock, flags); 285 + 286 + pr_debug("async_tx: dma resource removed [%s]\n", 287 + found ? "ours" : "not ours"); 288 + 289 + if (found) 290 + ack = DMA_ACK; 291 + else 292 + break; 293 + 294 + async_tx_rebalance(); 295 + break; 296 + case DMA_RESOURCE_SUSPEND: 297 + case DMA_RESOURCE_RESUME: 298 + printk(KERN_WARNING "async_tx: does not support dma channel" 299 + " suspend/resume\n"); 300 + break; 301 + default: 302 + BUG(); 303 + } 304 + 305 + return ack; 306 + } 307 + 308 + static int __init 309 + async_tx_init(void) 310 + { 311 + enum dma_transaction_type cap; 312 + 313 + spin_lock_init(&async_tx_lock); 314 + bitmap_fill(dma_cap_mask_all.bits, DMA_TX_TYPE_END); 315 + 316 + /* an interrupt will never be an explicit operation type. 
317 + * clearing this bit prevents allocation to a slot in 'channel_table' 318 + */ 319 + clear_bit(DMA_INTERRUPT, dma_cap_mask_all.bits); 320 + 321 + for_each_dma_cap_mask(cap, dma_cap_mask_all) { 322 + channel_table[cap] = alloc_percpu(struct chan_ref_percpu); 323 + if (!channel_table[cap]) 324 + goto err; 325 + } 326 + 327 + channel_table_initialized = 1; 328 + dma_async_client_register(&async_tx_dma); 329 + dma_async_client_chan_request(&async_tx_dma); 330 + 331 + printk(KERN_INFO "async_tx: api initialized (async)\n"); 332 + 333 + return 0; 334 + err: 335 + printk(KERN_ERR "async_tx: initialization failure\n"); 336 + 337 + while (--cap >= 0) 338 + free_percpu(channel_table[cap]); 339 + 340 + return 1; 341 + } 342 + 343 + static void __exit async_tx_exit(void) 344 + { 345 + enum dma_transaction_type cap; 346 + 347 + channel_table_initialized = 0; 348 + 349 + for_each_dma_cap_mask(cap, dma_cap_mask_all) 350 + if (channel_table[cap]) 351 + free_percpu(channel_table[cap]); 352 + 353 + dma_async_client_unregister(&async_tx_dma); 354 + } 355 + 356 + /** 357 + * async_tx_find_channel - find a channel to carry out the operation or let 358 + * the transaction execute synchronously 359 + * @depend_tx: transaction dependency 360 + * @tx_type: transaction type 361 + */ 362 + struct dma_chan * 363 + async_tx_find_channel(struct dma_async_tx_descriptor *depend_tx, 364 + enum dma_transaction_type tx_type) 365 + { 366 + /* see if we can keep the chain on one channel */ 367 + if (depend_tx && 368 + dma_has_cap(tx_type, depend_tx->chan->device->cap_mask)) 369 + return depend_tx->chan; 370 + else if (likely(channel_table_initialized)) { 371 + struct dma_chan_ref *ref; 372 + int cpu = get_cpu(); 373 + ref = per_cpu_ptr(channel_table[tx_type], cpu)->ref; 374 + put_cpu(); 375 + return ref ? 
ref->chan : NULL; 376 + } else 377 + return NULL; 378 + } 379 + EXPORT_SYMBOL_GPL(async_tx_find_channel); 380 + #else 381 + static int __init async_tx_init(void) 382 + { 383 + printk(KERN_INFO "async_tx: api initialized (sync-only)\n"); 384 + return 0; 385 + } 386 + 387 + static void __exit async_tx_exit(void) 388 + { 389 + do { } while (0); 390 + } 391 + #endif 392 + 393 + void 394 + async_tx_submit(struct dma_chan *chan, struct dma_async_tx_descriptor *tx, 395 + enum async_tx_flags flags, struct dma_async_tx_descriptor *depend_tx, 396 + dma_async_tx_callback cb_fn, void *cb_param) 397 + { 398 + tx->callback = cb_fn; 399 + tx->callback_param = cb_param; 400 + 401 + /* set this new tx to run after depend_tx if: 402 + * 1/ a dependency exists (depend_tx is !NULL) 403 + * 2/ the tx can not be submitted to the current channel 404 + */ 405 + if (depend_tx && depend_tx->chan != chan) { 406 + /* if ack is already set then we cannot be sure 407 + * we are referring to the correct operation 408 + */ 409 + BUG_ON(depend_tx->ack); 410 + 411 + tx->parent = depend_tx; 412 + spin_lock_bh(&depend_tx->lock); 413 + list_add_tail(&tx->depend_node, &depend_tx->depend_list); 414 + if (depend_tx->cookie == 0) { 415 + struct dma_chan *dep_chan = depend_tx->chan; 416 + struct dma_device *dep_dev = dep_chan->device; 417 + dep_dev->device_dependency_added(dep_chan); 418 + } 419 + spin_unlock_bh(&depend_tx->lock); 420 + 421 + /* schedule an interrupt to trigger the channel switch */ 422 + async_trigger_callback(ASYNC_TX_ACK, depend_tx, NULL, NULL); 423 + } else { 424 + tx->parent = NULL; 425 + tx->tx_submit(tx); 426 + } 427 + 428 + if (flags & ASYNC_TX_ACK) 429 + async_tx_ack(tx); 430 + 431 + if (depend_tx && (flags & ASYNC_TX_DEP_ACK)) 432 + async_tx_ack(depend_tx); 433 + } 434 + EXPORT_SYMBOL_GPL(async_tx_submit); 435 + 436 + /** 437 + * async_trigger_callback - schedules the callback function to be run after 438 + * any dependent operations have been completed. 439 + * @flags: ASYNC_TX_ACK, ASYNC_TX_DEP_ACK 440 + * @depend_tx: 'callback' requires the completion of this transaction 441 + * @cb_fn: function to call after depend_tx completes 442 + * @cb_param: parameter to pass to the callback routine 443 + */ 444 + struct dma_async_tx_descriptor * 445 + async_trigger_callback(enum async_tx_flags flags, 446 + struct dma_async_tx_descriptor *depend_tx, 447 + dma_async_tx_callback cb_fn, void *cb_param) 448 + { 449 + struct dma_chan *chan; 450 + struct dma_device *device; 451 + struct dma_async_tx_descriptor *tx; 452 + 453 + if (depend_tx) { 454 + chan = depend_tx->chan; 455 + device = chan->device; 456 + 457 + /* see if we can schedule an interrupt 458 + * otherwise poll for completion 459 + */ 460 + if (device && !dma_has_cap(DMA_INTERRUPT, device->cap_mask)) 461 + device = NULL; 462 + 463 + tx = device ? 
device->device_prep_dma_interrupt(chan) : NULL; 464 + } else 465 + tx = NULL; 466 + 467 + if (tx) { 468 + pr_debug("%s: (async)\n", __FUNCTION__); 469 + 470 + async_tx_submit(chan, tx, flags, depend_tx, cb_fn, cb_param); 471 + } else { 472 + pr_debug("%s: (sync)\n", __FUNCTION__); 473 + 474 + /* wait for any prerequisite operations */ 475 + if (depend_tx) { 476 + /* if ack is already set then we cannot be sure 477 + * we are referring to the correct operation 478 + */ 479 + BUG_ON(depend_tx->ack); 480 + if (dma_wait_for_async_tx(depend_tx) == DMA_ERROR) 481 + panic("%s: DMA_ERROR waiting for depend_tx\n", 482 + __FUNCTION__); 483 + } 484 + 485 + async_tx_sync_epilog(flags, depend_tx, cb_fn, cb_param); 486 + } 487 + 488 + return tx; 489 + } 490 + EXPORT_SYMBOL_GPL(async_trigger_callback); 491 + 492 + module_init(async_tx_init); 493 + module_exit(async_tx_exit); 494 + 495 + MODULE_AUTHOR("Intel Corporation"); 496 + MODULE_DESCRIPTION("Asynchronous Bulk Memory Transactions API"); 497 + MODULE_LICENSE("GPL");
+327
crypto/async_tx/async_xor.c
··· 1 + /* 2 + * xor offload engine api 3 + * 4 + * Copyright © 2006, Intel Corporation. 5 + * 6 + * Dan Williams <dan.j.williams@intel.com> 7 + * 8 + * with architecture considerations by: 9 + * Neil Brown <neilb@suse.de> 10 + * Jeff Garzik <jeff@garzik.org> 11 + * 12 + * This program is free software; you can redistribute it and/or modify it 13 + * under the terms and conditions of the GNU General Public License, 14 + * version 2, as published by the Free Software Foundation. 15 + * 16 + * This program is distributed in the hope it will be useful, but WITHOUT 17 + * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or 18 + * FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for 19 + * more details. 20 + * 21 + * You should have received a copy of the GNU General Public License along with 22 + * this program; if not, write to the Free Software Foundation, Inc., 23 + * 51 Franklin St - Fifth Floor, Boston, MA 02110-1301 USA. 24 + * 25 + */ 26 + #include <linux/kernel.h> 27 + #include <linux/interrupt.h> 28 + #include <linux/mm.h> 29 + #include <linux/dma-mapping.h> 30 + #include <linux/raid/xor.h> 31 + #include <linux/async_tx.h> 32 + 33 + static void 34 + do_async_xor(struct dma_async_tx_descriptor *tx, struct dma_device *device, 35 + struct dma_chan *chan, struct page *dest, struct page **src_list, 36 + unsigned int offset, unsigned int src_cnt, size_t len, 37 + enum async_tx_flags flags, struct dma_async_tx_descriptor *depend_tx, 38 + dma_async_tx_callback cb_fn, void *cb_param) 39 + { 40 + dma_addr_t dma_addr; 41 + enum dma_data_direction dir; 42 + int i; 43 + 44 + pr_debug("%s: len: %zu\n", __FUNCTION__, len); 45 + 46 + dir = (flags & ASYNC_TX_ASSUME_COHERENT) ? 47 + DMA_NONE : DMA_FROM_DEVICE; 48 + 49 + dma_addr = dma_map_page(device->dev, dest, offset, len, dir); 50 + tx->tx_set_dest(dma_addr, tx, 0); 51 + 52 + dir = (flags & ASYNC_TX_ASSUME_COHERENT) ? 53 + DMA_NONE : DMA_TO_DEVICE; 54 + 55 + for (i = 0; i < src_cnt; i++) { 56 + dma_addr = dma_map_page(device->dev, src_list[i], 57 + offset, len, dir); 58 + tx->tx_set_src(dma_addr, tx, i); 59 + } 60 + 61 + async_tx_submit(chan, tx, flags, depend_tx, cb_fn, cb_param); 62 + } 63 + 64 + static void 65 + do_sync_xor(struct page *dest, struct page **src_list, unsigned int offset, 66 + unsigned int src_cnt, size_t len, enum async_tx_flags flags, 67 + struct dma_async_tx_descriptor *depend_tx, 68 + dma_async_tx_callback cb_fn, void *cb_param) 69 + { 70 + void *_dest; 71 + int i; 72 + 73 + pr_debug("%s: len: %zu\n", __FUNCTION__, len); 74 + 75 + /* reuse the 'src_list' array to convert to buffer pointers */ 76 + for (i = 0; i < src_cnt; i++) 77 + src_list[i] = (struct page *) 78 + (page_address(src_list[i]) + offset); 79 + 80 + /* set destination address */ 81 + _dest = page_address(dest) + offset; 82 + 83 + if (flags & ASYNC_TX_XOR_ZERO_DST) 84 + memset(_dest, 0, len); 85 + 86 + xor_blocks(src_cnt, len, _dest, 87 + (void **) src_list); 88 + 89 + async_tx_sync_epilog(flags, depend_tx, cb_fn, cb_param); 90 + } 91 + 92 + /** 93 + * async_xor - attempt to xor a set of blocks with a dma engine. 94 + * xor_blocks always uses the dest as a source so the ASYNC_TX_XOR_ZERO_DST 95 + * flag must be set to not include dest data in the calculation. The 96 + * assumption with dma eninges is that they only use the destination 97 + * buffer as a source when it is explicity specified in the source list. 
98 + * @dest: destination page 99 + * @src_list: array of source pages (if the dest is also a source it must be 100 + * at index zero). The contents of this array may be overwritten. 101 + * @offset: offset in pages to start transaction 102 + * @src_cnt: number of source pages 103 + * @len: length in bytes 104 + * @flags: ASYNC_TX_XOR_ZERO_DST, ASYNC_TX_XOR_DROP_DEST, 105 + * ASYNC_TX_ASSUME_COHERENT, ASYNC_TX_ACK, ASYNC_TX_DEP_ACK 106 + * @depend_tx: xor depends on the result of this transaction. 107 + * @cb_fn: function to call when the xor completes 108 + * @cb_param: parameter to pass to the callback routine 109 + */ 110 + struct dma_async_tx_descriptor * 111 + async_xor(struct page *dest, struct page **src_list, unsigned int offset, 112 + int src_cnt, size_t len, enum async_tx_flags flags, 113 + struct dma_async_tx_descriptor *depend_tx, 114 + dma_async_tx_callback cb_fn, void *cb_param) 115 + { 116 + struct dma_chan *chan = async_tx_find_channel(depend_tx, DMA_XOR); 117 + struct dma_device *device = chan ? chan->device : NULL; 118 + struct dma_async_tx_descriptor *tx = NULL; 119 + dma_async_tx_callback _cb_fn; 120 + void *_cb_param; 121 + unsigned long local_flags; 122 + int xor_src_cnt; 123 + int i = 0, src_off = 0, int_en; 124 + 125 + BUG_ON(src_cnt <= 1); 126 + 127 + while (src_cnt) { 128 + local_flags = flags; 129 + if (device) { /* run the xor asynchronously */ 130 + xor_src_cnt = min(src_cnt, device->max_xor); 131 + /* if we are submitting additional xors 132 + * only set the callback on the last transaction 133 + */ 134 + if (src_cnt > xor_src_cnt) { 135 + local_flags &= ~ASYNC_TX_ACK; 136 + _cb_fn = NULL; 137 + _cb_param = NULL; 138 + } else { 139 + _cb_fn = cb_fn; 140 + _cb_param = cb_param; 141 + } 142 + 143 + int_en = _cb_fn ? 1 : 0; 144 + 145 + tx = device->device_prep_dma_xor( 146 + chan, xor_src_cnt, len, int_en); 147 + 148 + if (tx) { 149 + do_async_xor(tx, device, chan, dest, 150 + &src_list[src_off], offset, xor_src_cnt, len, 151 + local_flags, depend_tx, _cb_fn, 152 + _cb_param); 153 + } else /* fall through */ 154 + goto xor_sync; 155 + } else { /* run the xor synchronously */ 156 + xor_sync: 157 + /* in the sync case the dest is an implied source 158 + * (assumes the dest is at the src_off index) 159 + */ 160 + if (flags & ASYNC_TX_XOR_DROP_DST) { 161 + src_cnt--; 162 + src_off++; 163 + } 164 + 165 + /* process up to 'MAX_XOR_BLOCKS' sources */ 166 + xor_src_cnt = min(src_cnt, MAX_XOR_BLOCKS); 167 + 168 + /* if we are submitting additional xors 169 + * only set the callback on the last transaction 170 + */ 171 + if (src_cnt > xor_src_cnt) { 172 + local_flags &= ~ASYNC_TX_ACK; 173 + _cb_fn = NULL; 174 + _cb_param = NULL; 175 + } else { 176 + _cb_fn = cb_fn; 177 + _cb_param = cb_param; 178 + } 179 + 180 + /* wait for any prerequisite operations */ 181 + if (depend_tx) { 182 + /* if ack is already set then we cannot be sure 183 + * we are referring to the correct operation 184 + */ 185 + BUG_ON(depend_tx->ack); 186 + if (dma_wait_for_async_tx(depend_tx) == 187 + DMA_ERROR) 188 + panic("%s: DMA_ERROR waiting for " 189 + "depend_tx\n", 190 + __FUNCTION__); 191 + } 192 + 193 + do_sync_xor(dest, &src_list[src_off], offset, 194 + xor_src_cnt, len, local_flags, depend_tx, 195 + _cb_fn, _cb_param); 196 + } 197 + 198 + /* the previous tx is hidden from the client, 199 + * so ack it 200 + */ 201 + if (i && depend_tx) 202 + async_tx_ack(depend_tx); 203 + 204 + depend_tx = tx; 205 + 206 + if (src_cnt > xor_src_cnt) { 207 + /* drop completed sources */ 208 + src_cnt -= 
xor_src_cnt; 209 + src_off += xor_src_cnt; 210 + 211 + /* unconditionally preserve the destination */ 212 + flags &= ~ASYNC_TX_XOR_ZERO_DST; 213 + 214 + /* use the intermediate result a source, but remember 215 + * it's dropped, because it's implied, in the sync case 216 + */ 217 + src_list[--src_off] = dest; 218 + src_cnt++; 219 + flags |= ASYNC_TX_XOR_DROP_DST; 220 + } else 221 + src_cnt = 0; 222 + i++; 223 + } 224 + 225 + return tx; 226 + } 227 + EXPORT_SYMBOL_GPL(async_xor); 228 + 229 + static int page_is_zero(struct page *p, unsigned int offset, size_t len) 230 + { 231 + char *a = page_address(p) + offset; 232 + return ((*(u32 *) a) == 0 && 233 + memcmp(a, a + 4, len - 4) == 0); 234 + } 235 + 236 + /** 237 + * async_xor_zero_sum - attempt a xor parity check with a dma engine. 238 + * @dest: destination page used if the xor is performed synchronously 239 + * @src_list: array of source pages. The dest page must be listed as a source 240 + * at index zero. The contents of this array may be overwritten. 241 + * @offset: offset in pages to start transaction 242 + * @src_cnt: number of source pages 243 + * @len: length in bytes 244 + * @result: 0 if sum == 0 else non-zero 245 + * @flags: ASYNC_TX_ASSUME_COHERENT, ASYNC_TX_ACK, ASYNC_TX_DEP_ACK 246 + * @depend_tx: xor depends on the result of this transaction. 247 + * @cb_fn: function to call when the xor completes 248 + * @cb_param: parameter to pass to the callback routine 249 + */ 250 + struct dma_async_tx_descriptor * 251 + async_xor_zero_sum(struct page *dest, struct page **src_list, 252 + unsigned int offset, int src_cnt, size_t len, 253 + u32 *result, enum async_tx_flags flags, 254 + struct dma_async_tx_descriptor *depend_tx, 255 + dma_async_tx_callback cb_fn, void *cb_param) 256 + { 257 + struct dma_chan *chan = async_tx_find_channel(depend_tx, DMA_ZERO_SUM); 258 + struct dma_device *device = chan ? chan->device : NULL; 259 + int int_en = cb_fn ? 1 : 0; 260 + struct dma_async_tx_descriptor *tx = device ? 261 + device->device_prep_dma_zero_sum(chan, src_cnt, len, result, 262 + int_en) : NULL; 263 + int i; 264 + 265 + BUG_ON(src_cnt <= 1); 266 + 267 + if (tx) { 268 + dma_addr_t dma_addr; 269 + enum dma_data_direction dir; 270 + 271 + pr_debug("%s: (async) len: %zu\n", __FUNCTION__, len); 272 + 273 + dir = (flags & ASYNC_TX_ASSUME_COHERENT) ? 274 + DMA_NONE : DMA_TO_DEVICE; 275 + 276 + for (i = 0; i < src_cnt; i++) { 277 + dma_addr = dma_map_page(device->dev, src_list[i], 278 + offset, len, dir); 279 + tx->tx_set_src(dma_addr, tx, i); 280 + } 281 + 282 + async_tx_submit(chan, tx, flags, depend_tx, cb_fn, cb_param); 283 + } else { 284 + unsigned long xor_flags = flags; 285 + 286 + pr_debug("%s: (sync) len: %zu\n", __FUNCTION__, len); 287 + 288 + xor_flags |= ASYNC_TX_XOR_DROP_DST; 289 + xor_flags &= ~ASYNC_TX_ACK; 290 + 291 + tx = async_xor(dest, src_list, offset, src_cnt, len, xor_flags, 292 + depend_tx, NULL, NULL); 293 + 294 + if (tx) { 295 + if (dma_wait_for_async_tx(tx) == DMA_ERROR) 296 + panic("%s: DMA_ERROR waiting for tx\n", 297 + __FUNCTION__); 298 + async_tx_ack(tx); 299 + } 300 + 301 + *result = page_is_zero(dest, offset, len) ? 
0 : 1; 302 + 303 + tx = NULL; 304 + 305 + async_tx_sync_epilog(flags, depend_tx, cb_fn, cb_param); 306 + } 307 + 308 + return tx; 309 + } 310 + EXPORT_SYMBOL_GPL(async_xor_zero_sum); 311 + 312 + static int __init async_xor_init(void) 313 + { 314 + return 0; 315 + } 316 + 317 + static void __exit async_xor_exit(void) 318 + { 319 + do { } while (0); 320 + } 321 + 322 + module_init(async_xor_init); 323 + module_exit(async_xor_exit); 324 + 325 + MODULE_AUTHOR("Intel Corporation"); 326 + MODULE_DESCRIPTION("asynchronous xor/xor-zero-sum api"); 327 + MODULE_LICENSE("GPL");
+14 -15
crypto/xor.c
··· 26 26 static struct xor_block_template *active_template; 27 27 28 28 void 29 - xor_blocks(unsigned int count, unsigned int bytes, void **ptr) 29 + xor_blocks(unsigned int src_count, unsigned int bytes, void *dest, void **srcs) 30 30 { 31 - unsigned long *p0, *p1, *p2, *p3, *p4; 31 + unsigned long *p1, *p2, *p3, *p4; 32 32 33 - p0 = (unsigned long *) ptr[0]; 34 - p1 = (unsigned long *) ptr[1]; 35 - if (count == 2) { 36 - active_template->do_2(bytes, p0, p1); 33 + p1 = (unsigned long *) srcs[0]; 34 + if (src_count == 1) { 35 + active_template->do_2(bytes, dest, p1); 37 36 return; 38 37 } 39 38 40 - p2 = (unsigned long *) ptr[2]; 41 - if (count == 3) { 42 - active_template->do_3(bytes, p0, p1, p2); 39 + p2 = (unsigned long *) srcs[1]; 40 + if (src_count == 2) { 41 + active_template->do_3(bytes, dest, p1, p2); 43 42 return; 44 43 } 45 44 46 - p3 = (unsigned long *) ptr[3]; 47 - if (count == 4) { 48 - active_template->do_4(bytes, p0, p1, p2, p3); 45 + p3 = (unsigned long *) srcs[2]; 46 + if (src_count == 3) { 47 + active_template->do_4(bytes, dest, p1, p2, p3); 49 48 return; 50 49 } 51 50 52 - p4 = (unsigned long *) ptr[4]; 53 - active_template->do_5(bytes, p0, p1, p2, p3, p4); 51 + p4 = (unsigned long *) srcs[3]; 52 + active_template->do_5(bytes, dest, p1, p2, p3, p4); 54 53 } 55 54 EXPORT_SYMBOL(xor_blocks); 56 55 ··· 127 128 fastest->name); 128 129 xor_speed(fastest); 129 130 } else { 130 - printk(KERN_INFO "xor: measuring checksumming speed\n"); 131 + printk(KERN_INFO "xor: measuring software checksum speed\n"); 131 132 XOR_TRY_TEMPLATES; 132 133 fastest = template_list; 133 134 for (f = fastest; f; f = f->next)
+2 -3
drivers/dma/Kconfig
··· 8 8 config DMA_ENGINE 9 9 bool "Support for DMA engines" 10 10 ---help--- 11 - DMA engines offload copy operations from the CPU to dedicated 12 - hardware, allowing the copies to happen asynchronously. 11 + DMA engines offload bulk memory operations from the CPU to dedicated 12 + hardware, allowing the operations to happen asynchronously. 13 13 14 14 comment "DMA Clients" 15 15 ··· 31 31 default m 32 32 ---help--- 33 33 Enable support for the Intel(R) I/OAT DMA engine. 34 - 35 34 endmenu
+2 -1
drivers/md/Kconfig
··· 109 109 config MD_RAID456 110 110 tristate "RAID-4/RAID-5/RAID-6 mode" 111 111 depends on BLK_DEV_MD 112 - select XOR_BLOCKS 112 + select ASYNC_MEMCPY 113 + select ASYNC_XOR 113 114 ---help--- 114 115 A RAID-5 set of N drives with a capacity of C MB per drive provides 115 116 the capacity of C * (N - 1) MB, and protects against a failure
+27 -27
drivers/md/raid5.c
··· 916 916 } 917 917 } 918 918 919 - #define check_xor() do { \ 920 - if (count == MAX_XOR_BLOCKS) { \ 921 - xor_blocks(count, STRIPE_SIZE, ptr); \ 922 - count = 1; \ 923 - } \ 919 + #define check_xor() do { \ 920 + if (count == MAX_XOR_BLOCKS) { \ 921 + xor_blocks(count, STRIPE_SIZE, dest, ptr);\ 922 + count = 0; \ 923 + } \ 924 924 } while(0) 925 925 926 926 927 927 static void compute_block(struct stripe_head *sh, int dd_idx) 928 928 { 929 929 int i, count, disks = sh->disks; 930 - void *ptr[MAX_XOR_BLOCKS], *p; 930 + void *ptr[MAX_XOR_BLOCKS], *dest, *p; 931 931 932 932 PRINTK("compute_block, stripe %llu, idx %d\n", 933 933 (unsigned long long)sh->sector, dd_idx); 934 934 935 - ptr[0] = page_address(sh->dev[dd_idx].page); 936 - memset(ptr[0], 0, STRIPE_SIZE); 937 - count = 1; 935 + dest = page_address(sh->dev[dd_idx].page); 936 + memset(dest, 0, STRIPE_SIZE); 937 + count = 0; 938 938 for (i = disks ; i--; ) { 939 939 if (i == dd_idx) 940 940 continue; ··· 948 948 949 949 check_xor(); 950 950 } 951 - if (count != 1) 952 - xor_blocks(count, STRIPE_SIZE, ptr); 951 + if (count) 952 + xor_blocks(count, STRIPE_SIZE, dest, ptr); 953 953 set_bit(R5_UPTODATE, &sh->dev[dd_idx].flags); 954 954 } 955 955 ··· 957 957 { 958 958 raid5_conf_t *conf = sh->raid_conf; 959 959 int i, pd_idx = sh->pd_idx, disks = sh->disks, count; 960 - void *ptr[MAX_XOR_BLOCKS]; 960 + void *ptr[MAX_XOR_BLOCKS], *dest; 961 961 struct bio *chosen; 962 962 963 963 PRINTK("compute_parity5, stripe %llu, method %d\n", 964 964 (unsigned long long)sh->sector, method); 965 965 966 - count = 1; 967 - ptr[0] = page_address(sh->dev[pd_idx].page); 966 + count = 0; 967 + dest = page_address(sh->dev[pd_idx].page); 968 968 switch(method) { 969 969 case READ_MODIFY_WRITE: 970 970 BUG_ON(!test_bit(R5_UPTODATE, &sh->dev[pd_idx].flags)); ··· 987 987 } 988 988 break; 989 989 case RECONSTRUCT_WRITE: 990 - memset(ptr[0], 0, STRIPE_SIZE); 990 + memset(dest, 0, STRIPE_SIZE); 991 991 for (i= disks; i-- ;) 992 992 if (i!=pd_idx && sh->dev[i].towrite) { 993 993 chosen = sh->dev[i].towrite; ··· 1003 1003 case CHECK_PARITY: 1004 1004 break; 1005 1005 } 1006 - if (count>1) { 1007 - xor_blocks(count, STRIPE_SIZE, ptr); 1008 - count = 1; 1006 + if (count) { 1007 + xor_blocks(count, STRIPE_SIZE, dest, ptr); 1008 + count = 0; 1009 1009 } 1010 1010 1011 1011 for (i = disks; i--;) ··· 1037 1037 check_xor(); 1038 1038 } 1039 1039 } 1040 - if (count != 1) 1041 - xor_blocks(count, STRIPE_SIZE, ptr); 1042 - 1040 + if (count) 1041 + xor_blocks(count, STRIPE_SIZE, dest, ptr); 1042 + 1043 1043 if (method != CHECK_PARITY) { 1044 1044 set_bit(R5_UPTODATE, &sh->dev[pd_idx].flags); 1045 1045 set_bit(R5_LOCKED, &sh->dev[pd_idx].flags); ··· 1132 1132 static void compute_block_1(struct stripe_head *sh, int dd_idx, int nozero) 1133 1133 { 1134 1134 int i, count, disks = sh->disks; 1135 - void *ptr[MAX_XOR_BLOCKS], *p; 1135 + void *ptr[MAX_XOR_BLOCKS], *dest, *p; 1136 1136 int pd_idx = sh->pd_idx; 1137 1137 int qd_idx = raid6_next_disk(pd_idx, disks); 1138 1138 ··· 1143 1143 /* We're actually computing the Q drive */ 1144 1144 compute_parity6(sh, UPDATE_PARITY); 1145 1145 } else { 1146 - ptr[0] = page_address(sh->dev[dd_idx].page); 1147 - if (!nozero) memset(ptr[0], 0, STRIPE_SIZE); 1148 - count = 1; 1146 + dest = page_address(sh->dev[dd_idx].page); 1147 + if (!nozero) memset(dest, 0, STRIPE_SIZE); 1148 + count = 0; 1149 1149 for (i = disks ; i--; ) { 1150 1150 if (i == dd_idx || i == qd_idx) 1151 1151 continue; ··· 1159 1159 1160 1160 check_xor(); 1161 1161 } 1162 - if 
(count != 1) 1163 - xor_blocks(count, STRIPE_SIZE, ptr); 1162 + if (count) 1163 + xor_blocks(count, STRIPE_SIZE, dest, ptr); 1164 1164 if (!nozero) set_bit(R5_UPTODATE, &sh->dev[dd_idx].flags); 1165 1165 else clear_bit(R5_UPTODATE, &sh->dev[dd_idx].flags); 1166 1166 }
+156
include/linux/async_tx.h
··· 1 + /* 2 + * Copyright © 2006, Intel Corporation. 3 + * 4 + * This program is free software; you can redistribute it and/or modify it 5 + * under the terms and conditions of the GNU General Public License, 6 + * version 2, as published by the Free Software Foundation. 7 + * 8 + * This program is distributed in the hope it will be useful, but WITHOUT 9 + * ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or 10 + * FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for 11 + * more details. 12 + * 13 + * You should have received a copy of the GNU General Public License along with 14 + * this program; if not, write to the Free Software Foundation, Inc., 15 + * 51 Franklin St - Fifth Floor, Boston, MA 02110-1301 USA. 16 + * 17 + */ 18 + #ifndef _ASYNC_TX_H_ 19 + #define _ASYNC_TX_H_ 20 + #include <linux/dmaengine.h> 21 + #include <linux/spinlock.h> 22 + #include <linux/interrupt.h> 23 + 24 + /** 25 + * dma_chan_ref - object used to manage dma channels received from the 26 + * dmaengine core. 27 + * @chan - the channel being tracked 28 + * @node - node for the channel to be placed on async_tx_master_list 29 + * @rcu - for list_del_rcu 30 + * @count - number of times this channel is listed in the pool 31 + * (for channels with multiple capabiities) 32 + */ 33 + struct dma_chan_ref { 34 + struct dma_chan *chan; 35 + struct list_head node; 36 + struct rcu_head rcu; 37 + atomic_t count; 38 + }; 39 + 40 + /** 41 + * async_tx_flags - modifiers for the async_* calls 42 + * @ASYNC_TX_XOR_ZERO_DST: this flag must be used for xor operations where the 43 + * the destination address is not a source. The asynchronous case handles this 44 + * implicitly, the synchronous case needs to zero the destination block. 45 + * @ASYNC_TX_XOR_DROP_DST: this flag must be used if the destination address is 46 + * also one of the source addresses. In the synchronous case the destination 47 + * address is an implied source, whereas the asynchronous case it must be listed 48 + * as a source. The destination address must be the first address in the source 49 + * array. 50 + * @ASYNC_TX_ASSUME_COHERENT: skip cache maintenance operations 51 + * @ASYNC_TX_ACK: immediately ack the descriptor, precludes setting up a 52 + * dependency chain 53 + * @ASYNC_TX_DEP_ACK: ack the dependency descriptor. Useful for chaining. 
54 + * @ASYNC_TX_KMAP_SRC: if the transaction is to be performed synchronously 55 + * take an atomic mapping (KM_USER0) on the source page(s) 56 + * @ASYNC_TX_KMAP_DST: if the transaction is to be performed synchronously 57 + * take an atomic mapping (KM_USER0) on the dest page(s) 58 + */ 59 + enum async_tx_flags { 60 + ASYNC_TX_XOR_ZERO_DST = (1 << 0), 61 + ASYNC_TX_XOR_DROP_DST = (1 << 1), 62 + ASYNC_TX_ASSUME_COHERENT = (1 << 2), 63 + ASYNC_TX_ACK = (1 << 3), 64 + ASYNC_TX_DEP_ACK = (1 << 4), 65 + ASYNC_TX_KMAP_SRC = (1 << 5), 66 + ASYNC_TX_KMAP_DST = (1 << 6), 67 + }; 68 + 69 + #ifdef CONFIG_DMA_ENGINE 70 + void async_tx_issue_pending_all(void); 71 + enum dma_status dma_wait_for_async_tx(struct dma_async_tx_descriptor *tx); 72 + void async_tx_run_dependencies(struct dma_async_tx_descriptor *tx); 73 + struct dma_chan * 74 + async_tx_find_channel(struct dma_async_tx_descriptor *depend_tx, 75 + enum dma_transaction_type tx_type); 76 + #else 77 + static inline void async_tx_issue_pending_all(void) 78 + { 79 + do { } while (0); 80 + } 81 + 82 + static inline enum dma_status 83 + dma_wait_for_async_tx(struct dma_async_tx_descriptor *tx) 84 + { 85 + return DMA_SUCCESS; 86 + } 87 + 88 + static inline void 89 + async_tx_run_dependencies(struct dma_async_tx_descriptor *tx, 90 + struct dma_chan *host_chan) 91 + { 92 + do { } while (0); 93 + } 94 + 95 + static inline struct dma_chan * 96 + async_tx_find_channel(struct dma_async_tx_descriptor *depend_tx, 97 + enum dma_transaction_type tx_type) 98 + { 99 + return NULL; 100 + } 101 + #endif 102 + 103 + /** 104 + * async_tx_sync_epilog - actions to take if an operation is run synchronously 105 + * @flags: async_tx flags 106 + * @depend_tx: transaction depends on depend_tx 107 + * @cb_fn: function to call when the transaction completes 108 + * @cb_fn_param: parameter to pass to the callback routine 109 + */ 110 + static inline void 111 + async_tx_sync_epilog(unsigned long flags, 112 + struct dma_async_tx_descriptor *depend_tx, 113 + dma_async_tx_callback cb_fn, void *cb_fn_param) 114 + { 115 + if (cb_fn) 116 + cb_fn(cb_fn_param); 117 + 118 + if (depend_tx && (flags & ASYNC_TX_DEP_ACK)) 119 + async_tx_ack(depend_tx); 120 + } 121 + 122 + void 123 + async_tx_submit(struct dma_chan *chan, struct dma_async_tx_descriptor *tx, 124 + enum async_tx_flags flags, struct dma_async_tx_descriptor *depend_tx, 125 + dma_async_tx_callback cb_fn, void *cb_fn_param); 126 + 127 + struct dma_async_tx_descriptor * 128 + async_xor(struct page *dest, struct page **src_list, unsigned int offset, 129 + int src_cnt, size_t len, enum async_tx_flags flags, 130 + struct dma_async_tx_descriptor *depend_tx, 131 + dma_async_tx_callback cb_fn, void *cb_fn_param); 132 + 133 + struct dma_async_tx_descriptor * 134 + async_xor_zero_sum(struct page *dest, struct page **src_list, 135 + unsigned int offset, int src_cnt, size_t len, 136 + u32 *result, enum async_tx_flags flags, 137 + struct dma_async_tx_descriptor *depend_tx, 138 + dma_async_tx_callback cb_fn, void *cb_fn_param); 139 + 140 + struct dma_async_tx_descriptor * 141 + async_memcpy(struct page *dest, struct page *src, unsigned int dest_offset, 142 + unsigned int src_offset, size_t len, enum async_tx_flags flags, 143 + struct dma_async_tx_descriptor *depend_tx, 144 + dma_async_tx_callback cb_fn, void *cb_fn_param); 145 + 146 + struct dma_async_tx_descriptor * 147 + async_memset(struct page *dest, int val, unsigned int offset, 148 + size_t len, enum async_tx_flags flags, 149 + struct dma_async_tx_descriptor *depend_tx, 150 + 
dma_async_tx_callback cb_fn, void *cb_fn_param); 151 + 152 + struct dma_async_tx_descriptor * 153 + async_trigger_callback(enum async_tx_flags flags, 154 + struct dma_async_tx_descriptor *depend_tx, 155 + dma_async_tx_callback cb_fn, void *cb_fn_param); 156 + #endif /* _ASYNC_TX_H_ */
+3 -2
include/linux/raid/xor.h
··· 3 3 4 4 #include <linux/raid/md.h> 5 5 6 - #define MAX_XOR_BLOCKS 5 6 + #define MAX_XOR_BLOCKS 4 7 7 8 - extern void xor_blocks(unsigned int count, unsigned int bytes, void **ptr); 8 + extern void xor_blocks(unsigned int count, unsigned int bytes, 9 + void *dest, void **srcs); 9 10 10 11 struct xor_block_template { 11 12 struct xor_block_template *next;