
[498/622] lustre: vvp: dirty pages with pagevec

Message ID: 1582838290-17243-499-git-send-email-jsimmons@infradead.org
State: New, archived
Series: lustre: sync closely to 2.13.52

Commit Message

James Simmons Feb. 27, 2020, 9:16 p.m. UTC
From: Patrick Farrell <pfarrell@whamcloud.com>

When doing i/o from multiple writers to a single file, the
per-file page cache lock (the mapping lock) becomes a
bottleneck.

Most current uses handle a single page at a time.  This converts
one prominent use, marking pages as dirty, to use a pagevec.

When many threads are writing to one file, this improves
write performance by around 25%.

This requires implementing our own version of the
set_page_dirty() --> __set_page_dirty_nobuffers() call chain.

This was modeled on the upstream tip of tree:
v5.2-rc4-224-ge01e060fe0 (7/13/2019)

The relevant code is unchanged since Linux 4.17, and has
changed only minimally since before Linux 2.6.
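
As a rough illustration of the batching pattern (a minimal sketch
against the pre-folio pagevec API, not code from this patch;
commit_in_batches and cb are hypothetical names), the commit path
fills a pagevec and flushes the whole batch through one callback
invocation whenever it runs out of slots:

#include <linux/pagevec.h>

/* Hypothetical helper: gather vmpages into a pagevec and hand the
 * whole batch to the callback once the pagevec fills up, the way
 * osc_io_commit_async() does below.
 */
static void commit_in_batches(struct page **pages, int npages,
			      void (*cb)(struct pagevec *))
{
	struct pagevec pvec;
	int i;

	pagevec_init(&pvec);
	for (i = 0; i < npages; i++) {
		/* pagevec_add() returns the number of free slots left;
		 * 0 means this add filled the pagevec, so flush it.
		 */
		if (pagevec_add(&pvec, pages[i]) == 0) {
			cb(&pvec);
			pagevec_reinit(&pvec);
		}
	}
	/* flush any partially filled final batch */
	if (pagevec_count(&pvec))
		cb(&pvec);
}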

WC-bug-id: https://jira.whamcloud.com/browse/LU-9920
Lustre-commit: a7299cb012f8 ("LU-9920 vvp: dirty pages with pagevec")
Signed-off-by: Patrick Farrell <pfarrell@whamcloud.com>
Reviewed-on: https://review.whamcloud.com/28711
Reviewed-by: Andreas Dilger <adilger@whamcloud.com>
Reviewed-by: Shaun Tancheff <stancheff@cray.com>
Reviewed-by: Li Dongyang <dongyangli@ddn.com>
Reviewed-by: Oleg Drokin <green@whamcloud.com>
Signed-off-by: James Simmons <jsimmons@infradead.org>
---
 fs/lustre/include/cl_object.h   |   2 +-
 fs/lustre/include/lustre_osc.h  |   6 +--
 fs/lustre/llite/llite_lib.c     |   5 +-
 fs/lustre/llite/vvp_io.c        | 102 +++++++++++++++++++++++++++++++++++-----
 fs/lustre/mdc/mdc_request.c     |   7 +--
 fs/lustre/obdecho/echo_client.c |  11 ++++-
 fs/lustre/osc/osc_cache.c       |  13 ++++-
 fs/lustre/osc/osc_io.c          |  23 +++++++--
 fs/lustre/osc/osc_page.c        |   7 ++-
 mm/page-writeback.c             |   1 +
 10 files changed, 144 insertions(+), 33 deletions(-)
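
For reviewers skimming the API change: cl_commit_cbt implementations
now receive a pagevec rather than a single cl_page.  A minimal sketch
of the new contract (example_cb is a hypothetical name; the real
implementations are write_commit_callback(), mkwrite_commit_callback()
and echo_commit_callback() below):

static void example_cb(const struct lu_env *env, struct cl_io *io,
		       struct pagevec *pvec)
{
	int i;

	for (i = 0; i < pagevec_count(pvec); i++) {
		struct page *vmpage = pvec->pages[i];
		/* each vmpage's cl_page is recovered from page->private */
		struct cl_page *page = (struct cl_page *)vmpage->private;

		/* per-page commit/completion work goes here */
	}
}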

Patch

diff --git a/fs/lustre/include/cl_object.h b/fs/lustre/include/cl_object.h
index 4c68d7b..75ece62 100644
--- a/fs/lustre/include/cl_object.h
+++ b/fs/lustre/include/cl_object.h
@@ -1458,7 +1458,7 @@  struct cl_io_slice {
 };
 
 typedef void (*cl_commit_cbt)(const struct lu_env *, struct cl_io *,
-			      struct cl_page *);
+			      struct pagevec *);
 
 struct cl_read_ahead {
 	/*
diff --git a/fs/lustre/include/lustre_osc.h b/fs/lustre/include/lustre_osc.h
index de7ccd6..2cd23f2 100644
--- a/fs/lustre/include/lustre_osc.h
+++ b/fs/lustre/include/lustre_osc.h
@@ -584,9 +584,9 @@  int osc_set_async_flags(struct osc_object *obj, struct osc_page *opg,
 int osc_prep_async_page(struct osc_object *osc, struct osc_page *ops,
 			struct page *page, loff_t offset);
 int osc_queue_async_io(const struct lu_env *env, struct cl_io *io,
-		       struct osc_page *ops);
-int osc_page_cache_add(const struct lu_env *env,
-		       const struct cl_page_slice *slice, struct cl_io *io);
+		       struct osc_page *ops, cl_commit_cbt cb);
+int osc_page_cache_add(const struct lu_env *env, struct osc_page *opg,
+		       struct cl_io *io, cl_commit_cbt cb);
 int osc_teardown_async_page(const struct lu_env *env, struct osc_object *obj,
 			    struct osc_page *ops);
 int osc_flush_async_page(const struct lu_env *env, struct cl_io *io,
diff --git a/fs/lustre/llite/llite_lib.c b/fs/lustre/llite/llite_lib.c
index ad7c2e2..5d74f30 100644
--- a/fs/lustre/llite/llite_lib.c
+++ b/fs/lustre/llite/llite_lib.c
@@ -2149,6 +2149,7 @@  void ll_delete_inode(struct inode *inode)
 	struct ll_inode_info *lli = ll_i2info(inode);
 	struct address_space *mapping = &inode->i_data;
 	unsigned long nrpages;
+	unsigned long flags;
 
 	if (S_ISREG(inode->i_mode) && lli->lli_clob) {
 		/* It is last chance to write out dirty pages,
@@ -2172,9 +2173,9 @@  void ll_delete_inode(struct inode *inode)
 	 */
 	nrpages = mapping->nrpages;
 	if (nrpages) {
-		xa_lock_irq(&mapping->i_pages);
+		xa_lock_irqsave(&mapping->i_pages, flags);
 		nrpages = mapping->nrpages;
-		xa_unlock_irq(&mapping->i_pages);
+		xa_unlock_irqrestore(&mapping->i_pages, flags);
 	} /* Workaround end */
 
 	LASSERTF(nrpages == 0,
diff --git a/fs/lustre/llite/vvp_io.c b/fs/lustre/llite/vvp_io.c
index d0d8b1f..aa8f2e1 100644
--- a/fs/lustre/llite/vvp_io.c
+++ b/fs/lustre/llite/vvp_io.c
@@ -39,7 +39,8 @@ 
 #define DEBUG_SUBSYSTEM S_LLITE
 
 #include <obd.h>
-
+#include <linux/pagevec.h>
+#include <linux/memcontrol.h>
 #include "llite_internal.h"
 #include "vvp_internal.h"
 
@@ -860,19 +861,98 @@  static int vvp_io_commit_sync(const struct lu_env *env, struct cl_io *io,
 	return bytes > 0 ? bytes : rc;
 }
 
+/* Taken from kernel set_page_dirty, __set_page_dirty_nobuffers
+ * Last change to this area: b93b016313b3ba8003c3b8bb71f569af91f19fc7
+ *
+ * Current with Linus tip of tree (7/13/2019):
+ * v5.2-rc4-224-ge01e060fe0
+ *
+ */
+void vvp_set_pagevec_dirty(struct pagevec *pvec)
+{
+	struct page *page = pvec->pages[0];
+	struct address_space *mapping = page->mapping;
+	unsigned long flags;
+	int count = pagevec_count(pvec);
+	int dirtied = 0;
+	int i = 0;
+
+	/* From set_page_dirty */
+	for (i = 0; i < count; i++)
+		ClearPageReclaim(pvec->pages[i]);
+
+	LASSERTF(page->mapping,
+		 "mapping must be set. page %p, page->private (cl_page) %p",
+		 page, (void *) page->private);
+
+	/* Rest of code derived from __set_page_dirty_nobuffers */
+	xa_lock_irqsave(&mapping->i_pages, flags);
+
+	/* Notes on differences with __set_page_dirty_nobuffers:
+	 * 1. We don't need to call page_mapping because we know this is a page
+	 * cache page.
+	 * 2. We have the pages locked, so there is no need for the careful
+	 * mapping/mapping2 dance.
+	 * 3. No mapping is impossible. (Race w/truncate mentioned in
+	 * dirty_nobuffers should be impossible because we hold the page lock.)
+	 * 4. All mappings are the same because i/o is only to one file.
+	 * 5. We invert the lock order on lock_page_memcg(page) and the mapping
+	 * xa_lock, but this is the only function that should use that pair of
+	 * locks and it can't race because Lustre locks pages throughout i/o.
+	 */
+	for (i = 0; i < count; i++) {
+		page = pvec->pages[i];
+		lock_page_memcg(page);
+		if (TestSetPageDirty(page)) {
+			unlock_page_memcg(page);
+			continue;
+		}
+		LASSERTF(page->mapping == mapping,
+			 "all pages must have the same mapping.  page %p, mapping %p, first mapping %p\n",
+			 page, page->mapping, mapping);
+		WARN_ON_ONCE(!PagePrivate(page) && !PageUptodate(page));
+		account_page_dirtied(page, mapping);
+		__xa_set_mark(&mapping->i_pages, page_index(page),
+			      PAGECACHE_TAG_DIRTY);
+		dirtied++;
+		unlock_page_memcg(page);
+	}
+	xa_unlock_irqrestore(&mapping->i_pages, flags);
+
+	CDEBUG(D_VFSTRACE, "mapping %p, count %d, dirtied %d\n", mapping,
+	       count, dirtied);
+
+	if (mapping->host && dirtied) {
+		/* !PageAnon && !swapper_space */
+		__mark_inode_dirty(mapping->host, I_DIRTY_PAGES);
+	}
+}
+
 static void write_commit_callback(const struct lu_env *env, struct cl_io *io,
-				  struct cl_page *page)
+				  struct pagevec *pvec)
 {
-	struct page *vmpage = page->cp_vmpage;
+	struct cl_page *page;
+	struct page *vmpage;
+	int count = 0;
+	int i = 0;
 
-	SetPageUptodate(vmpage);
-	set_page_dirty(vmpage);
+	count = pagevec_count(pvec);
+	LASSERT(count > 0);
 
-	cl_page_disown(env, io, page);
+	for (i = 0; i < count; i++) {
+		vmpage = pvec->pages[i];
+		SetPageUptodate(vmpage);
+	}
+
+	vvp_set_pagevec_dirty(pvec);
 
-	/* held in ll_cl_init() */
-	lu_ref_del(&page->cp_reference, "cl_io", cl_io_top(io));
-	cl_page_put(env, page);
+	for (i = 0; i < count; i++) {
+		vmpage = pvec->pages[i];
+		page = (struct cl_page *) vmpage->private;
+		cl_page_disown(env, io, page);
+		lu_ref_del(&page->cp_reference, "cl_io", cl_io_top(io));
+		cl_page_put(env, page);
+	}
 }
 
 /* make sure the page list is contiguous */
@@ -1128,9 +1208,9 @@  static int vvp_io_kernel_fault(struct vvp_fault_io *cfio)
 }
 
 static void mkwrite_commit_callback(const struct lu_env *env, struct cl_io *io,
-				    struct cl_page *page)
+				    struct pagevec *pvec)
 {
-	set_page_dirty(page->cp_vmpage);
+	vvp_set_pagevec_dirty(pvec);
 }
 
 static int vvp_io_fault_start(const struct lu_env *env,
diff --git a/fs/lustre/mdc/mdc_request.c b/fs/lustre/mdc/mdc_request.c
index 34cf177..287013f 100644
--- a/fs/lustre/mdc/mdc_request.c
+++ b/fs/lustre/mdc/mdc_request.c
@@ -1138,16 +1138,17 @@  static struct page *mdc_page_locate(struct address_space *mapping, u64 *hash,
 	 */
 	unsigned long offset = hash_x_index(*hash, hash64);
 	struct page *page;
+	unsigned long flags;
 	int found;
 
-	xa_lock_irq(&mapping->i_pages);
+	xa_lock_irqsave(&mapping->i_pages, flags);
 	found = radix_tree_gang_lookup(&mapping->i_pages,
 				       (void **)&page, offset, 1);
 	if (found > 0 && !xa_is_value(page)) {
 		struct lu_dirpage *dp;
 
 		get_page(page);
-		xa_unlock_irq(&mapping->i_pages);
+		xa_unlock_irqrestore(&mapping->i_pages, flags);
 		/*
 		 * In contrast to find_lock_page() we are sure that directory
 		 * page cannot be truncated (while DLM lock is held) and,
@@ -1197,7 +1198,7 @@  static struct page *mdc_page_locate(struct address_space *mapping, u64 *hash,
 			page = ERR_PTR(-EIO);
 		}
 	} else {
-		xa_unlock_irq(&mapping->i_pages);
+		xa_unlock_irqrestore(&mapping->i_pages, flags);
 		page = NULL;
 	}
 	return page;
diff --git a/fs/lustre/obdecho/echo_client.c b/fs/lustre/obdecho/echo_client.c
index 172fe11..8e04636 100644
--- a/fs/lustre/obdecho/echo_client.c
+++ b/fs/lustre/obdecho/echo_client.c
@@ -998,16 +998,23 @@  static int __cl_echo_cancel(struct lu_env *env, struct echo_device *ed,
 }
 
 static void echo_commit_callback(const struct lu_env *env, struct cl_io *io,
-				 struct cl_page *page)
+				 struct pagevec *pvec)
 {
 	struct echo_thread_info *info;
 	struct cl_2queue *queue;
+	int i = 0;
 
 	info = echo_env_info(env);
 	LASSERT(io == &info->eti_io);
 
 	queue = &info->eti_queue;
-	cl_page_list_add(&queue->c2_qout, page);
+
+	for (i = 0; i < pagevec_count(pvec); i++) {
+		struct page *vmpage = pvec->pages[i];
+		struct cl_page *page = (struct cl_page *)vmpage->private;
+
+		cl_page_list_add(&queue->c2_qout, page);
+	}
 }
 
 static int cl_echo_object_brw(struct echo_object *eco, int rw, u64 offset,
diff --git a/fs/lustre/osc/osc_cache.c b/fs/lustre/osc/osc_cache.c
index 3d47c02..dde03bd 100644
--- a/fs/lustre/osc/osc_cache.c
+++ b/fs/lustre/osc/osc_cache.c
@@ -2303,13 +2303,14 @@  int osc_prep_async_page(struct osc_object *osc, struct osc_page *ops,
 EXPORT_SYMBOL(osc_prep_async_page);
 
 int osc_queue_async_io(const struct lu_env *env, struct cl_io *io,
-		       struct osc_page *ops)
+		       struct osc_page *ops, cl_commit_cbt cb)
 {
 	struct osc_io *oio = osc_env_io(env);
 	struct osc_extent *ext = NULL;
 	struct osc_async_page *oap = &ops->ops_oap;
 	struct client_obd *cli = oap->oap_cli;
 	struct osc_object *osc = oap->oap_obj;
+	struct pagevec        *pvec = &osc_env_info(env)->oti_pagevec;
 	pgoff_t index;
 	unsigned int grants = 0, tmp;
 	int brw_flags = OBD_BRW_ASYNC;
@@ -2431,7 +2432,15 @@  int osc_queue_async_io(const struct lu_env *env, struct cl_io *io,
 
 		rc = 0;
 		if (grants == 0) {
-			/* we haven't allocated grant for this page. */
+			/* We haven't allocated grant for this page, and we
+			 * must not hold a page lock while we do enter_cache,
+			 * so we must mark dirty & unlock any pages in the
+			 * write commit pagevec.
+			 */
+			if (pagevec_count(pvec)) {
+				cb(env, io, pvec);
+				pagevec_reinit(pvec);
+			}
 			rc = osc_enter_cache(env, cli, oap, tmp);
 			if (rc == 0)
 				grants = tmp;
diff --git a/fs/lustre/osc/osc_io.c b/fs/lustre/osc/osc_io.c
index 8e299d4..f340266 100644
--- a/fs/lustre/osc/osc_io.c
+++ b/fs/lustre/osc/osc_io.c
@@ -40,6 +40,7 @@ 
 
 #include <lustre_obdo.h>
 #include <lustre_osc.h>
+#include <linux/pagevec.h>
 
 #include "osc_internal.h"
 
@@ -288,6 +289,7 @@  int osc_io_commit_async(const struct lu_env *env,
 	struct cl_page *page;
 	struct cl_page *last_page;
 	struct osc_page *opg;
+	struct pagevec  *pvec = &osc_env_info(env)->oti_pagevec;
 	int result = 0;
 
 	LASSERT(qin->pl_nr > 0);
@@ -306,6 +308,8 @@  int osc_io_commit_async(const struct lu_env *env,
 		}
 	}
 
+	pagevec_init(pvec);
+
 	while (qin->pl_nr > 0) {
 		struct osc_async_page *oap;
 
@@ -325,7 +329,7 @@  int osc_io_commit_async(const struct lu_env *env,
 
 		/* The page may be already in dirty cache. */
 		if (list_empty(&oap->oap_pending_item)) {
-			result = osc_page_cache_add(env, &opg->ops_cl, io);
+			result = osc_page_cache_add(env, opg, io, cb);
 			if (result != 0)
 				break;
 		}
@@ -335,12 +339,21 @@  int osc_io_commit_async(const struct lu_env *env,
 
 		cl_page_list_del(env, qin, page);
 
-		(*cb)(env, io, page);
-		/* Can't access page any more. Page can be in transfer and
-		 * complete at any time.
-		 */
+		/* if there are no more slots, do the callback & reinit */
+		if (pagevec_add(pvec, page->cp_vmpage) == 0) {
+			(*cb)(env, io, pvec);
+			pagevec_reinit(pvec);
+		}
 	}
 
+	/* Clean up any partially full pagevecs */
+	if (pagevec_count(pvec) != 0)
+		(*cb)(env, io, pvec);
+
+	/* Can't access these pages any more. Page can be in transfer and
+	 * complete at any time.
+	 */
+
 	/* for sync write, kernel will wait for this page to be flushed before
 	 * osc_io_end() is called, so release it earlier.
 	 * for mkwrite(), it's known there is no further pages.
diff --git a/fs/lustre/osc/osc_page.c b/fs/lustre/osc/osc_page.c
index 0910f3a..6685968 100644
--- a/fs/lustre/osc/osc_page.c
+++ b/fs/lustre/osc/osc_page.c
@@ -92,14 +92,13 @@  static void osc_page_transfer_add(const struct lu_env *env,
 	osc_lru_use(osc_cli(obj), opg);
 }
 
-int osc_page_cache_add(const struct lu_env *env,
-		       const struct cl_page_slice *slice, struct cl_io *io)
+int osc_page_cache_add(const struct lu_env *env, struct osc_page *opg,
+		       struct cl_io *io, cl_commit_cbt cb)
 {
-	struct osc_page *opg = cl2osc_page(slice);
 	int result;
 
 	osc_page_transfer_get(opg, "transfer\0cache");
-	result = osc_queue_async_io(env, io, opg);
+	result = osc_queue_async_io(env, io, opg, cb);
 	if (result != 0)
 		osc_page_transfer_put(env, opg);
 	else
diff --git a/mm/page-writeback.c b/mm/page-writeback.c
index 50055d2..3b5a43d 100644
--- a/mm/page-writeback.c
+++ b/mm/page-writeback.c
@@ -2433,6 +2433,7 @@  void account_page_dirtied(struct page *page, struct address_space *mapping)
 		mem_cgroup_track_foreign_dirty(page, wb);
 	}
 }
+EXPORT_SYMBOL(account_page_dirtied);
 
 /*
  * Helper function for deaccounting dirty page without writeback.