[PATCH v9 0/6] CXL Poison List Retrieval & Tracing

From: alison . schofield
Date: Mon Mar 20 2023 - 00:32:01 EST


From: Alison Schofield <alison.schofield@xxxxxxxxx>

Changes in v9:

Patch 3: cxl/memdev: Add trigger_poison_list sysfs attribute
- Make trigger_poison_list a driver attribute to protect against unbinds
while poison is being read. (Dan)

Patch 4: cxl/region: Provide region info to the cxl_poison trace event
- Remove already held cxl_dpa_rwsem in cxl_get_poison_by_endpoint() (Ira)
- Add hold of cxl_region_rwsem in cxl_get_poison_by_endpoint()
- Move the 'remains' handling to memdev driver (Ira)
Previously, after the region driver read poison for the last committed
endpoint decoder, it also read poison for remaining unmapped resources.
Add a context struct to pass the poison read state between memdev and
region drivers, so that memdev driver can complete the poison read of
unmapped resources.

Patches 1,2,5,6: no changes.

Link to v8:
https://lore.kernel.org/linux-cxl/cover.1678468593.git.alison.schofield@xxxxxxxxx/

Add support for retrieving device poison lists and store the returned
error records as kernel trace events.

The handling of the poison list is guided by the CXL 3.0 Specification
Section 8.2.9.8.4.1. [1]

Example:
$ echo 1 > /sys/bus/cxl/devices/mem0/trigger_poison_list
cxl_poison: memdev=mem0 host=cxl_mem.0 serial=0 region=region4 region_uuid=117b2cf4-b160-4090-9361-ba31b9649317 hpa=0xf0d0000000 dpa=0x40000000 length=0x40 source=Internal flags= overflow_time=0

[1]: https://www.computeexpresslink.org/download-the-specification


Alison Schofield (6):
cxl/mbox: Add GET_POISON_LIST mailbox command
cxl/trace: Add TRACE support for CXL media-error records
cxl/memdev: Add trigger_poison_list sysfs attribute
cxl/region: Provide region info to the cxl_poison trace event
cxl/trace: Add an HPA to cxl_poison trace events
tools/testing/cxl: Mock support for Get Poison List

Documentation/ABI/testing/sysfs-bus-cxl | 14 +++
drivers/cxl/core/core.h | 11 +++
drivers/cxl/core/mbox.c | 74 ++++++++++++++++
drivers/cxl/core/memdev.c | 108 ++++++++++++++++++++++++
drivers/cxl/core/region.c | 63 ++++++++++++++
drivers/cxl/core/trace.c | 94 +++++++++++++++++++++
drivers/cxl/core/trace.h | 91 ++++++++++++++++++++
drivers/cxl/cxlmem.h | 72 +++++++++++++++-
drivers/cxl/mem.c | 36 ++++++++
drivers/cxl/pci.c | 4 +
tools/testing/cxl/test/mem.c | 42 +++++++++
11 files changed, 608 insertions(+), 1 deletion(-)


base-commit: e686c32590f40bffc45f105c04c836ffad3e531a
--
2.37.3