Re: RFC on drivers/memory vs drivers/edac memory mapping for DDR Controller

From: Krzysztof Kozlowski
Date: Mon Jan 02 2023 - 08:59:24 EST


On 02/01/2023 14:44, Shenhar, Talel wrote:
>
> On 1/2/2023 2:47 PM, Krzysztof Kozlowski wrote:
>> CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you can confirm the sender and know the content is safe.
>>
>>
>>
>> On 02/01/2023 13:17, Shenhar, Talel wrote:
>>> Hi,
>>>
>>> Want to consult on a topic that involve both drivers/memory and
>>> drivers/edac.
>>>
>>> * We want to introduce driver that reads DDR controller RAS register and
>>> notify for ECC errors by using EDAC MC API found in drivers/edac.
>>> * We also want to have a capability to dynamically change DDR refresh
>>> rate based on thermal values (best to be done in drivers/memory ?).
>>>
>>> The pain point here is that both capabilities are controlled from the
>>> DDR controller.
>>> This create issue while using
>>> devm_platform_ioremap_resource*->devm_request_mem_region which prevent
>>> two mapping of same area.
>> This could be avoided but the true problem is that you have two devices
>> for same memory mapping. Devicetree does not allow it and it points to
>> some wrong hardware representation in DTS.
>>
>>> It seems to be expected problem as we have 2 "framework" (edac and
>>> memory) split while both aim for same HW unit.
>>> What is the recommended way to face such conflicts?
>> You now mix Devicetree and Linux drivers. You can have same IO address
>> space used by multiple drivers, even though it is not always good
>> approach (concurrent and conflicting change of same settings).
>>
>> HW description is irrelevant to this.
>>
>>> We had several concept in mind but would love to get your point of view
>>> first.
>> Describe hardware accurately and completely. This solves all the
>> problems, doesn't it? Linux drivers do not depend on it and you can make
>> it differently.
>
> Describing the hardware accurately and completely means to have multiple
> reg property in the device-tree, right?

Not necessarily. It means to describe the devices, the hardware, not
drivers.

>
> That way we will split the HW description to smaller bits rather then
> just big "ddrc", and that shall allow us to have two drivers and each
> one will get its own share of the split, right?

Not necessarily. If you have one hardware, why would you split it? It's
only one memory controller, not two.

> That was the intent of solution 1 below.
>
>>
>>> Things we had in mind:
>>> 1) map more specific region to avoid conflict (we don't need the same
>>> registers on both entity so if we do very specific multiple mapping this
>>> shall be resolved)
>>> 2) use other kernel API for mapping that doesn't do request_mem_region
>>> (or use the reserve only for one of them)
>>> 3) have single driver (edac mc) handle also the refresh rate
>>> 4) export edac_mc.h and have the drivers/memory have all the needed code
>>> to do both edac and refresh rate under drivers/memory
>> None of these address the core problem - possibly inaccurate hardware
>> description...
>
> Can you elaborate on this inaccurate hardware description?

I explained - using same IO address suggests you used Linux driver
structure in your hardware description. I assume we talk here about
Devicetree. If not, that's quite different case... then I guess ACPI,
which I do not care - I am not it's maintainer.

> Also, I'd like to write down my understanding of your response from above:
>
> it seems you see as possible solution both using different API that
> allow overlapping (solution 2) and also for splitting the IO address
> space to finer pieces to achieve full HW description (solution 1)

No. Sorry, we probably talk about two different things.

You started writing that you have a hardware described as one IO address
space and now have a problem developing drivers for it.

The driver model for this is entirely different problem than problem of
accurate hardware description. Whether you described HW correct or not,
I don't know. You did not provide any details here, like DTS or bindings
(if we talk about Devicetree).

Having multiple drivers using similar resources is already solved many
times (MFD, syscon).

Whether the solution is correct or not is one more (third) topic: poking
to same IO address space from two different drivers is error-prone. This
one is solvable with splitting IO address space.

Best regards,
Krzysztof