svd2ir: rework handling of names with placeholders by datdenkikniet · Pull Request #136 · embassy-rs/chiptool

datdenkikniet · 2026-04-19T08:25:06Z

Some parts of Renesas' SVDs have the following pattern:

<register>
    <dim>2</dim>
    <dimIncrement>0x4</dimIncrement>
    <dimIndex>P0B,P2B,P3B,P4B</dimIndex>
    <name>BUSSCNT%s</name>
</register>
<register>
    <dim>4</dim>
    <dimIncrement>0x4</dimIncrement>
    <dimIndex>MBIU,RAM0</dimIndex>
    <name>BUSSCNT%s</name>
</register>

Currently, chiptool crashes while generating fieldsets for such patterns, as it normalizes both BUSSCNT%s to BUSSCNT and fails to find unique names for the individual fieldsets.

In order to improve that behaviour, we add an extra level of mapping:

We create a fieldset with the name stripped of its placeholder (adding a number at the end if there are duplicates)
We map the full register name to this fieldset name
We use the full register name to generate the register fieldset access, unless the fieldset is an actual array. Then we use the numeric access pattern (fieldset_name(n: usize)) to get fieldsets at offsets.

This has the pleasant side effect of resolving the fact that convert_periperhal converts input like this:

<register>
    <dim>2</dim>
    <dimIncrement>0x4</dimIncrement>
    <dimIndex>A,B</dimIndex>
    <name>DLLCR%s</name>
</register>
<register>
    <dim>2</dim>
    <dimIncrement>0x4</dimIncrement>
    <dimIndex>MAIN,BACKUP</dimIndex>
    <name>INST%sREG0</name>
</register>

into

impl Peripheral {
    fn dlccr(&self, n: usize) -> Reg<Dlccr>;
    fn instreg0(&self, n: usize) -> Reg<_>;
}

instead, we now get this (identical fieldsets are returned, but at different offsets):

impl Peripheral {
    fn dlccra(&self) -> Reg<Dlccr>;
    fn dlccrb(&self) -> Reg<Dlccr>;
    fn instmainreg0(&self) -> Reg<_>;
    fn instbackupreg0(&self) -> REg<_>;
}

We can probably do the same for fields, but this PR doesn't do that yet.

This fixes #90 and closes #92

inferiorhumanorgans · 2026-04-19T20:33:13Z

This works with the SVDs for:

RA2A1
RA4M1

This does not work for these SVDs:

RA4L1

Error: Failed to find unique name for ["CANFD", "CFDRMDF_11"]. n: ["CANFD", "CFDRMDF_11"]

RA6M5

Err: on rename Fieldset "CANFD::regs::CFDTMDF1__01", new name "canfd::regs::Cfdtmdf101" already exist
Err: on rename Fieldset "CANFD::regs::CFDTMDF1__11", new name "canfd::regs::Cfdtmdf111" already exist

Also, it would be nice to keep the indices noted somewhere (appended to the description, as a replacement for %s in the description, or as a separate ir field).

datdenkikniet · 2026-04-19T20:56:41Z

Added an ugly-hack workaround for now. It seems to handle the RA4L1 SVD now, but not sure it'll also hold up to the transforms. There is a better way to do it but I cannot come up with it right now :P

inferiorhumanorgans · 2026-04-19T21:03:36Z

Yeah, there are lots of ugly hacks to deal with these SVDs unfortunately (and the less said about the other metadata I'm ingesting the better 😂).

FWIW the MCUs I've got on hand are the 2A1, 4L1, 4M1, 6M5, and 8M1 so that's what I'm checking. The 8M1's SVD causes the svd-parser validation to choke so I'm skipping that for now.

I'm testing these with the ra-pac workflow (just extract-peripherals ra4m1). I can walk you through setting that up if you'd like but that also depends on some transforms that aren't in the main branch so it may not be worth the time.

datdenkikniet · 2026-04-20T06:13:46Z

Did some mucking about and have come to the conclusion that having an underscore suffix, and just checking if the new name has been used at all, is easiest and is most likely to resolve most issues. There is probably a better way to find a unique name beyond "iterate until we've found one that doesn't exist yet" but I don't want to figure that one out just yet

inferiorhumanorgans · 2026-04-20T23:37:00Z

I think the more elegant solution would be to include the indicies in the name, but that has the potential for unreasonably long fieldset names. What I landed on was to keep a hashmap of seen fieldset names to keep track of the counts.

datdenkikniet · 2026-04-21T06:14:34Z

to include the indicies in the name

Like you say, this could cause really long names. Additionally, it doesn't necessarily improve the naming: inserting the dimension indices at the end of the name is equally "bad" IMO, and inserting them at the placeholder name doesn't really make it any better either.

Take, for example, CFDRMDF%s_1 with indicies 1-7, CFDRMDF%s_1 with indicies 8-15, and CFDRMDF%s_11 with indicies 8-15 (picked from the RA4L1 SVD). They're not arrays (by the standards implemented in this PR at least) because the placeholder is not at the end. We'd end up with:

Approach	CFDRMDF%s_1,`1-7`	CFDRMDF%s_1,`8-15`	CFDRMDF%s_11, `8-15`
`main`	CFDRMDF_1	CFDRMDF_1 (causes collision)	CFDRMDF_11
This PR	CFDRMDF_1	CFDRMDF_1_1	CFDRMDF_11
With indices at placeholder	CFDRMDF1234567_1	CFDRMDF_89101112131415_1	CFDRMDF89101112131415_11
With indices at end	CFDRMDF_1_1234567	CFDRMDF_1_89101112131415	CFDRMDF_11_89101112131415

I don't really see including the indices helping much of anything. We could/should include them in the description, but then they become stale as soon as anyone merges or deletes anything, and having some sort of two-way binding seems really heavy.

inferiorhumanorgans · 2026-04-21T22:08:52Z

In the case of CFDRMDF%s_1, the field itself is CFDRMDF1 or "RX Message Buffer Data Field 1 Registers". For the indicies I would collapse contiguous ranges to m_n with an end result along the lines of CFDRMDF_1_m_n. I've not actually read any of the docs on the CANFD peripheral but I'd expect the indicies are individual mailboxes.

That breaks down with PFS where you get constructs like P00%sPFS where the intended index is actually 00 or 000. PmnnPFS breaks down to m = index of the port peripheral, nn = index of the pin on the port.

Ultimately I'm not so concerned with the generated form as with PFS I rewrote the whole thing by hand because the fieldsets were so useless, and I may do the same with CANFD.

datdenkikniet · 2026-04-22T06:30:19Z

I would collapse contiguous ranges to m_n with an end result along the lines of CFDRMDF_1_m_n

I may have misunderstood the intent of your comment, but to make sure we're on the same page: I agree that this as an end result would be great, but it also sounds like something that is way out of scope for svd2ir. Merging fieldsets based on compatibility (or assuming that they are compatible based on names) sounds like a transform-level concern, not an svd2ir-level concern. svd2ir should mirror the SVD as closely as possible, which it would do with this change. This approach doesn't merge anything, but just uses what the SVD tells us (names containing placeholders refer to the same thing by definition) to generate a slightly more concise IR.

An alternative would be that we always expand all placeholders (except for arrays, [%s]), e.g. generate 8 fieldsets for CFDRMDF{0,1,2,3,4,5,6,7}_1 and leave it up to the user to re-merge them, but that seems unnecessary since the IR is made to deduplicate identical fieldsets, and we know that their definitions are identical.

Since we'll have to do manual transforms to get it into a good shape anyways, I don't think the naming scheme matters beyond that it is mostly recognizable as belonging to certain inner blocks, which it is with the current naming scheme.

Ultimately I'm not so concerned with the generated form

OK, let's conclude the naming discussion then: the name of the fieldset doesn't really matter so long as it is unique and can be associated with the inner block functions that use it, because whatever it becomes, it is likely to get renamed anyways. The current approach does that, while preserving the names we could already generate, and doesn't generate comically long names.

Opinions? I don't think bikeshedding the naming thing any more will help.

inferiorhumanorgans · 2026-04-23T00:19:27Z

Opinions? I don't think bikeshedding the naming thing any more will help.

Agreed.

However as of 163fc34 chiptool still chokes on the CANFD peripheral in the 4L1 and 6M5.

4L1:

thread 'main' (20068057) panicked at src/transform/mod.rs:84:60:
called `Result::unwrap()` on an `Err` value:
Err: on rename Fieldset "CANFD::regs::CFDRMDF_1_1", new name "canfd::regs::Cfdrmdf11" already exist
Err: on rename Fieldset "CANFD::regs::CFDRMDF_1_3", new name "canfd::regs::Cfdrmdf13" already exist
Err: on rename Fieldset "CANFD::regs::CFDRMDF_1_2", new name "canfd::regs::Cfdrmdf12" already exist

6M5:

thread 'main' (20068572) panicked at src/transform/mod.rs:84:60:
called `Result::unwrap()` on an `Err` value:
Err: on rename Fieldset "CANFD::regs::CFDTMDF1__1_1", new name "canfd::regs::Cfdtmdf111" already exist
Err: on rename Fieldset "CANFD::regs::CFDTMDF1__0_1", new name "canfd::regs::Cfdtmdf101" already exist

The minimal set of transforms I'm applying is:

Sanitize
DeleteUselessEnums
Sort

datdenkikniet · 2026-04-23T05:44:37Z

Ahh, okay: Sanitize is removing underscores from fieldset names, but underscores are the thing we use for separating... I'll look into what we can do about that

datdenkikniet · 2026-04-23T17:26:13Z

OK, after some thinking, I think the way to go here is:

Introduce a RenameFieldsets transform
Merge this PR as-is, and force the user to choose better names before calling Sanitize.

I've listed some other alternatives in #141, but I don't think any of them are better than this.

ETA: Have opened #141

With RenameFieldsets, I end up with the following:

  - !RenameFieldsets
    from: CANFD::regs::CFDRMDF_(\d+)_(\d+)
    to: CANFD::regs::CFDRMDF${1}copy${2}
  - !Sanitize
  - !MergeFieldsets
    from: canfd::regs::Cfdrmdf(\d+)(copy(\d+))?
    to: canfd::regs::Cfdrmdf

but of course, in this case, one could:

  - !MergeFieldsets
    from: CANFD::regs::CFDRMDF_(\d+)(_(\d+))?
    to: CANFD::regs::CFDRMDf
  - !Sanitize

instead, even without RenameFieldsets, because the register can actually be merged.

inferiorhumanorgans · 2026-04-28T10:10:32Z

Another alternative might be to have the counts zero padded. so that 11,1 → 011_001 and 1,1 → 001_001 don't collide. It would be nice (but is not a deal breaker for me) to not have to pre-sanitize input.

datdenkikniet force-pushed the clusters branch from 67d5374 to ab57951 Compare April 19, 2026 08:27

datdenkikniet mentioned this pull request Apr 19, 2026

Handle non-numeric array indices for registers #92

Open

datdenkikniet force-pushed the clusters branch from ab57951 to 732d7fd Compare April 19, 2026 12:00

datdenkikniet changed the title ~~Support converting SVDs with (duplicate) clustered names to IR~~ svd2ir: Support converting SVDs with (duplicate) clustered names to IR Apr 19, 2026

datdenkikniet changed the title ~~svd2ir: Support converting SVDs with (duplicate) clustered names to IR~~ svd2ir: Support converting SVDs with (duplicate) clustered names Apr 19, 2026

datdenkikniet force-pushed the clusters branch 2 times, most recently from 4f94476 to 9a9d61d Compare April 19, 2026 18:27

datdenkikniet marked this pull request as draft April 19, 2026 20:56

datdenkikniet force-pushed the clusters branch 2 times, most recently from e145d1e to 81c3d1a Compare April 20, 2026 06:11

datdenkikniet marked this pull request as ready for review April 20, 2026 06:12

datdenkikniet force-pushed the clusters branch from 81c3d1a to b4f7e2d Compare April 20, 2026 06:14

datdenkikniet added 3 commits April 21, 2026 22:39

reduce significant rightward drift

9b70ab5

replace_suffix -> remove_placeholder

be2d5a8

Split out create_block_item

7019219

datdenkikniet force-pushed the clusters branch from b4f7e2d to 163fc34 Compare April 21, 2026 20:56

datdenkikniet marked this pull request as draft April 21, 2026 20:58

datdenkikniet changed the title ~~svd2ir: Support converting SVDs with (duplicate) clustered names~~ svd2ir: rework handling of names with placeholders Apr 22, 2026

Support loading names with placeholders

25c59bd

datdenkikniet mentioned this pull request Apr 23, 2026

Add RenameFieldsets transform #141

Open

datdenkikniet force-pushed the clusters branch from 163fc34 to 25c59bd Compare April 23, 2026 19:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

svd2ir: rework handling of names with placeholders#136

svd2ir: rework handling of names with placeholders#136
datdenkikniet wants to merge 4 commits intoembassy-rs:mainfrom
datdenkikniet:clusters

datdenkikniet commented Apr 19, 2026 •

edited

Loading

Uh oh!

inferiorhumanorgans commented Apr 19, 2026 •

edited

Loading

Uh oh!

datdenkikniet commented Apr 19, 2026

Uh oh!

inferiorhumanorgans commented Apr 19, 2026

Uh oh!

datdenkikniet commented Apr 20, 2026 •

edited

Loading

Uh oh!

inferiorhumanorgans commented Apr 20, 2026

Uh oh!

datdenkikniet commented Apr 21, 2026

Uh oh!

inferiorhumanorgans commented Apr 21, 2026

Uh oh!

datdenkikniet commented Apr 22, 2026

Uh oh!

inferiorhumanorgans commented Apr 23, 2026

Uh oh!

datdenkikniet commented Apr 23, 2026

Uh oh!

datdenkikniet commented Apr 23, 2026 •

edited

Loading

Uh oh!

inferiorhumanorgans commented Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

datdenkikniet commented Apr 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

inferiorhumanorgans commented Apr 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

datdenkikniet commented Apr 19, 2026

Uh oh!

inferiorhumanorgans commented Apr 19, 2026

Uh oh!

datdenkikniet commented Apr 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

inferiorhumanorgans commented Apr 20, 2026

Uh oh!

datdenkikniet commented Apr 21, 2026

Uh oh!

inferiorhumanorgans commented Apr 21, 2026

Uh oh!

datdenkikniet commented Apr 22, 2026

Uh oh!

inferiorhumanorgans commented Apr 23, 2026

Uh oh!

datdenkikniet commented Apr 23, 2026

Uh oh!

datdenkikniet commented Apr 23, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

inferiorhumanorgans commented Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

datdenkikniet commented Apr 19, 2026 •

edited

Loading

inferiorhumanorgans commented Apr 19, 2026 •

edited

Loading

datdenkikniet commented Apr 20, 2026 •

edited

Loading

datdenkikniet commented Apr 23, 2026 •

edited

Loading