Add option to override per-VR character set usage by Enet4 · Pull Request #676 · Enet4/dicom-rs

Enet4 · 2025-08-21T19:04:48Z

Should resolve #675 by offering a way to ignore the fact that some VRs should always use the default character repertoire.

Summary

[parser] Add support for character set override logic
[object] Extend charset_override option to collector and in-mem DICOM object

naterichman

Sorry if I'm missing something, but figured I would hop on a review in case it helped!

parser/src/stateful/decode.rs

momostarsky · 2025-09-22T03:10:53Z

@Enet4 Hi, why hasn't this PR been merged yet? Is there anything blocking it? Does it need more testing?

Enet4 · 2025-09-22T11:03:29Z

I just didn't get to it yet. Let me assess and revise on Nathan's comment first, hopefully within the next few days.

- CharacterSetOverride allows one to indicate that the more extended character repertoire available should be used for decoding other VRs such as CS.

… object - re-export dicom_parser options accordingly

- Include UR in VRs which should use the default character set

Enet4 · 2025-09-24T08:22:31Z

I rebased the branch and added UR as one of the VRs to use the default character set by default. Things seem to be in order. @momostarsky I noticed a deleted post here, are there still any issues?

momostarsky · 2025-09-25T06:36:06Z

@Enet4 It might be an issue with testing or configuration. I'll verify later using storescu and storescp to check if it affects read/write operations.

momostarsky · 2025-09-25T08:44:46Z

@Enet4 Building on the earlier tests, Test some DICOM files with character sets such as "ISO 192", "ISO 100", "GB18030", and "ISO 2022 IR 58". Modify the storescp implementation in store_sync.rs around line 192, add code similar to the following:
'''
match dicom_object::OpenFileOptions::new()
.charset_override(CharacterSetOverride::AnyVr)
.read_until(tags::PIXEL_DATA)
.open_file(&file_path)
{
Ok(dcm_obj) => {
match dcm_obj.element(tags::SPECIFIC_CHARACTER_SET) {
Ok(elm) => {
println!(
"SPECIFIC_CHARACTER_SET: {:?}",
elm.value().to_str()
);
}
Err(_) => {
warn!("Specific Character Set tag not found in the DICOM file");
}
};

                                    match dcm_obj.element(tags::BODY_PART_EXAMINED) {
                                        Ok(elm) => {
                                            println!(
                                                "BODY_PART_EXAMINED: {:?}",
                                                elm.value().to_str()
                                            );
                                        }
                                        Err(_) => {
                                            warn!("Body Part Examined tag not found in the DICOM file");
                                        }
                                    };
                                }
                                Err(e) => {
                                    warn!("Failed to read back stored file: {}", e);
                                }
                            }

'''
no abnormalities were found. Should we test with more datasets?

Enet4 · 2025-09-25T08:49:00Z

That sounds to offer good coverage already, thanks! If in the meantime you found what could be considered a bug, please file a new issue.

Enet4 added enhancement A-lib Area: library C-object Crate: dicom-object C-parser Crate: dicom-parser labels Aug 21, 2025

Enet4 mentioned this pull request Aug 21, 2025

Charset Convert #675

Closed

naterichman reviewed Sep 4, 2025

View reviewed changes

parser/src/stateful/decode.rs Outdated Show resolved Hide resolved

Enet4 added 2 commits September 24, 2025 08:59

[parser] Add support for character set override logic

7e87859

- CharacterSetOverride allows one to indicate that the more extended character repertoire available should be used for decoding other VRs such as CS.

[object] Extend charset_override option to collector and in-mem DICOM…

6231ccf

… object - re-export dicom_parser options accordingly

Enet4 force-pushed the imp/parser/extend-charset-cs branch from 3941f03 to 6231ccf Compare September 24, 2025 07:59

[parser] Tweak character set override logic

36c4e99

- Include UR in VRs which should use the default character set

Enet4 merged commit 76e4e5a into master Sep 25, 2025
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add option to override per-VR character set usage#676

Add option to override per-VR character set usage#676
Enet4 merged 3 commits intomasterfrom
imp/parser/extend-charset-cs

Enet4 commented Aug 21, 2025

Uh oh!

naterichman left a comment

Uh oh!

Uh oh!

momostarsky commented Sep 22, 2025

Uh oh!

Enet4 commented Sep 22, 2025

Uh oh!

Enet4 commented Sep 24, 2025

Uh oh!

momostarsky commented Sep 25, 2025

Uh oh!

momostarsky commented Sep 25, 2025

Uh oh!

Enet4 commented Sep 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

Enet4 commented Aug 21, 2025

Summary

Uh oh!

naterichman left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

momostarsky commented Sep 22, 2025

Uh oh!

Enet4 commented Sep 22, 2025

Uh oh!

Enet4 commented Sep 24, 2025

Uh oh!

momostarsky commented Sep 25, 2025

Uh oh!

momostarsky commented Sep 25, 2025

Uh oh!

Enet4 commented Sep 25, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants