Issue 1438 write tar consistency by marco66colombo · Pull Request #1442 · fastmachinelearning/hls4ml

marco66colombo · 2026-02-17T22:37:10Z

Description

This PR fixes #1438 by making WriteTar handling consistent and explicit across writers.

Summary

Move WriteTar if condition one layer higher: check WriterConfig.WriteTar in write_hls() before calling write_tar().
Keep write_tar() as a pure “write tarball” method (no internal config check).
Add write_tar support to Catapult backend config (create_initial_config(..., write_tar=False) + WriterConfig['WriteTar']), so Catapult follows the same config path as other backends.
Extend test_writer_config::test_write_tar coverage to include Catapult.
Add write_tar support to VivadoAcceleratorBackend.create_initial_config(...) and include VivadoAccelerator in test_writer_config::test_write_tar coverage.

Motivation / Context

Issue #1438 highlights inconsistent artifact behavior and writer semantics.

A key point from prior maintainer discussion is that keeping tar generation disabled by default is better for users, to avoid generating large numbers of extra files during local development and debugging.

This PR keeps that default (WriteTar=False) while making behavior consistent across backends.

Notable behavior note

For Catapult specifically, when WriteTar=True and tar creation is triggered:

previous behavior: keep existing tar and print “already exists”
new behavior: overwrite existing tar (consistent with other backends)

This is intentional for consistency across backends.

For VivadoAccelerator, with WriteTar=True, tar can be generated/regenerated twice (once via the Vivado writer path and once via write_new_tar after accelerator-specific files are added). This matches past behavior and is not introduced in this PR.

Follow-up (in progress)

I still need to update tests/CI so artifact generation is always enabled where CI expects tar artifacts.

Possible approaches under consideration:

Add a central switch in test/pytest/conftest.py (for example wrapping conversion/config creation to force write_tar=True in CI).
Update each test case explicitly to pass write_tar=True (more explicit, but many more edits required).

I am working on this as a follow-up step.

Type of change

Bug fix (non-breaking change that fixes an issue)
Documentation update
New feature (non-breaking change which adds functionality)
Breaking change (fix or feature that would cause existing functionality to not work as expected)
A new research paper code implementation
Other (Specify)

Tests

I ran test/pytest/test_writer_config.py in a local environment.

Test Configuration:

Local development environment using same image used in CI.

Checklist

I have read the guidelines for contributing.
I have commented my code, particularly in hard-to-understand areas.
I have made corresponding changes to the documentation.
My changes generate no new warnings.
I have installed and run pre-commit on the files I edited or added.
I have added tests that prove my fix is effective or that my feature works.

vloncar · 2026-02-18T11:45:44Z

hls4ml/writer/quartus_writer.py

        self.write_activation_tables(model)
        self.write_yml(model)
-        self.write_tar(model)
+        write_tar = model.config.get_writer_config().get('WriteTar', False)


Why should we move the check outside of the write_tar() function? It was deliberately placed inside so we have a clean set of steps for the writer. Also it is easier to log (we're not doing it now, but could do in the future)

Because right now if you don't set 'WriteTar' field in the writer config, the behaviour of write_tar() is not the one expected, i.e. it doesn't create the tarball archive. To my understanding, that field this is mainly leveraged for local testing, in which you don't want to have the test folder filled with tarball files.
I think a different and cleaner solution would be to set an environment variable that prevents the tar files to be generated (basically moving back the check inside write_tar(), but checking if the env variable is set this time). This solution has the advantage of being set only once before running the tests and it is applied to all the tests.

The check is identical, it is just outside of the function now. The idea of the writer is that it follows a set of steps. The tar writer function may or may not write the tar, depending on the config, just like all the other functions write contents based on the provided config. Using env vars to change the behavior of the conversion process while you also have the conversion config itself that controls all other aspects of conversion would be splitting control and introducing inconsistency that will likely be a burden later on.

Thanks, this makes sense. I moved the WriteTar check outside write_tar() following earlier discussion, but I agree that keeping that logic inside write_tar() can preserve the writer-step semantics and original design.
If preferred, I can restore that behavior (move the check back inside write_tar()) while keeping the backend config fixes for Catapult and VivadoAccelerator, plus the related test coverage updates.

marco66colombo · 2026-02-18T21:04:00Z

As mentioned in the Follow-up section, we still need to decide how to (and if) ensure tar artifact generation in CI.

Should we enable tar artifact generation for every CI test case?

If yes, I see two practical approaches:

Central CI toggle in test/pytest/conftest.py
- Implementation: force write_tar=True for conversions only when a CI toggle is enabled (env var like HLS4ML_FORCE_WRITE_TAR=1 or pytest flag like --force-write-tar).
- Pros: single-point change, no per-test edits, easy CI on/off.
- Cons: more implicit behavior (wrapper/monkeypatch), may miss uncommon paths if they bypass wrapped entrypoints.
Explicit per-test updates
- Implementation: update tests to pass write_tar=True in each conversion call.
- Pros: explicit behavior in each test, finer-grained control.
- Cons: many edits across the suite, higher maintenance burden.

Happy to proceed with either approach if we want to activate artifact generation.

marco66colombo · 2026-03-02T23:04:54Z

Since the group was leaning toward using an env var, avoiding edits across many test files, and keeping the default CI behavior as “no tar generation,” I implemented the env-var control in the writer path.

I chose this over a conftest.py approach because forcing tar generation there would require global wrappers/monkeypatching of the methods that create the configs (e.g. convert_from_keras_model), which is more implicit and harder to maintain/debug.

Please let me know if this approach is acceptable, or if you’d prefer the pytest-fixture route instead.

I also restored self.write_tar(...) as an explicit writer step and moved the WriteTar check back inside the tar-writing methods (as suggested in review), so semantics stay consistent.

nghielme · 2026-03-04T13:01:48Z

test/pytest/test_writer_config.py

+        ('false', False),
+        ('1', True),
+    ],
+)


I think you need to add the id=[...] here for the proper naming of the tests

thanks for the comment, done

marco66colombo · 2026-03-04T17:00:55Z

I have one WriterConfig question before I finalize this PR.

Currently, HLSConfig uses backend WriterConfig as-is when present, and only applies defaults when WriterConfig is not there:

hls4ml/hls4ml/model/graph.py

Lines 63 to 71 in b50bd46

    
           if 'WriterConfig' in self.config: 
        
               self.writer_config = self.config['WriterConfig'] 
        
           else: 
        
               self.writer_config = { 
        
                   'Namespace': None, 
        
                   'WriteWeightsTxt': True, 
        
                   'WriteTar': False, 
        
                   'TBOutputStream': 'both', 
        
               }

In this PR, Catapult now sets:

 config['WriterConfig'] = {
       'WriteTar': write_tar,
}

That matches existing patterns in some backends (for example):

hls4ml/hls4ml/backends/quartus/quartus_backend.py

Lines 140 to 142 in b50bd46

    
           config['WriterConfig'] = { 
        
               'WriteTar': write_tar, 
        
           }

hls4ml/hls4ml/backends/oneapi/oneapi_backend.py

Lines 171 to 174 in b50bd46

    
           config['WriterConfig'] = { 
        
               # TODO:  add namespace 
        
               'WriteTar': write_tar, 
        
           }

But other backends provide the full writer-config key set (for example Vivado):

hls4ml/hls4ml/backends/vivado/vivado_backend.py

Lines 282 to 287 in b50bd46

    
           config['WriterConfig'] = { 
        
               'Namespace': namespace, 
        
               'WriteWeightsTxt': write_weights_txt, 
        
               'WriteTar': write_tar, 
        
               'TBOutputStream': tb_output_stream, 
        
           }

Given this mixed pattern, I was wondering what is the expected policy.

Is it acceptable for Catapult to set only WriteTar in WriterConfig?
Or should backends that define WriterConfig always provide the full key set for consistency/compatibility?

marco66colombo added 3 commits February 17, 2026 13:37

move WriteTar check to write_hls and align Catapult behavior

39741b0

test_writer_config:test_write_tar include Catapult backend

bd2f33f

fix write_tar for VivadoAccelerator

ebd2fd5

marco66colombo requested review from jmitrevs and nghielme February 17, 2026 22:37

marco66colombo self-assigned this Feb 17, 2026

vloncar reviewed Feb 18, 2026

View reviewed changes

writer: restore write_tar semantics and add env override

e77010f

nghielme reviewed Mar 4, 2026

View reviewed changes

test: add explicit ids for write_tar env override parametrization

1f4f187

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue 1438 write tar consistency#1442

Issue 1438 write tar consistency#1442
marco66colombo wants to merge 5 commits intofastmachinelearning:mainfrom
marco66colombo:issue-1438-write-tar-consistency

marco66colombo commented Feb 17, 2026

Uh oh!

vloncar Feb 18, 2026

Uh oh!

nghielme Feb 18, 2026

Uh oh!

vloncar Feb 18, 2026

Uh oh!

marco66colombo Feb 18, 2026

Uh oh!

marco66colombo commented Feb 18, 2026

Uh oh!

marco66colombo commented Mar 2, 2026

Uh oh!

nghielme Mar 4, 2026

Uh oh!

marco66colombo Mar 4, 2026

Uh oh!

marco66colombo commented Mar 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

marco66colombo commented Feb 17, 2026

Description

Summary

Motivation / Context

Notable behavior note

Follow-up (in progress)

Type of change

Tests

Checklist

Uh oh!

vloncar Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

nghielme Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

vloncar Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

marco66colombo Feb 18, 2026

Choose a reason for hiding this comment

Uh oh!

marco66colombo commented Feb 18, 2026

Uh oh!

marco66colombo commented Mar 2, 2026

Uh oh!

nghielme Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

marco66colombo Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

marco66colombo commented Mar 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants