Skip to content

Super-streaming SXSSF#141

Closed
mobreza wants to merge 9679 commits intoapache:trunkfrom
mobreza:super-sxssf
Closed

Super-streaming SXSSF#141
mobreza wants to merge 9679 commits intoapache:trunkfrom
mobreza:super-sxssf

Conversation

@mobreza
Copy link

@mobreza mobreza commented Jan 23, 2019

https://bz.apache.org/bugzilla/show_bug.cgi?id=63100

SXSSF works as designed and manages a small memory footprint when generating large files from a database. But it only writes data to an output stream once everything has been written to SXSSF. This is problematic when used in web applications:

In our use case (our website allows users to generate Excel from the database), generating the SXSSF on the server takes about 5 minutes. Most clients give up within a minute (or the browser does it automatically), or the proxy times out due to no data being sent. Some users also retry the download request. A new request for download is initiated (while the server is busy generating the SXSSF for a client that already gave up). This can potentially lead to DOS.

To work around this issue, I've implemented a super-streaming version of SXSSF, a SuperSXSSF, that relies on rowWriter callback to generate row data.

With this approach our service is able to stream the generated Excel directly to the client and, best of all, is terminated in case the user cancels the download request.

The SuperSXSSF prevents both download timeouts and potential DOS, while allowing developers all other XSSF actions (i.e. define styles) that don't take much processing time.

Loading
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

9 participants