Does RarArchive support reading block-sized chunks at a time for large 2GB+ rar files, and unrar-based block sizes? #447

Open
opened 2026-01-29 22:12:11 +00:00 by claunia · 3 comments

Originally created by @jameelahmeds on GitHub (Mar 30, 2021).

Does the RarArchive support reading a user-defined number of bytes at a time from a compressed file and then decompressing those bytes incrementally?
Is this supported? If so, which class and function should I use? Any samples?

I have a large 2GB+ rar file on Azure Blob Storage, and want to uncompress to Azure Blob storage.

Thank you


@adamhathcock commented on GitHub (Mar 31, 2021):

Well, using a RarReader you don't actually decompress or read from the underlying stream until you start reading the entry. You can't seek to a specific position in a compressed stream, as the bytes for an entry are a single stream of compressed bytes that requires knowledge of everything before it to continue forward.

Rar is a simple format that is

<Header><bytes><Header><bytes>

with no directory of entries. The RarArchive will scan a seekable stream for entries by skipping the compressed bytes, which is fast. Of course, this requires a seekable stream and won't work with a network stream from Azure Blob Storage.

RarReader will skip entries, but it does so by reading the non-seekable stream's bytes into a null stream until it gets to the next entry.
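To answer the original question concretely, here is a minimal sketch of streaming extraction with RarReader that decompresses a caller-chosen number of bytes at a time. The `openDestination` callback and the default buffer size are placeholders for whatever sink you use (e.g. an upload stream from the Azure Blob SDK); only `RarReader.Open`, `MoveToNextEntry`, `Entry`, and `OpenEntryStream` come from SharpCompress:

```csharp
using System;
using System.IO;
using SharpCompress.Readers.Rar;

static class StreamingExtract
{
    // Decompress each entry "bufferSize" bytes at a time; the source stream
    // is read forward-only, so no seeking is required.
    public static void Extract(Stream source, Func<string, Stream> openDestination, int bufferSize = 81920)
    {
        using var reader = RarReader.Open(source);
        var buffer = new byte[bufferSize];
        while (reader.MoveToNextEntry())
        {
            if (reader.Entry.IsDirectory)
                continue;
            using var entryStream = reader.OpenEntryStream();
            using var destination = openDestination(reader.Entry.Key);
            int read;
            while ((read = entryStream.Read(buffer, 0, buffer.Length)) > 0)
                destination.Write(buffer, 0, read);
        }
    }
}
```

With this shape, `source` could be a blob download stream and `openDestination` could return a blob upload stream, so nothing needs to touch local disk.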


@marcussacana commented on GitHub (Dec 19, 2021):

@adamhathcock I don't know the rar format, but I see the reason why it isn't seekable; that makes sense.
Still, an idea: if we already have the file partially extracted, maybe it's possible to resume the extraction by reading the extracted file and rebuilding the extraction dictionary, so that decompression continues from the end of the partially extracted file?
It seems absurd to waste time reprocessing the file, but I'm extracting directly from the cloud, and on top of that from files over 90 GB; if that were possible I would like to put it into practice, even if I have to study the format and work out the strategy myself.


@adamhathcock commented on GitHub (Dec 20, 2021):

Your strategy can work, but you have to manage the Stream of bytes yourself. SharpCompress will take any Stream and try to decompress as long as there are bytes available in it.
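As one illustration of "managing the Stream yourself" (not the dictionary-rebuilding idea above, for which SharpCompress has no API): a hypothetical forward-only wrapper that, on a transient network failure, reopens the source at the current offset, e.g. via an HTTP Range request, so a 90 GB download doesn't restart from zero. `openAt` is an assumed caller-supplied factory, not part of SharpCompress:

```csharp
using System;
using System.IO;

sealed class ResumableReadStream : Stream
{
    private readonly Func<long, Stream> _openAt; // opens the source starting at a byte offset
    private Stream _inner;
    private long _position;

    public ResumableReadStream(Func<long, Stream> openAt)
    {
        _openAt = openAt;
        _inner = openAt(0);
    }

    public override int Read(byte[] buffer, int offset, int count)
    {
        try
        {
            int read = _inner.Read(buffer, offset, count);
            _position += read;
            return read;
        }
        catch (IOException)
        {
            // Resume the download where we left off instead of starting over.
            _inner.Dispose();
            _inner = _openAt(_position);
            int read = _inner.Read(buffer, offset, count);
            _position += read;
            return read;
        }
    }

    public override bool CanRead => true;
    public override bool CanSeek => false;
    public override bool CanWrite => false;
    public override long Length => throw new NotSupportedException();
    public override long Position { get => _position; set => throw new NotSupportedException(); }
    public override void Flush() { }
    public override long Seek(long offset, SeekOrigin origin) => throw new NotSupportedException();
    public override void SetLength(long value) => throw new NotSupportedException();
    public override void Write(byte[] buffer, int offset, int count) => throw new NotSupportedException();
    protected override void Dispose(bool disposing) { if (disposing) _inner.Dispose(); base.Dispose(disposing); }
}
```

Since the wrapper reports CanSeek = false, a reader like RarReader should treat it as a plain forward-only stream, same as any other network stream.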

Reference: starred/sharpcompress#447