Questions tagged [split]
The split tag has no summary.
283 questions
3 votes
3 answers
2k views
How do I extract some pages of a PDF into another PDF file?
I have a PDF file with multiple pages, and I want to write a command which extracts some of these pages into a new, separate PDF file; and the pages of interest are not necessarily a contiguous range. ...
0 votes
1 answer
94 views
Splitting a PDF while keeping the index in the new file
I have a PDF file with many tomes in it. Because it contains a lot of pages (>5,000), I want to split it. I have used pdftk like this: pdftk input.pdf cat 487-2987 output second_tome.pdf It ...
-1 votes
2 answers
96 views
Split string with 0-2 / (or determine there's none) (Bash)
Update: Up to 2 "/" in the string. String structure is either: Character set name/LF Character set name/CRLF Character set name/CRLF/(unknown purpose, likely a number) Character set name ...
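The string structure described above (a name followed by up to two "/"-separated fields) can be sketched in Python rather than Bash. This is only an illustration: the function name `split_fields` and the sample values are assumptions, not taken from the question.

```python
def split_fields(value):
    """Split on at most two "/" and pad missing fields with None.

    Returns a 3-tuple: (name, line_ending, extra).
    """
    parts = value.split("/", 2)           # never more than 3 pieces
    parts += [None] * (3 - len(parts))    # pad when there are 0 or 1 slashes
    return tuple(parts)


# Hypothetical sample values, one per structure listed in the question.
for sample in ("UTF-8", "UTF-8/LF", "UTF-8/CRLF", "UTF-8/CRLF/1"):
    print(sample, "->", split_fields(sample))
```

A `None` in the second position signals that the string contained no "/" at all.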
0 votes
1 answer
133 views
Archiving stdout to multiple tapes
I have large files which are generated on the fly to stdout, one every 24 hours. I would like to archive these files progressively on tapes, ideally in a single archive which potentially spans multiple ...
0 votes
1 answer
119 views
Slow down a `split`
I have a really large archive consisting of really small files, concatenated into a single text file, with a "" delimiter. For smaller archives, I would split the archive using "" ...
0 votes
3 answers
148 views
Read file till special char, copy that section into another file, and continue till eof
I am trying to read a file in Linux and as soon as a "&" character is encountered, I am writing the output to another file, sending that file to another folder and then continuing to ...
2 votes
1 answer
191 views
Is it safe to slice bytes directly to split a big file?
In my case, the big file is a tar.gz: I have myBigFile.tar.gz with a size of 52GB. I split it with a chunk size of 2GB, so I have 27 part files. Here is the code I wrote from scratch: from time import ...
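Since the asker's code is cut off, the following is only a minimal sketch of byte-level splitting, not their program. It assumes hypothetical helper names (`split_file`, `join_files`, `sha256_of`) and a `.000`, `.001`, ... part suffix. The point it illustrates is that consecutive byte slices, concatenated in order, reproduce the original file regardless of its format, which a checksum can confirm.

```python
import hashlib


def split_file(path, chunk_size):
    """Write consecutive chunk_size-byte slices of path to path.000, path.001, ..."""
    parts = []
    with open(path, "rb") as src:
        index = 0
        while True:
            chunk = src.read(chunk_size)  # holds one whole chunk in memory
            if not chunk:
                break
            part_path = f"{path}.{index:03d}"
            with open(part_path, "wb") as dst:
                dst.write(chunk)
            parts.append(part_path)
            index += 1
    return parts


def join_files(parts, out_path):
    """Concatenate the parts, in the given order, into out_path."""
    with open(out_path, "wb") as dst:
        for part in parts:
            with open(part, "rb") as src:
                dst.write(src.read())


def sha256_of(path):
    """Hash a file in 1 MiB blocks so large files need little memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for block in iter(lambda: f.read(1 << 20), b""):
            digest.update(block)
    return digest.hexdigest()
```

Comparing `sha256_of(original)` with `sha256_of(rejoined)` after a round trip is a simple way to check that nothing was lost. For multi-gigabyte chunks, copying each chunk in smaller blocks would avoid holding 2GB in memory at once.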
11 votes
3 answers
8k views
What is the state of the art of splitting a binary file by size?
Some background you can happily skip Twenty years ago or so, when navigating the web cost a lot, when I was a Windows-only user, when CDs/DVDs were a large storage medium, and when sharing video ...
2 votes
1 answer
210 views
Merge the split file vs Original compressed file
On Ubuntu 22.04, I found that these 2 methods produce different sha256 sums for archive_tgz: tar czf /a/archive_tgz . tar czf /dev/stdout . | split -d -b 200M - /a/archive. && cat /a/archive.* > /a/...
1 vote
1 answer
1k views
What is the max size limit for using split and cat combination?
I have backed up a file in zip format on Ubuntu whose size is 6.5GB. I am trying to transfer it to a USB stick, so I needed to split it, since I got the error “file size too large” while copying. (Ideally I want my ...
1 vote
3 answers
786 views
Split large file in realtime whilst it is still being written to
I've got a program that will be generating 4 large binary files (400GB+ each) that I need to upload to AWS S3 as quickly as possible. I'd like to begin uploading before the files are completely ...
1 vote
2 answers
298 views
Does splitting a file into several files necessarily mean that some/all of the overall content will not be where it was?
I guess that given a file of a certain size, not all of its bytes will be contiguous on disk (or will they? Just from the existence of the phrase "defragmenting a disk" I assume they will not). ...
0 votes
2 answers
382 views
Linux - split a big file into small files according to a specific character and how many times it repeats
I have a really big file that looks like this: >name1 ACGTACGTACGT ACGTACGTACGT ACGTACGTACGT ACGTACGTACGT >name2 ACGTACGTACGT ACGTACGTACGT ACGTACGTACGT >name ACGTACGTACGT ACGTACGTACGT ...
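For a file laid out like the excerpt above, where each record starts with a ">" header line, one possible approach is to group lines so that every output chunk holds a fixed number of records. The sketch below is an assumption about what is being asked, since the question is truncated. The generator name `chunk_records` and the parameter `records_per_chunk` are hypothetical.

```python
def chunk_records(lines, records_per_chunk):
    """Yield lists of lines, each holding up to records_per_chunk records.

    A record begins at a line starting with ">" and runs until the next one.
    """
    chunk = []
    count = 0
    for line in lines:
        if line.startswith(">"):
            if count == records_per_chunk:
                # The current chunk is full: emit it before starting a new one.
                yield chunk
                chunk, count = [], 0
            count += 1
        chunk.append(line)
    if chunk:
        yield chunk  # the last chunk may hold fewer records


# Small in-memory example modelled on the excerpt.
sample = [">name1", "ACGTACGTACGT", ">name2", "ACGTACGTACGT", ">name", "ACGTACGTACGT"]
for number, chunk in enumerate(chunk_records(sample, 2)):
    print(number, chunk)
```

Because it only iterates over lines, the same function can be fed an open file object, which keeps memory use low for a really big input. Each yielded chunk could then be written to its own output file.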
0 votes
1 answer
16k views
"gzip: stdin: unexpected end of file" after using tar with split
I have roughly 600MB of images in a folder output_test. I then create 100MB chunks of tar.gz files using the following command: tar -czf - output_test/ | split --bytes=100MB -d -a 3 - ...
1 vote
2 answers
240 views
Divide a fasta file with scaffolds into files of the same length, respecting the scaffold ID and the sequence
I am currently working with a large fasta file (3.7GB) that has scaffolds in it. Each scaffold has a unique identifier that starts with > on the first line and on the following line it has the ...