This repository was archived by the owner on Dec 19, 2019. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 130
/
Copy pathfdupes.1
180 lines (169 loc) · 4.87 KB
/
fdupes.1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
.TH FDUPES 1
.\" NAME should be all caps, SECTION should be 1-8, maybe w/ subsection
.\" other parms are allowed: see man(7), man(1)
.SH NAME
fdupes \- finds duplicate files in a given set of directories
.SH SYNOPSIS
.B fdupes
[
.I options
]
.I DIRECTORY
\|.\|.\|.
.SH "DESCRIPTION"
Searches the given path(s) for duplicate files. Such files are found by
comparing file sizes, then partial and full file hashes, followed by a
byte-by-byte comparison.
.SH OPTIONS
.TP
.B -r --recurse
for every directory given follow subdirectories encountered within
.TP
.B -R --recurse:
for each directory given after this option follow subdirectories
encountered within (note the ':' at the end of option; see the
Examples section below for further explanation)
.TP
.B -s --symlinks
follow symlinked directories
.TP
.B -H --hardlinks
normally, when two or more files point to the same disk area they are
treated as non-duplicates; this option will change this behavior
.TP
.B -n --noempty
exclude zero-length files from consideration
.TP
.B -x --xsize=SIZE
exclude files of size < SIZE from consideration; following suffixes can
be used:
.RS
.IP `k'
for kilobytes (units of 1024 bytes)
.IP `M'
for Megabytes (units of 1024 x 1024 bytes)
.IP `G'
for Gigabytes (units of 1024 x 1024 x 1024 bytes)
.RE
.TP
.B -f --omitfirst
omit the first file in each set of matches
.TP
.B -A --nohidden
exclude hidden files from consideration
.TP
.B -1 --sameline
list each set of matches on a single line
.TP
.B -S --size
show size of duplicate files
.TP
.B -m --summarize
summarize duplicate files information
.TP
.B -q --quiet
hide progress indicator
.TP
.B -Q --quick
.B [WARNING: RISK OF DATA LOSS, SEE CAVEATS]
skip byte-for-byte verification of duplicate pairs (use hashes only)
.TP
.B -d --delete
prompt user for files to preserve, deleting all others (see
.B CAVEATS
below)
.TP
.B -L --hardlink
replace all duplicate files with hardlinks to the first file in each set
of duplicates
.TP
.B -N --noprompt
when used together with \-\-delete, preserve the first file in each set of
duplicates and delete the others without prompting the user
.TP
.B -p --permissions
don't consider files with different owner/group or permission bits as
duplicates
.TP
.B -o --order\fR=\fIWORD\fR
order files according to WORD:
time - sort by modification time
name - sort by filename
.TP
.B -O --paramorder
parameter order preservation is more important than the chosen sort; this
is particularly useful with the \fB\-N\fP option to ensure that automatic
deletion behaves in a controllable way
.TP
.B -v --version
display fdupes version
.TP
.B -h --help
displays help
.SH NOTES
Unless
.B -1
or
.B --sameline
is specified, duplicate files are listed together in groups, each file
displayed on a separate line. The groups are then separated from each
other by blank lines.
When
.B -1
or
.B --sameline
is specified, spaces and backslash characters (\fB\e\fP) appearing in
a filename are preceded by a backslash character.
.SH EXAMPLES
.TP
.B fdupes a --recurse: b
will follow subdirectories under b, but not those under a.
.TP
.B fdupes a --recurse b
will follow subdirectories under both a and b.
.TP
.B fdupes -O dir1 dir3 dir2
will always place 'dir1' results first in any match set (where relevant)
.SH CAVEATS
When using
.B \-d
or
.BR \-\-delete ,
care should be taken to insure against accidental data loss.
The
.B \-O
or
.BR \-\-paramorder
option allows the user greater control over what appears in the first
position of a match set, specifically for keeping the \fB\-N\fP option
from deleting all but one file in a set in a seemingly random way. All
directories specified on the command line will be used as the sorting
order of result sets first, followed by the sorting algorithm set by
the \fB\-o\fP or \fB\-\-order\fP option. This means that the order of
all match pairs for a single directory specification will retain the
old sorting behavior even if this option is specified.
When used together with options
.B \-s
or
.BR \-\-symlink ,
a user could accidentally preserve a symlink while deleting the file it
points to.
Furthermore, when specifying a particular directory more than once, all
files within that directory will be listed as their own duplicates,
leading to data loss should a user preserve a file without its "duplicate"
(the file itself!).
The
.B \-Q
or
.BR \-\-quick
option only reads each file once, hashes it, and performs comparisons
based solely on the hashes. There is a small but significant risk of a
hash collision which is the purpose of the failsafe byte-for-byte
comparison that this option explicitly bypasses. Do not use it on ANY data
set for which any amount of data loss is unacceptable. This option is not
included in the help text for the program due to its risky nature.
.B You have been warned!
.SH AUTHOR
Adrian Lopez <[email protected]>
This version, 'fdupes-jody', was forked, is maintained by, and contains
extra code copyrighted by Jody Bruchon <[email protected]>