package File::Modified; use strict; use warnings; use vars qw( @ISA $VERSION ); $VERSION = '0.07'; sub new { my ($class, %args) = @_; my $method = $args{method} || "MD5"; my $files = $args{files} || []; my $self = { Defaultmethod => $method, Files => {}, }; bless $self, $class; $self->addfile(@$files); return $self; }; sub _make_digest_signature { my ($self,$digest) = @_; eval "use Digest::$digest"; if (! $@) { no strict 'refs'; if (defined @{"Digest::${digest}::ISA"}) { @{"File::Modified::Signature::${digest}::ISA"} = qw(File::Modified::Signature::Digest); return 1; }; }; return undef; }; sub add { my ($self,$filename,$method) = @_; $method ||= $self->{Defaultmethod}; my $signatureclass = "File::Modified::Signature::$method"; my $s = eval { $signatureclass->new($filename) }; if (! $@) { return $self->{Files}->{$filename} = $s; } else { # retry and try Digest::$method if ($self->_make_digest_signature($method)) { my $s = $signatureclass->new($filename); return $self->{Files}->{$filename} = $s; } else { return undef; }; }; }; sub addfile { my ($self,@files) = @_; my @result; # We only return something if the caller wants it if (defined wantarray) { push @result, $self->add($_) for @files; return @result; } else { $self->add($_) for @files; }; }; sub update { my ($self) = @_; $_->initialize() for values %{$self->{Files}}; }; sub changed { my ($self) = @_; return map {$_->{Filename}} grep {$_->changed()} (values %{$self->{Files}}); }; 1; { package File::Modified::Signature; # This is a case where Python would be nicer. With Python, we could have (paraphrased) # class File::Modified::Signature; # def initialize(self): # self.hash = self.identificate() # return self # def signature(self): # return MD5(self.filename) # def changed(self): # return self.hash != self.signature() # and it would work as expected, (almost) regardless of the structure that is returned # by self.signature(). This is some DWIMmery that I sometimes miss in Perl. # For now, only string comparisions are allowed. sub create { my ($class,$filename,$signature) = @_; my $self = { Filename => $filename, Signature => $signature, }; bless $self, $class; }; sub new { my ($class,$filename) = @_; my $self = $class->create($filename); $self->initialize(); return $self; }; sub initialize { my ($self) = @_; $self->{Signature} = $self->signature(); return $self; }; sub from_scalar { my ($baseclass,$scalar) = @_; die "Strange value in from_scalar: $scalar\n" unless $scalar =~ /^([^|]+)\|([^|]+)\|(.+)$/; my ($class,$filename,$signature) = ($1,$2,$3); return $class->create($filename,$signature); }; sub as_scalar { my ($self) = @_; return ref($self) . "|" . $self->{Filename} . "|" . $self->{Signature}; }; sub changed { my ($self) = @_; my $currsig = $self->signature(); # FIXME: Deep comparision of the two signatures instead of equality ! # And what's this about string comparisions anyway ? if ((ref $currsig) or (ref $self->{Signature})) { die "Implementation error in $self : changed() can't handle references and complex structures (yet) !\n"; #return $currsig != $self->{Signature}; } else { return $currsig ne $self->{Signature}; }; }; }; { package File::Modified::Signature::mtime; use base 'File::Modified::Signature'; sub signature { my ($self) = @_; my @stat = stat $self->{Filename} or die "Couldn't stat '$self->{Filename}' : $!"; return $stat[9]; }; }; { package File::Modified::Signature::Checksum; use base 'File::Modified::Signature'; sub signature { my ($self) = @_; my $result; if (-e $self->{Filename} and -r $self->{Filename}) { local *F; open F, $self->{Filename} or die "Couldn't read from file '$self->{Filename}' : $!"; binmode F; my $buf; while (read(F,$buf,32768)) { $result += unpack("%32C*", $buf); $result %= 0xFFFFFFFF; }; close F; }; return $result; }; }; { package File::Modified::Signature::Digest; use base 'File::Modified::Signature'; sub digestname { my ($class) = @_; $class = ref $class || $class; return $1 if ($class =~ /^File::Modified::Signature::([^:]+)$/); }; sub digest { my ($self) = @_; if (! exists $self->{Digest}) { my $digestclass = "Digest::" . $self->digestname; eval "use $digestclass"; $self->{Digest} = $digestclass->new(); }; return $self->{Digest}; }; sub signature { my ($self) = @_; my $result; if (-e $self->{Filename} and -r $self->{Filename}) { local *F; open F, $self->{Filename} or die "Couldn't read from file '$self->{Filename}' : $!"; binmode F; $result = $self->digest->addfile(*F)->b64digest(); close F; }; return $result; }; }; 1; __END__ =head1 NAME File::Modified - checks intelligently if files have changed =head1 SYNOPSIS use strict; use File::Modified; my $d = File::Modified->new(files=>['Import.cfg','Export.cfg']); while (1) { my (@changes) = $d->changed; if (@changes) { print "$_ was changed\n" for @changes; $d->update(); }; sleep 60; }; Second example - a script that knows when any of its modules have changed : use File::Modified; my $files = File::Modified->new(files=>[values %INC, $0]); # We want to restart when any module was changed exec $0, @ARGV if $files->changed(); =head1 DESCRIPTION The Modified module is intended as a simple method for programs to detect whether configuration files (or modules they rely on) have changed. There are currently two methods of change detection implemented, C and C. The C method will fall back to use timestamps if the C module cannot be loaded. There is another module, L, which has many similar features, so if this module doesn't do what you need, maybe File::Signature does. There also is quite some overlap between the two modules, code wise. =over 4 =item new %ARGS Creates a new instance. The C<%ARGS> hash has two possible keys, C, which denotes the method used for checking as default, and C, which takes an array reference to the filenames to watch. =item add filename, method Adds a new file to watch. C is the method (or rather, the subclass of C) to use to determine whether a file has changed or not. The result is either the C subclass or undef if an error occurred. =item addfile LIST Adds a list of files to watch. The method used for watching is the default method as set in the constructor. The result is a list of C subclasses. =item update Updates all signatures to the current state. All pending changes are discarded. =item changed Returns a list of the filenames whose files did change since the construction or the last call to C (whichever last occurred). =back =head2 Signatures The module also creates a new namespace C, which sometime will evolve into its own module in its own file. A file signature is most likely of little interest to you; the only time you might want to access the signature directly is to store the signature in a file for persistence and easy comparision whether an index database is current with the actual data. The interface is settled, there are two methods, C and C, that you use to freeze and thaw the signatures. The implementation of these methods is very frugal, there are no provisions made against filenames that contain weird characters like C<\n> or C<|> (the pipe bar), both will be likely to mess up your one-line-per-file database. An interesting method could be to URL-encode all filenames, but I will visit this topic in the next release. Also, complex (that is, non-scalar) signatures are handled rather ungraceful at the moment. Currently, I'm planning to use L as a quoting mechanism to protect against multiline filenames. =head2 Adding new methods for signatures Adding a new signature method is as simple as creating a new subclass of C. See C for a simple example. There is one point of laziness in the implementation of C, the C method can only compare strings instead of arbitrary structures (yes, there ARE things that are easier in Python than in Perl). C is a wrapper for Gisle Aas' L module and allows you to use any module below the C namespace as a signature, for example C and C. =head2 TODO * Make the simple persistence solution for the signatures better using L. * Allow complex structures for the signatures. * Document C or put it down into another namespace. * Extract the C subclasses out into their own file. * Create an easy option to watch a whole directory tree. =head2 EXPORT None by default. =head2 COPYRIGHT AND LICENSE This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself. Copyright (C) 2002 Max Maischein =head1 AUTHOR Max Maischein, Ecorion@cpan.orgE Please contact me if you find bugs or otherwise improve the module. More tests are also very welcome ! =head1 SEE ALSO L,L,L, L. =cut