Environment:
Master and media servers: Windows Server 2008 running NBU 7.6.0.1
Client: HP-UX B.11.31 running NBU client 7.1
Problem: One HP-UX client has recently started having intermittent backup failures on its incremental backups. The incremental backup kicks off as normal, it starts writing for a bit, then fails. Full backups are successful. I am sometimes able to re-run the incremental backup in the morning with success, sometimes not. Backup job detail follows:
2/4/2014 8:37:03 AM - Info nbjm(pid=20084) starting backup job (jobid=2911692) for client bkusolalhpv02, policy AL_FF_Prod_Weekly, schedule Differential
2/4/2014 8:37:03 AM - Info nbjm(pid=20084) requesting STANDARD_RESOURCE resources from RB for backup job (jobid=2911692, request id:{A51247EB-BE3B-44BF-970D-9EA38464D63A})
2/4/2014 8:37:03 AM - requesting resource ALL_DSSU
2/4/2014 8:37:03 AM - requesting resource bkusolcwbdcs001.NBU_CLIENT.MAXJOBS.bkusolalhpv02
2/4/2014 8:37:03 AM - requesting resource bkusolcwbdcs001.NBU_POLICY.MAXJOBS.AL_FF_Prod_Weekly
2/4/2014 8:38:32 AM - granted resource bkusolcwbdcs001.NBU_CLIENT.MAXJOBS.bkusolalhpv02
2/4/2014 8:38:32 AM - granted resource bkusolcwbdcs001.NBU_POLICY.MAXJOBS.AL_FF_Prod_Weekly
2/4/2014 8:38:32 AM - granted resource MediaID=@aaadG;DiskVolume=V:\;DiskPool=DCS008-V-R9;Path=V:\;StorageServer=bkusolpwbdcs008;MediaServer=bkusolpwbdcs008
2/4/2014 8:38:32 AM - granted resource DCS008-V-R9
2/4/2014 8:38:32 AM - estimated 143764416 Kbytes needed
2/4/2014 8:38:32 AM - Info nbjm(pid=20084) started backup (backupid=bkusolalhpv02_1391521112) job for client bkusolalhpv02, policy AL_FF_Prod_Weekly, schedule Differential on storage unit DCS008-V-R9
2/4/2014 8:38:34 AM - started process bpbrm (6272)
2/4/2014 8:38:35 AM - Info bpbrm(pid=6272) bkusolalhpv02 is the host to backup data from
2/4/2014 8:38:35 AM - Info bpbrm(pid=6272) reading file list for client
2/4/2014 8:38:35 AM - connecting
2/4/2014 8:38:38 AM - Info bpbrm(pid=6272) starting bpbkar32 on client
2/4/2014 8:38:38 AM - Info bpbkar32(pid=11839) Backup started
2/4/2014 8:38:38 AM - Info bptm(pid=7392) start
2/4/2014 8:38:38 AM - connected; connect time: 0:00:03
2/4/2014 8:38:39 AM - Info bptm(pid=7392) using 1048576 data buffer size
2/4/2014 8:38:39 AM - Info bptm(pid=7392) setting receive network buffer to 4195328 bytes
2/4/2014 8:38:39 AM - Info bptm(pid=7392) using 48 data buffers
2/4/2014 8:38:39 AM - Info bptm(pid=7392) start backup
2/4/2014 8:38:40 AM - Info bptm(pid=7392) backup child process is pid 6680.5344
2/4/2014 8:38:40 AM - Info bptm(pid=6680) start
2/4/2014 8:38:40 AM - begin writing
2/4/2014 8:38:42 AM - Info bpbrm(pid=6272) from client bkusolalhpv02: TRV - [/var/hpsrp/alhp2ap/export/customer/prod/dsal/data01/hist] is in a different file system from [/var/hpsrp/alhp2ap/export/customer/prod/dsal/data01]. Skipping
2/4/2014 8:38:42 AM - Info bpbrm(pid=6272) from client bkusolalhpv02: TRV - [/var/hpsrp/alhp2ap/export/customer/prod/dsal/data01/encounters] is in a different file system from [/var/hpsrp/alhp2ap/export/customer/prod/dsal/data01]. Skipping
2/4/2014 8:38:58 AM - Info bpbrm(pid=6272) from client bkusolalhpv02: TRV - [/var/hpsrp/alhp2ap/export/customer/prod/dsal/data01] is in a different file system from [/var/hpsrp/alhp2ap/export/customer]. Skipping
2/4/2014 8:38:58 AM - Info bpbrm(pid=6272) from client bkusolalhpv02: TRV - [/var/hpsrp/alhp2ap/export/customer/prod/dsal/data02] is in a different file system from [/var/hpsrp/alhp2ap/export/customer]. Skipping
... (cut out a bunch of this Skipping stuff for the sake of brevity)
2/4/2014 9:19:30 AM - Info bpbrm(pid=6272) from client bkusolalhpv02: TRV - [/var/hpsrp/alhp1ap/var/spool/sockets/pwgr/client2162] is a socket special file. Skipping
2/4/2014 9:19:30 AM - Info bpbrm(pid=6272) from client bkusolalhpv02: TRV - [/var/hpsrp/alhp1ap/var/spool/sockets/pwgr/client2170] is a socket special file. Skipping
2/4/2014 9:19:30 AM - Info bpbrm(pid=6272) from client bkusolalhpv02: TRV - [/var/hpsrp/alhp1ap/var/spool/sockets/pwgr/client2172] is a socket special file. Skipping
2/4/2014 9:19:30 AM - Error bpbrm(pid=6272) socket read failed, An existing connection was forcibly closed by the remote host. (10054)
2/4/2014 9:19:32 AM - Info bpbkar32(pid=11839) done. status: 13: file read failed
2/4/2014 9:19:32 AM - end writing; write time: 0:40:52
file read failed(13)
Any ideas?
Thanks,
Wayne