DBCC CHECKDB
CHECKDB throws us some nice errors:
DBCC CHECKDB ('broken') WITH NO_INFOMSGS, ALL_ERRORMSGS;
GO
Msg 8928, Level 16, State 1, Line 1
Object ID 2073058421, index ID 0, partition ID 72057594038321152, alloc unit ID 72057594042318848 (type In-row data): Page (1:143) could not be processed. See other errors for details.
Msg 8939, Level 16, State 98, Line 1
Table error: Object ID 2073058421, index ID 0, partition ID 72057594038321152, alloc unit ID 72057594042318848 (type In-row data), page (1:143). Test (IS_OFF (BUF_IOERR, pBUF->bstat)) failed. Values are 12716041 and -4.
CHECKDB found 0 allocation errors and 2 consistency errors in table 'brokentable' (object ID 2073058421).
CHECKDB found 0 allocation errors and 2 consistency errors in database 'broken'.
repair_allow_data_loss is the minimum repair level for the errors found by DBCC CHECKDB (broken).
Query errors
Any query that touches that page is going to fail with an 824 error:
SELECT * FROM broken..brokentable;
GO
Msg 824, Level 24, State 2, Line 1
SQL Server detected a logical consistency-based I/O error: incorrect checksum (expected: 0x7232c940; actual: 0x720e4940). It occurred during a read of page (1:143) in database ID 8 at offset 0x0000000011e000 in file 'c:\broken.mdf'. Additional messages in the SQL Server error log or system event log may provide more detail. This is a severe error condition that threatens database integrity and must be corrected immediately. Complete a full database consistency check (DBCC CHECKDB). This error can be caused by many factors; for more information, see SQL Server Books Online.
Backup with CHECKSUM
If you have page checksums turned on, you should always use the WITH CHECKSUM option on backups to validate the page checksums as they are read:
BACKUP DATABASE broken TO DISK='c:\broken2.bck'
WITH CHECKSUM;
GO
Msg 3043, Level 16, State 1, Line 1
BACKUP 'broken' detected an error on page (1:143) in file 'c:\broken.mdf'.
Msg 3013, Level 16, State 1, Line 1
BACKUP DATABASE is terminating abnormally. But we can force it to backup. If this is the only copy of the database we have, and we're being forced to run repair, for instance, then we want to make sure we have a backup to restore from. Even a backup that contains a corrupt database is better than no database at all. In this case, we need to use the CONTINUE_AFTER_ERROR option.
BACKUP DATABASE broken TO DISK='c:\broken2.bck'
WITH CHECKSUM, CONTINUE_AFTER_ERROR;
GO
Processed 160 pages for database 'broken', file 'broken' on file 1.
Processed 1 pages for database 'broken', file 'broken_log' on file 1.
BACKUP WITH CONTINUE_AFTER_ERROR successfully generated a backup of the damaged database. Refer to the SQL Server error log for information about the errors that were encountered.
BACKUP DATABASE successfully processed 161 pages in 2.025 seconds (0.651 MB/sec).
RESTORE VERIFYONLY
Now that we have a backup that has checksum information in it, let's see how we can check whether the backup is valid:
RESTORE VERIFYONLY FROM DISK='c:\broken2.bck';
GO
The backup set was written with damaged data by a BACKUP WITH CONTINUE_AFTER_ERROR.
Isn't that cool? It tells us that the backup was already corrupt when it was written. Ok - let's ask it to specifically check the checksums in the backup:
RESTORE VERIFYONLY FROM DISK='c:\broken2.bck'
WITH CHECKSUM;
GO
The backup set was written with damaged data by a BACKUP WITH CONTINUE_AFTER_ERROR. Same thing. What about if we try to check the checksums on the initial backup?
RESTORE VERIFYONLY FROM DISK='c:\broken.bck'
WITH CHECKSUM;
GO
Msg 3187, Level 16, State 1, Line 1
RESTORE WITH CHECKSUM cannot be specified because the backup set does not contain checksum information.
Msg 3013, Level 16, State 1, Line 1
VERIFY DATABASE is terminating abnormally.
RESTORE
How about we try to overwrite the existing 'broken' database with the one from the second backup that has checksum information in it?
RESTORE DATABASE broken FROM DISK='c:\broken2.bck'
WITH REPLACE;
GO Msg 3183, Level 16, State 1, Line 1
RESTORE detected an error on page (1:143) in database "broken" as read from the backup set.
Msg 3013, Level 16, State 1, Line 1
RESTORE DATABASE is terminating abnormally.
It won't let us because the backup contains corrupt data. But we can force it if we need to with CONTINUE_AFTER_ERROR again.
RESTORE DATABASE broken FROM DISK='c:\broken2.bck'
WITH REPLACE, CONTINUE_AFTER_ERROR;
GO
Processed 160 pages for database 'broken', file 'broken' on file 1.
Processed 1 pages for database 'broken', file 'broken_log' on file 1.
The backup set was written with damaged data by a BACKUP WITH CONTINUE_AFTER_ERROR.
RESTORE WITH CONTINUE_AFTER_ERROR was successful but some damage was encountered. Inconsistencies in the database are possible.
RESTORE DATABASE successfully processed 161 pages in 0.392 seconds (3.364 MB/sec). Isn't that cool? It works BUT it tells us that the backup set contained corrupt data and that the database was restored but could have corrupt data in.
This comment has been removed by a blog administrator.
ReplyDelete