Aug 24

Question

Which TSAFS cluster options are supported by IBM Tivoli Storage Manager ?

Answer

The usual way to load the Novell Netware “Target Service Agents for File Systems” module

(TSAFS.NLM) for use with Tivoli Storage Manager client would be with /nocluster.
This is necessary because the Tivoli Storage Manager client does not support the full Netware cluster presentation of volumes ( \\NODE-1\CLUSTERVOLUME – \\VIRTUAL-SERVER\- )
but can identify the volume names by the first part ( \\NODE-1\CLUSTERVOLUME ).

In special cases the setting /Cluster=1 may be required, e.g. for tools which require the Cluster-Volume to be displayed via the virtual server (virtual IP address), as in the case of the Novell migration tools
(SCMT or Migration-Toolkit), To enable Tivoli Storage Manager to work without a problem despite this setting, the volumes to be backed up are explicitly taken into account by the option /ShowClusterVolumeOnNode=1. This will display volumes in the form
\\NODE-1\CLUSTERVOLUME – \\VIRTUAL-SERVER\CLUSTERVOLUME

To summarize, the Tivoli Storage Manager client on Novell NetWare supports the following combination of options for loading the TSAFS module on a NetWare cluster:
1.) TSAFS.NLM /nocluster
2.) TSAFS.NLM /Cluster=1 /ShowClusterVolumesOnNode=1

Related information

Unable to see Cluster volume even when TSAFS.NLM is loa
The Tivoli Storage Manager client doesn’t process Netwa
Before installing the backup-archive client on Netware

written by Bosse

Jul 16

Error description

  • When backing up a Domino Database using one of the scripts
    provided with the SW, FastBack will wait for the Domino server
    to notify it that the database is stopped.
    In some cases, a delay between the time when the Domino server
    has stopped and the actual change of the data base to a stopped
    state can cause the system to try and restart the Domino
    database prematurely and eventually a failure to restart the
    Domino server will occur.

     

  • Local fix

  • Use updated scripts

     

  • Problem summary

  • ****************************************************************
    * USERS AFFECTED: Users that backup Domino with Fastback *
    * Client *
    ****************************************************************
    * PROBLEM DESCRIPTION: See ERROR DESCRIPTION *
    ****************************************************************
    * RECOMMENDATION: Apply fixing level when available (Use *
    * updated scripts). This problem is *
    * currently projected to be fixed in level *
    * 5.5.7 and 6.11. Note that this is subject *
    * to change at the discretion of IBM *
    ****************************************************************
    *

     

  • Problem conclusion

  • Background: When backing up a Domino Database using one of the
    scripts provided with the SW, FastBack will wait for the
    Domino server to notify it that the database is stopped. In
    some cases, a delay between the time when the Domino server
    has stopped and the actual change of the data base to a
    stopped state can cause the system to try and restart the
    Domino database prematurely and eventually a failure to
    restart the Domino server will occur.Fix: The scripts that are responsible for domino server
    restart modified to fix the problem. 

  • Temporary fix

  • Use updated scripts.
    The scripts can be download from
    http://www-01.ibm.com/support/docview.wss?&uid=swg21417993

     

  • written by Bosse

    Jul 16

    Error description

  • FastBack may sent out a snapshot abort using the external
    notification email for consolidated (cleanup) snapshot although
    there was nothing to cleanup.
    
    In Logs
    
    1 .C:Documents and Settings\All Users\Application
    Data\Tivoli\TSM\FastBack\server\events - EventJobX -
    
    2010/05/07 19:36:42 FBSS7037E  Procedure aborted
    2010/05/07 19:36:42 FBSS1031I  **Consolidated**
    2010/05/07 19:36:42 FBSS1041I  Procedure initiated.
    
    2. C:Documents and Settings\All Users\Application
    Data\Tivoli\TSM\FastBack\server\ - Server logs -
    
    DM_CLEANUP_BuildCleanUpMAP: Nothing to do, No change since last
    cleanup
    DM_CLEANUP_Dispatch: DM_CLEANUP_BuildCleanUpMAP (job [5])
    required abort of job
    JOB_S_AddEvent: [5] 2010/05/07 15:02:42 FBSS1031I
    **Consolidated**

     

  • Local fix

  •  Ignore the failure messages

     

  • Problem summary

  • ****************************************************************
    * USERS AFFECTED: All Fastback Users Running Snapshot Cleanup. *
    ****************************************************************
    * PROBLEM DESCRIPTION: See ERROR DESCRIPTION.                  *
    ****************************************************************
    * RECOMMENDATION: Apply fixing level when available. This      *
    *                 problem is currently projected to be fixed   *
    *                 in level 6.1.1 and 5.5.7. Note that this     *
    *                 is subject to change at the discretion of    *
    *                 IBM.                                         *
    ****************************************************************
  • written by Bosse

    Jul 09
    Question
    How to do I automate a backup scheme for database copies in an a Microsoft Exchange Server 2010 Database Availability Group (DAG) using IBM Tivoli Storage Manager for Mail : Data Protection for Microsoft Exchange Server ?
     
     
    Answer
    When performing backups in a Microsoft Exchange Server 2010 Database Availability Group (DAG) environment, it is recommended to perform the backups on the passive database copies. This helps reduce the resource impact on the production Exchange server when performing backup operations for your enterprise. However, there are instances where a passive database copy is not in a healthy state and the active database copy may need to be used for the backup.The following sample Powershell script automates the backing up of the databases hosted on an Exchange 2010 server in a DAG environment. The script determines if the Exchange 2010 server is a member of a DAG and will back up its databases using the following scheme, consistent with best practices:Backup SchemeThe sample Powershell script uses the following logic for backing up the Exchange 2010 server databases:

    1. Back up all local healthy passive database copies.
    2. Back up all local databases that are not replicated.
    3. Back up all local active database copies where a healthy passive copy is not available.

     

    Note: The Powershell script above is a sample. It is provided as-is.

    The sample Powershell script takes an optional parameter which is the path to a log file. If the parameter is not specified, logging will be directed to a file called “.\backup.log”. The script output is sent to the log file as well as the console.

    Example Output

    2010-06-08 02:45:35 – Server: MYSERVER
    2010-06-08 02:45:38 – Building database backup list…
    2010-06-08 02:45:38 – ==> Backing Up Non Replicated Database ‘Mailbox Database 1′
    2010-06-08 02:45:38 – ==> Backing Up Non Replicated Database ‘Mailbox Database 2′
    2010-06-08 02:45:38 – ==> Skipping Dismounted Non Replicated Database ‘Test Mailbox Database’
    2010-06-08 02:45:38 – ==> Backing Up Non Replicated Database ‘Mailbox Database 3′
    2010-06-08 02:45:38 – ==> Backing Up Healthy Passive Database Copy ‘Mailbox Database 4′
    2010-06-08 02:45:38 – ==> Backing Up Healthy Passive Database Copy ‘Mailbox Database 5′
    2010-06-08 02:45:38 – ==> Backing Up Active Database (No Healthy Copies Found) ‘Mailbox Database 6′
    2010-06-08 02:45:38 – Executing command: ‘C:\Program Files\Tivoli\tsm\TDPExchange\tdpexcc.exe’ BACKUP ‘”Mailbox Database 1″,”Mailbox Database 2″,”Test Mailbox Database”,”Mailbox Database 3″,”Mailbox Database 4″,”Mailbox Database 5″,”Mailbox Database 6″ ‘ FULL /BACKUPMETHOD=VSS /BACKUPDESTINATION=TSM
    2010-06-08 02:47:52 – Backup completed.

    Software Requirements

    http://www-01.ibm.com/support/docview.wss?&uid=swg21430691

    Running the Sample Powershell Script from a Scheduler

    If you want to run the provided sample Powershell script on a scheduled basis using either the IBM Tivoli Storage Manager central scheduler or the Windows Task Scheduler service, follow these steps:

    1. Create a .cmd or .bat command file to run the sample Powershell script.

    2. Add the following statement to the command file to execute the sample Powershell script correctly:

    PowerShell.exe -command “. ‘C:\Program Files\Microsoft\Exchange Server\V14\bin\RemoteExchange.ps1′; Connect-ExchangeServer -auto; backupdag.ps1″

    Note: For more information about launching Powershell scripts, please refer to the following Microsoft TechNet Library section titled “Scripting with the Exchange Management Shell”:
    (http://technet.microsoft.com/en-us/library/bb123798.aspx)

    3. Run the script either from the IBM Tivoli Storage Manager central scheduler as described in “Chapter
    6. Using the Tivoli Storage Manager scheduler” of the IBM Tivoli Storage Manager for Mail: Data Protection for Microsoft Exchange Server Installation and User’s Guide V6.1.2:
    (http://www.ibm.com/support/docview.wss?uid=swg27018572)
    or from a Windows Scheduler Task.

    Note: Running from a Windows Scheduler Task requires the following task security settings:

    • Run as a domain administrator or other privileged user task (i.e. not a System Task)
    • Run whether user is logged in or not.
    • Run with highest privileges. 

    Länk to page at IBM

    written by Bosse

    Jul 01

    Error description

    The TSM Server will perform automatic reorgs of database tables,
    however, online reorgs of the table indexes do not occur
    automatically.  By default, the DB2 policy definitions do not
    allow for automatic, online reorgs of database table indexes,
    thus preventing the TSM Server from performing reorgs of the
    various indexes.  This can result in the continued growth of the
    TSM database.

    Platforms affected:
    All TSM 6.1.x and 6.2.x Servers

    Customer/L2 Diagnostics:
    The DB2 ‘reorgchk’ utility can be used to determine if a reorg
    of the table indexes is required.

    Platforms affected:
    All TSM 6.1.x and 6.2.x Servers

    The DB2 ‘reorgchk’ utility can be used to determine if a reorg
    of the table indexes is required.

    Local fix

    The following actions can be performed to modify the DB2 policy
    definitions to allow for automated, online reorgs of the table
    indexes:
    
    1. Login to the host machine where the TSM Server is running as
       the DB2 instance owner (eg. 'tsminst1')
    2. Issue the following command to connect to the TSM database:
    
       $ db2 connect to tsmdb1
    
    3. Issue the following command to export the existing DB2
       automatic maintenance definitions to a file named
       AutoReorg.xml:
    
       $ db2 "call sysproc.automaint_get_policyfile(
       'AUTO_REORG','AutoReorg.xml')"
    4. The file which contains the exported data is placed by the
       stored procedure in the following location:
    
       <instance dir>/sqllib/tmp/AutoReorg.xml
    
    5. Make a copy of the AutoReorg.xml file as this file can be
       used later in the event it becomes necessary to revert back
       to the original configuration.
    6. Modify the AutoReorgNew.xml file as follows:
    
       - change the indexReorgMode value from "Offline" to "Online"
       - change the useSystemTempTableSpace value from "false" to
         "true"
    
    7. Issue the following command to activate the new DB2 automatic
       maintenance definitions:
    
       $ db2 "call sysproc.automaint_set_policyfile(
       'AUTO_REORG','AutoReorgNew.xml')"
    
    Automatic, online reorgs of the database table indexes can now
    be performed by TSM.  The following command can be issued
    periodically to verify that automatic reorgs of the indexes are
    occurring as expected:
    
       $ db2 list history reorg all for db tsmdb1
    
    NOTE: Should it be necessary to disable automated reorgs of the
    table indexes, issue the following command to update the DB2
    automatic maintenance definitions back to their original values:
    
       $ db2 "call sysproc.automaint_set_policyfile(
       'AUTO_REORG','AutoReorg.xml')"
    
    NOTE: The procedures listed above are specific to UNIX hosts.
    These procedures may also apply to DB2 running on Windows hosts
    with the following distinctions:
    
       - all DB2 commands must be run in a DB2 command window
       - the policy definition file on Windows is named
         DB2AutoReorgPolicy.xml.  Substitute this policy file name
         in place of AutoReorg.xml.
    
    It is recommended that the enabling/disabling of automatic index
    reorgs be performed at a time when the TSM Server is quiesced.

    written by Bosse

    Jul 01
    Abstract
    Information is available for the following five potential data integrity-related issues in the Tivoli Storage Manager FastBack product:
    1. APAR IC64526. (Windows 2008/Vista only) After an Instant Restore for a volume, the next snapshot for that volume might be corrupted.
    2. APAR IC63799. (Disaster Recovery users only) When restoring data from the disaster recovery server, during the final stages of the disaster recovery process for a given chain, a locking mechanism does not lock the files of that chain. As such, if a FastBack Mount, Instant Restore, or Bare Machine Recovery process is started, and the process accesses files that should be locked for the disaster recovery, any data accessed can be corrupted.
    3. APAR IC64238. During an Instant Restore, inconsistent data might be accessed. The issue is that corrupted data might be used during the restore. If corrupted data is used during the restore, data loss occurs.
    4. APAR IC64240. During the cleanup process, a failure might occur. The failure might cause data loss.
    5. APAR IC64414. If the backed-up LUN size is 2 TB or greater, snapshots using Tivoli Storage Manager FastBack are not supported.
     
     
    Content
    For the five potential data integrity-related issues for Tivoli Storage Manager FastBack, more information about the problems, and, if available, a workaround and target fix date, are provided below:
    Issue 1. IC64526 (Windows 2008/Vista only)
    If you use Instant Restore to restore a volume, incremental snapshots taken after the Instant Restore completes might be corrupted. If you use the corrupted snapshots, data loss can occur.Who is affected:
    Customers with Tivoli Storage Manager FastBack Version 5.5.x installed, using Instant Restore to restore protected volumes on systems running either Microsoft Vista or Microsoft Windows 2008.

    Workaround #1:
    After completing an Instant Restore, if an incremental snapshot is taken for any policy containing the volume, do not use any of the subsequent incremental snapshots of the restored volume. You can do either a checkpoint or a full snapshot for all snapshot chains that include this volume. (Remember: A volume might be protected by more than one policy.)

    Complete the following steps:

    •  
      1. From FastBack Manager, go to the Configuration tab.
      2. In the navigation tree, locate the policies.
      3. Right click to select a policy; then, click either Run Check Point or Run Full Snapshot.
        Note: Full snapshots require and use more repository space.

    Workaround #2:
    If you do not want to manually run a checkpoint or full snapshot after each Instant Restore, you can configure the FastBack Client to automatically run a delta block snapshot after the FastBack Client service is restarted, by making a change the FastBack Client configuration file on each system with either Microsoft Windows 2008 or Microsoft Vista, and FastBack Client.

    Complete the following steps:
    1. From a command prompt window, change to the directory with the FastBackClient.ini file. The default path for this file follows: C:\Users\All Users\Tivoli\TSM\FastBack\client
    2. Enter the following command to make a copy of the existing FastBackClient.ini file:

      copy FastBackClient.ini FastBackClient.previous

    This copy is your backup copy.
    3. Using Microsoft WordPad (do not use Notepad), open the FastBackClient.ini file.
    4. Go to the [Save-State] section.
    5. Set the Differential-After-Restart parameter to 0. You can refer to the following sample:

      [Save-State]
      Differential-After-Restart = 0

    The 0 parameter means that the software runs delta block after any restart of the client.
    6. Save and close the FastBackClient.ini file.
    7. Restart the FastBack Client system.

    Note, do not complete the previous steps if you have started a checkpoint or full snapshot from FastBack Manager (described in Workaround #1).

    Fixes:

    Release Vulnerable levels First level with fix within that release
    TSM FastBack 5.5 5.5.0.0 through 5.5.5.119 Target: 5.5.6

    This issue does not affect Tivoli Storage Manager FastBack, Version 6.1.0.0.

    Issue 2. IC63799 (Disaster Recovery Server users only)
    During a disaster recovery process, at the end of the process a locking mechanism does not lock some of the files. If a user starts FastBack Mount, an Instant Restore, or a Bare Machine Recovery and accesses files that should be locked, data that is being restore can become corrupted. If the data is corrupted, data loss can occur.

    Who is affected:
    Anyone who has installed IBM Tivoli Storage Manager FastBack Version 5.5.x and uses the Disaster Recovery Server. The problem occurs when you use FastBack Mount, or run an Instant Restore or Bare Machine Recovery on the Disaster Recovery Server while a disaster recovery task that fails to lock files correctly is running. This problem can also affect any customer that uses FilesX Xpress Restore 3.5.x and uses the Disaster Recovery Server.

    Workaround:
    Complete the following steps:
    1. You may be able to download the following application from the following Microsoft Web site: junction.exe at http://technet.microsoft.com/en-us/sysinternals/bb896768.aspx
    2. Save the junction.exe file to the following location: %ProgramFiles%\Junction\junction.exe
    3. Stop the FastBack Disaster Recovery Service.
    4. From a command line prompt, change to the ftp home directory.
    5. For each branch directory (for example, REP_BranchName), complete the following steps:

    •  
        %Program Files%\Junction\junction.exe <Ftp_root_dir>\REP_BranchName\STAGE <Ftp_root_dir>\REP_BranchName
    • a. Under REP_BranchName, delete the STAGE directory.
      b. Use the junction.exe application to recreate the STAGE directory. To complete this step, enter the following command:

    6. Restart the FastBack Disaster Recovery Server.

    For any REP_BranchName items created in the future, you need to repeat step 5b for the new REP_BranchName item.

    If you want to verify that the junction functionality is working, you can create a file under <Ftp_root_dir>\REP_BranchName\STAGE and verify that the file is seen from the following directory: <Ftp_root_dir>\REP_BranchName

    Fixes:

    Release Vulnerable levels First level with fix within that release
    TSM FastBack 5.5 5.5.0.0 through 5.5.5.119 Target: 5.5.6
    FilesX Xpress Restore 3.5 3.5.165 through 3.5.802 Not planned

    This issue does not affect Tivoli Storage Manager FastBack, Version 6.1.0.0.

    Issue 3. IC64238 (Instant Restore users only)
    During the Instant Restore process, access to the snapshot data in the repository might be denied. For example, a temporary network connectivity issue might restrict access to snapshot data in the repository.

    (Windows 2000, 2003, XP only)
    When access to the snapshot data in the repository is denied, the following message is displayed:

      Read block from source have failed. The problem
      may have been caused by a network failure.
      See log file for more details.
      If the problem was caused by a network failure,
      correct the problem and resume the session.

    (Windows 2008 and Windows Vista only)
    When access to the snapshot data in the repository is denied, the “Read block from source have failed…” message might not be visible. The message is not visible if the system reboots while the Instant Restore process is running. For more details about this issue, you can refer to APAR IC63385.

    You can ensure that this message is displayed by changing the FastBack Mount service mode to manual before launching the Instant Restore process. You can use the Windows Service Manager to make this change.

    (All supported Windows OSs)
    The Instant Restore process is paused until the problem is resolved. After the problem is resolved, you have to manually resume the Instant Restore process.

    When the Instant Restore process is paused, applications can continue to access the volume that is being restored. This access may cause a data loss problem after the Instant Restore process is resumed.

    Who is affected:
    Anyone who uses Instant Restore to recover data with IBM Tivoli Storage Manager FastBack, Version 5.5.x. This problem also affects customer using FilesX Xpress Restore 3.5.x.

    To determine if any data loss has occurred, check the system where the Instant Restore ran. If you identify a problem occurred during an Instant Restore, contact IBM Software Support for Tivoli Storage Manager FastBack.

    Recommendation:
    As an alternative to using Instant Restore, you can use the volume restore process.

    Fixes:

    Release Vulnerable levels First level with fix within that release
    TSM FastBack 5.5 5.5.0.0 through 5.5.5.119 Target: 5.5.6
    FilesX Xpress Restore 3.5 3.5.165 through 3.5.802 Not planned

    This issue does not affect Tivoli Storage Manager FastBack, Version 6.1.0.0.

    Issue 4. IC64240
    When running the cleanup process, Tivoli Storage Manager FastBack might identify a snapshot chain that includes corrupted data. If a chain with corrupted data is detected, by default, the next snapshot that runs creates a full snapshot. This full snapshot is a good back up, but, note, all previous back up data might be corrupted.

    If a snapshot chain that includes corrupted data is identified, snapshots on the chain prior to the full snapshot are available for use. Use these snapshots with caution because using a corrupted chain can result in corrupted data or in data loss.

    Who is affected:
    Anyone who has installed IBM Tivoli Storage Manager FastBack Version 5.5.x or FilesX Xpress Restore 3.5.x.

    This problem also affects anyone who has installed IBM Tivoli Storage Manager FastBack Version 6.1.0.0. For Version 6.1.0.0, a message alerts you to this issue.

    Workaround:
    After a corrupt chain is identified, a message is written to the customer log file. To view the log file, from FastBack Manager, go to the Configuration tab. In the navigation area, there is an entry for the FastBack Server Log. Select this entry to view the log in the main window.

    If a corrupt chain is identified, the following message is in the log file:
    The cleanup process <volume/policy description> failed at a critical point. A repair job will be scheduled to fix the chain.

    (All supported Windows operating systems except Windows 2008)
    If the customer file log is filled with a large amount of data, go to the following directory:

      ../Documents & Settings/All users/application data/Tivoli/TSM/FastBack/server

    Look at the clogxxx.log files and search for the message provided above.

    (Windows 2008 only)

      If the customer file log is filled with a large amount of data, go to the following directory: ..\ProgramData\Tivoli\TSM\FastBack\server

    Look at the clogxxx.log files and search for the message provided above.

    If the customer log file does not include a message about a corrupt chain, you can use the chain to restore data.

    Recommendation:
    Use the Workaround to determine if a corrupt chain has been identified. If the customer log file reports that a chain is corrupted, do not perform a restore of this chain. If you perform a restore using this chain, data loss and / or data corruption occurs (because a snapshot with corrupted data is used).

    If the customer log file reports that a chain is corrupted, before mounting or performing a volume restore, you can use the following procedures to identify the data corruption or data loss issues:

    •  
      1. Run the file system check tool. You might have to refer to the operating system documentation for more help in completing this step.
      2. Run the application consistency check tool. You might have to refer to the operating system documentation for more help in completing this step.

    Fixes:

    Release Vulnerable levels First level with fix within that release
    TSM FastBack 6.1.0 6.1.0.0 Target: 6.1.1
    TSM FastBack 5.5 5.5.0.0 through 5.5.5.119 Target: 5.5.6
    FilesX Xpress Restore 3.5 3.5.165 through 3.5.802 Not planned

    Issue 5. IC64414
    Tivoli Storage Manager FastBack does not support protecting LUNs greater than 2 TB.

    Who is affected:
    All Tivoli Storage Manager FastBack and FilesX Xpress Restore customers backing up LUNs greater than 2 TB.

    written by Bosse

    Jun 29
    Question
    How much space will be reclaimable in the TSM Server database if a DB reorganization is done?
     
     
    Cause
    After a reorganization of the TSM Server database, there was much more space reclaimed than the estimate command initial suggested.
     
     
    Answer
    The ESTIMATE DBREORGSTATS command only analyses the used pages. Any pages that are completely empty will not be included in the analysis. This means that heavily fragmented databases will be able to reduce their assigned capacity to much greater degree than the ESTIMATE DBREORGSTATS command suggests. From the example Q DB F=D below:
                        Available Space (MB): 115,000
                      Assigned Capacity (MB): 114,600
                      Maximum Extension (MB): 400
                      Maximum Reduction (MB): 20
                           Page Size (bytes): 4,096
                          Total Usable Pages: 29,337,600
                                  Used Pages: 8,117,180
                                    Pct Util: 27.7
                               Max. Pct Util: 27.7
                            Physical Volumes: 6
                           Buffer Pool Pages: 104,858
                       Total Buffer Requests: 24,711,191
                              Cache Hit Pct.: 99.11
                             Cache Wait Pct.: 0.00
                         Backup in Progress?: No
                  Type of Backup In Progress:
                Incrementals Since Last Full: 0
              Changed Since Last Backup (MB): 414.17
                          Percentage Changed: 1.31
              Last Complete Backup Date/Time: 12/10/07 15:02:51
          Estimate of Recoverable Space (MB): 6,953
     Last Estimate of Recoverable Space (MB): 12/10/07 21:48:07
    Currently the database can only be reduced by 20 MB, even though more space is technically free and empty. With an assigned capacity of 114,600 MB to the database, and a utilization of 27.7% we would expect that after a DB reorganization at least 82,800 MB of space would be recoverable – the amount between the actually utilized space in the database and the empty space up to the assigned capacity. However as seen the Estimate of Recoverable Space is only 6,953 MB.After the DB reorganization was completed, the assigned capacity was reduced by more than 82,800 MB.

    The reason for this drastic difference, is that the estimate only covers the utilized data. Any portion of the database that is assigned, but not utilized, will not be included in the estimate. Therefore, the difference between the utilized data and the assigned capacity should be roughly included in the final estimate by the TSM Administrator when making a determination on if the reorganization is time and space effective.

    written by Bosse

    Jun 29
    Problem(Abstract)
    On Windows, the db2diag.log can be in different locations depending on the OS version, and there may be more than one Tivoli Storage Manager Server instance.
     
    Symptom
    Incorrect db2diag.log for the Server instance.
     
     
    Resolving the problem
    Depending on the version of the Windows server, the db2diag.log can be in different locations. Also, there is a db2diag.log for each DB2 instance on the server. Use the following information to obtain the correct db2diag.logWindows 2008 server:The default location is:

    C:\ProgramData\IBM\DB2\DB2TSM1\

    Windows 2003:

    The default location is:

    C:\Documents and Settings\All Users\Application Data\IBM\DB2\DB2TSM1

    The example below shows two Server instances, SERVER1 and SERVER2.

    written by Bosse

    Jun 09
    Question
    Can the output of SQL statements be used to make a custom file for script or macro purposes?
     
     
    Answer
    When creating a select statement a common bit of information can be added to every line such as a command to make the output a custom macro or script file. Use a string constant for the command embedded in single quotes as part of the items being selected.
    For example:If the outcome was to checkin all volumes that are currently offsite.select ‘CHECKIN LIBVOLUME LIBRARYNAME ‘, VOLUME_NAME from VOLUMES where ACCESS = ‘OFFSITE’ > checkinoffsite.file

    Unnamed[1] VOLUME_NAME
    —————— ——————
    CHECKIN LIBVOLUME LIBRARYNAME /tmp/00000000.BFS
    CHECKIN LIBVOLUME LIBRARYNAME /tmp/00000001.BFS
    CHECKIN LIBVOLUME LIBRARYNAME /tmp/00000002.BFS
    CHECKIN LIBVOLUME LIBRARYNAME /tmp/00000003.BFS
    CHECKIN LIBVOLUME LIBRARYNAME /tmp/00000004.BFS
    CHECKIN LIBVOLUME LIBRARYNAME /tmp/00000005.BFS
    CHECKIN LIBVOLUME LIBRARYNAME /tmp/00000006.BFS
    CHECKIN LIBVOLUME LIBRARYNAME /tmp/00000007.BFS
    CHECKIN LIBVOLUME LIBRARYNAME /tmp/00000008.BFS
    CHECKIN LIBVOLUME LIBRARYNAME /tmp/00000009.BFS
    CHECKIN LIBVOLUME LIBRARYNAME /tmp/0000000A.BFS

    This will redirect the output to a file, checkinoffsite.file, that will contain a checkin command for every offsite volume. This file can be manipulated to create a script or macro.

    written by Bosse

    Jun 09
    Question
    How can the LockList configuration parameter be adjusted to minimize the potential for DB2 deadlocks in a Tivoli Storage Manager environment?
     
     
    Cause
    If the DB2 LOCKLIST parameter is managed incorrectly, it may cause deadlocks to be returned to the Tivoli Storage Manager application for insert, delete and update requests.
     
     
    Answer
    DB2 LOCKLIST PARAMETERThe DB2 LOCKLIST parameter indicates the amount of storage that is allocated within the Database Heap to store lock information. There is one LockList per database and it contains the locks held by all applications, or in this case Tivoli Storage Manager Server transactions, concurrently connected to the database. Locks are stored and manipulated by DB2 in the form of a request block which consumes 40 bytes of memory on 32-bit platforms or 64 bytes on 64-bit systems. By default, DB2 manages the LockList size automatically and increases or decreases the total LockList allocation based on current activity. To determine the current value, issue the following command:

    db2 “get db cfg for <db name>”

    The LockList is comprised of memory allocated as 4K pages. A LOCKLIST parameter value of 10000 (4KB pages) is equivalent to 40,960,000bytes (about 39MB) of memory allotted to the LockList from the Database Heap. A single lock request may require up to 128 bytes out of the LockList heap as one request block is used for the lock descriptor and the second block is used to associate the Tivoli Storage Manager server transaction with the resource being locked. If the lock resource has already been requested then only one request block is required for future transactions requesting that lock resource.

    IMPACT TO THE TIVOLI STORAGE MANAGER SERVER

    The proper management of the DB2 LockList is crucial to prevent deadlocks within the Tivoli Storage Manager server. When the server is under average workload and the amount of concurrent activity is not driving large spikes of lock requests, the DB2 automatic handling of the LockList works well for the Tivoli Storage Manager server. However, when the amount of concurrent data movement activity is high, the inherent spikes to the DB2 LockList can cause lock escalations which will ultimately result in deadlocks within the server.

    The best case scenario is that the Tivoli Storage Manager server will re-drive the request and the subsequent operation will not fail due to the deadlock. Most Tivoli Storage Manager server operations effectively recover from this but in the case where a lot of data had been moved, deleted, or updated, the time required to process the data a second, or even third time can result in severe performance degradation on the Tivoli Storage Manager server. The worst case scenario is that the operation cannot be retried and the entire operation fails.

    If the amount of concurrent data being moved exceeds 500GB at any given time, it is recommended that the DB2 LOCKLIST parameter be adjusted as explained below. If the Tivoli Storage Manager server is involved with deduplicating large files either from the client, or server, properly tuning the LockList manually is extremely critical as large file deduplication inherently drives the number of rows needing to be managed by DB2 very high. This is because a large file is managed in many small pieces which result in many row locks within the DB2 LockList.

    The following Tivoli Storage Manager server messages are issued when a transaction is being rolled back due to a deadlock within DB2.
    ANR0159E dbieval.c(863): Database deadlock detected on XX:X.
    ANR0162W Supplemental database diagnostic information:
    -X:XXXXX:-XXX ([IBM][CLI Driver][DB2/XXXXX] SQL0911N
    The current transaction has been rolled back because of a
    deadlock or timeout. Reason code “2″. SQLSTATE=40001).    

    TUNING THE DB2 LOCKLIST

    To choose a LOCKLIST parameter value that will minimize the amount of deadlocks that may occur within a Tivoli Storage Manager server environment, the maximum amount of data that can be moved, or managed, within the Tivoli Storage Manager server at any given time must be known or at least approximated with a fair amount of precision. .

    Tuning Best Practice: Increase the DB2 LOCKLIST value by twice the amount of anticipated workload so that there are no unexpected failures during daily and nightly operations.

    It is important to note that increasing the DB2 LOCKLIST parameter value has a direct effect on the amount of memory that is required within the instance memory region. Increasing the parameter should be done with the understanding of what the potential memory implications will be on the system. The following information and examples can be used to tune the DB2 LOCKLIST parameter to an appropriate value based on the Tivoli Storage Manager server environment and workload:

    KEY FORMULA FOR DB2 LOCKLIST ADJUSTMENTSConcurrent data movement

    500GB = 2,500,000 locks = 122000 LOCKLIST

    1TB = 5,000,000 locks = 244000 LOCKLIST

    5TB = 25,000,000 locks = 1220000 LOCKLIST

    MEMORY REQUIREMENTS FOR LOCKLIST

    122000 LOCKLIST = 499MB (122000 X 4096)

    244000 LOCKLIST = 1GB (244000 X 4096)

    1220000 LOCKLIST = 4.9 GB (1220000 X 4096)

    Example #1

    100 Tivoli Storage Manager clients store 1TB of data in a 4 hour nightly window

    Migration has the ability to move 1TB of data in the same 4 hour window

    Total concurrent data movement – 2TB

    DB2 LOCKLIST recommendation – 588000

    Additional impact to system memory (potential) – 2.4 GB

    Example #2

    Migration has the ability to move up to 1TB of data in an 8 hour window

    Reclamation can run in the same window and process up to 2TB of data

    Expiration runs in the same window and can process up to 10 Million files which represent 500GB of data

    Total concurrent data movement/management – 3.5 TB

    DB2 LOCKLIST recommendation – 954000

    Additional impact to system memory (potential) – 3.9GB

    UPDATING THE DB2 LOCKLIST PARAMETERThe following command can be used to update the LOCKLIST parameter:

    db2 “update db cfg for <db name> using LOCKLIST XXXXXXXX”

    XXXXXXXXX = LOCKLIST value

    The new value can be verified by issuing the following command:

    db2 “get db cfg for <db name>”

    NOTE: As a result of updating the DB2 LOCKLIST parameter the MAXLOCKS parameter will be set to manual as well. The MAXLOCKS parameter controls what percentage of the LockList that a single transaction can own at any given time. Please ensure this value is set from 95-100. If it is not in this range, the parameter can be updated with the following command:

    db2 “update db cfg for <db name> using maxlocks 97 “

    written by Bosse