備機網絡故障
[oracle@XAG125 ~]$ dgmgrl sys/123456
DGMGRL> show configuration
Protection Mode: MaxAvailability
Members:
DG125 - Primary database
DG126 - (*) Physical standby database
Fast-Start Failover: ENABLED
Configuration Status:
SUCCESS (status updated 23 seconds ago)
[root@XAG126 ~]$ service network stop
[oracle@XAG125 ~]$ dgmgrl sys/123456
DGMGRL> stop observer
DGMGRL> start observer
[W000 10/10 10:04:20.90] Failed to ping the standby.
[W000 10/10 10:04:34.94] The primary database has requested a transition to the UNSYNC/LAGGING state with the standby database DG126.
[W000 10/10 10:04:34.95] Permission granted to the primary database to transition to UNSYNC/LAGGING state with the standby database DG126.
[W000 10/10 10:04:37.96] The primary database returned to SYNC/NOT LAGGING state with the standby database DG126.
Unable to connect to database using sl
ORA-12170: TNS:Connect timeout occurred
Unable to connect to database using sl
ORA-12170: TNS:Connect timeout occurred
[oracle@XAG125 trace]$ tail -50f alert_DG.log
2018-10-10T10:04:09.843235+08:00
***********************************************************************
Fatal NI connect error 12170.
VERSION INFORMATION:
TNS for Linux: Version 12.2.0.1.0 - Production
TCP/IP NT Protocol Adapter for Linux: Version 12.2.0.1.0 - Production
Time: 10-OCT-2018 10:04:09
Tracing not turned on.
Tns error struct:
ns main err code: 12535
TNS-12535: TNS:operation timed out
ns secondary err code: 12609
nt main err code: 0
nt secondary err code: 0
nt OS err code: 0
Client address: (ADDRESS=(PROTOCOL=tcp)(HOST=192.168.190.126)(PORT=1521))
2018-10-10T10:04:16.315237+08:00
Starting control autobackup
2018-10-10T10:04:17.665037+08:00
Deleted Oracle managed file /u01/app/oracle/fast_recovery_area/DG/DG125/backupset/2018_08_01/o1_mf_annnn_TAG20180801T143329_fp2o1v12_.bkp
2018-10-10T10:04:19.729294+08:00
Control autobackup written to DISK device
handle '/u01/app/oracle/fast_recovery_area/DG/DG125/autobackup/2018_10_10/o1_mf_s_989143456_fvtqk1q3_.bkp'
2018-10-10T10:04:32.915333+08:00
ORA-16198: LGWR received timedout error from KSR
LGWR: Attempting destination LOG_ARCHIVE_DEST_2 network reconnect (16198)
LGWR: Destination LOG_ARCHIVE_DEST_2 network reconnect abandoned
2018-10-10T10:04:32.930353+08:00
Errors in file /u01/app/oracle/diag/rdbms/dg125/DG/trace/DG_lgwr_91480.trc:
ORA-16198: Timeout incurred on internal channel during remote archival
Error 16198 for archive log file 2 to 'sl'
2018-10-10T10:04:32.978165+08:00
Destination LOG_ARCHIVE_DEST_2 is UNSYNCHRONIZED
2018-10-10T10:04:35.209544+08:00
LGWR: Failed to archive log 2 thread 1 sequence 12 (16198)
ORA-16198: LGWR received timedout error from KSR
LGWR: Error 16198 disconnecting from destination LOG_ARCHIVE_DEST_2 standby host 'sl'
2018-10-10T10:04:35.240160+08:00
Thread 1 advanced to log sequence 13 (LGWR switch)
Current log# 3 seq# 13 mem# 0: /u01/app/oracle/oradata/DG/redo03.log
2018-10-10T10:04:35.287429+08:00
Archived Log entry 292 added for T-1.S-12 ID 0x7166494c LAD:1
2018-10-10T10:04:56.275827+08:00
DMON: FSFP network call timeout. Killing process FSFP.
2018-10-10T10:05:11.277033+08:00
Starting background process FSFP
2018-10-10T10:05:11.309923+08:00
FSFP started with pid=61, OS id=92207
2018-10-10T10:05:35.631525+08:00
Shared IO Pool defaulting to 32MB. Trying to get it from Buffer Cache for process 92214.
2018-10-10T10:05:46.312954+08:00
[oracle@XAG125 ~]$ dgmgrl sys/123456
DGMGRL> show configuration
Configuration - ADGbroker
Protection Mode: MaxAvailability
Members:
DG125 - Primary database
Error: ORA-16825: multiple errors or warnings, including fast-start failover-related errors or warnings, detected for the database
DG126 - (*) Physical standby database
Error: ORA-12543: TNS:destination host unreachable
Fast-Start Failover: ENABLED
Configuration Status:
ERROR (status updated 27 seconds ago)
DGMGRL> show database 'DG126' StatusReport
Error: ORA-12543: TNS:destination host unreachable
Error: ORA-16625: cannot reach member "DG126"
DGMGRL> show database 'DG125' StatusReport
STATUS REPORT
INSTANCE_NAME SEVERITY ERROR_TEXT
DG ERROR ORA-16737: the redo transport service for member "DG126" has an error
* WARNING ORA-16817: unsynchronized fast-start failover configuration
重啟網卡
[root@XAG126 ~]$ service network restart
重啟備機監聽及數據庫(一次不行,再做一次)
[oracle@XAG126 ~]$ sqlplus / as sysdba
SQL> shutdown immediate;
SQL> startup;
SQL> exit
[oracle@XAG126 ~]$ lsnrctl start
or(主庫SQL模式下運行如下3條命令)
ALTER SYSTEM SET log_archive_dest_state_2='RESET' SCOPE=BOTH;
ALTER SYSTEM SET log_archive_dest_state_2='RESET' SCOPE=BOTH;
ALTER SYSTEM SET log_archive_dest_state_2='ENABLE' SCOPE=BOTH;
alter system archive log current;
查看DG狀態
[oracle@XAG125 ~]$ dgmgrl sys/123456
DGMGRL> show configuration
Protection Mode: MaxAvailability
Members:
DG125 - Primary database
DG126 - (*) Physical standby database
Fast-Start Failover: ENABLED
Configuration Status:
SUCCESS (status updated 50 seconds ago)
主機網絡故障
主備自動切換后 對原主機 startup mount;