在主节点上停止 SAP HANA 数据库 - SAP HANA 开启 Amazon
Amazon Web Services 文档中描述的 Amazon Web Services 服务或功能可能因区域而异。要查看适用于中国区域的差异,请参阅 中国的 Amazon Web Services 服务入门 (PDF)

本文属于机器翻译版本。若本译文内容与英语原文存在差异,则一律以英文原文为准。

在主节点上停止 SAP HANA 数据库

描述-在正常集群操作期间停止主 SAP HANA 数据库。

运行节点-主要 SAP HANA 数据库节点

运行步骤

  • 优雅地停止主 SAP HANA 数据库,因为 <sid>adm

    [root@prihana ~] su - hdbadm
    hdbadm@prihana:/usr/sap/HDB/HDB00> HDB stop
    hdbdaemon will wait maximal 300 seconds for NewDB services finishing.
    Stopping instance using: /usr/sap/HDB/SYS/exe/hdb/sapcontrol -prot
    NI_HTTP -nr 00 -function Stop 400
    
    12.11.2020 11:39:19
    Stop
    OK
    Waiting for stopped instance using:
    /usr/sap/HDB/SYS/exe/hdb/sapcontrol -prot NI_HTTP -nr 00 -function
    WaitforStopped 600 2
    
    12.11.2020 11:39:51
    WaitforStopped
    OK
    hdbdaemon is stopped.

预期结果

  • 集群检测到已停止的主要 SAP HANA 数据库(在节点 1 上),并将辅助的 SAP HANA 数据库(在节点 2 上)提升为主数据库。

    [root@prihana ~] pcs status
    Cluster name: rhelhanaha
    Stack: corosync
    Current DC: sechana (version 1.1.19-8.el7_6.5-c3c624ea3d) - partition with quorum
    Last updated: Tue Nov 10 17:58:19 2020
    Last change: Tue Nov 10 17:57:41 2020 by root via crm_attribute on sechana
    
    2 nodes configured
    6 resources configured
    
    Online: [ prihana sechana ]
    
    Full list of resources:
    
     clusterfence   (stonith:fence_aws):    Started prihana
     Clone Set: SAPHanaTopology_HDB_00-clone [SAPHanaTopology_HDB_00]
         Started: [ prihana sechana ]
     Master/Slave Set: SAPHana_HDB_00-master [SAPHana_HDB_00]
         Masters: [ sechana ]
         Slaves: [ prihana ]
     hana-oip       (ocf::heartbeat:aws-vpc-move-ip):       Started sechana
    
    Failed Actions:
    * SAPHana_HDB_00_monitor_59000 on prihana 'master (failed)' (9): call=31,
    status=complete, exitreason='',
        last-rc-change='Tue Nov 10 17:56:52 2020', queued=0ms, exec=0ms
    
    
    Daemon Status:
      corosync: active/enabled
      pacemaker: active/enabled
      pcsd: active/enabled
  • 重叠 IP 地址已迁移到新的主地址(在节点 2 上)。

    [root@sechana ~] ip addr show
    1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN
    group default qlen 1000
        link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
        inet 127.0.0.1/8 scope host lo
           valid_lft forever preferred_lft forever
        inet6 ::1/128 scope host
           valid_lft forever preferred_lft forever
    2: eth0: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 9000 qdisc mq state
    UP group default qlen 1000
        link/ether 0e:ef:dd:3c:bf:1b brd ff:ff:ff:ff:ff:ff
        inet xx.xx.xx.xx/24 brd 11.0.2.255 scope global eth0
           valid_lft forever preferred_lft forever
        inet xx.xx.xx.xx/32 scope global eth0:1
           valid_lft forever preferred_lft forever
        inet 192.168.10.16/32 scope global eth0
           valid_lft forever preferred_lft forever
        inet6 fe80::cef:ddff:fe3c:bf1b/64 scope link
           valid_lft forever preferred_lft forever
  • 由于设置AUTOMATED_REGISTER为 true,因此集群会重新启动出现故障的 SAP HANA 数据库,并将其注册到新的主数据库。使用以下命令验证主 SAP HANA 数据库的状态:

    hdbadm@prihana:/usr/sap/HDB/HDB00> sapcontrol -nr 00 -function GetProcessList
    
    10.11.2020 17:59:49
    GetProcessList
    OK
    name, description, dispstatus, textstatus, starttime, elapsedtime, pid
    hdbdaemon, HDB Daemon, GREEN, Running, 2020 11 10 17:58:47, 0:01:02, 25979
    hdbcompileserver, HDB Compileserver, GREEN, Running, 2020 11 10 17:58:52, 0:00:57, 26152
    hdbindexserver, HDB Indexserver-HDB, GREEN, Running, 2020 11 10 17:58:53, 0:00:56, 26201
    hdbnameserver, HDB Nameserver, GREEN, Running, 2020 11 10 17:58:48, 0:01:01, 25997
    hdbpreprocessor, HDB Preprocessor, GREEN, Running, 2020 11 10 17:58:52, 0:00:57, 26155
    hdbwebdispatcher, HDB Web Dispatcher, GREEN, Running, 2020 11 10 17:59:02, 0:00:47, 27100
    hdbxsengine, HDB XSEngine-HDB, GREEN, Running, 2020 11 10 17:58:53, 0:00:56, 26204
    hdbadm@prihana:/usr/sap/HDB/HDB00>

恢复程序

  • 使用以下命令以 root 用户身份清理节点 1 上的 “失败操作”:

    [root@prihana ~] pcs resource cleanup SAPHana_HDB_00 --node prihana
  • 运行清理命令后,“操作失败” 消息应从集群状态中消失。

     [root@prihana ~] pcs status
    Cluster name: rhelhanaha
    Stack: corosync
    Current DC: sechana (version 1.1.19-8.el7_6.5-c3c624ea3d) -
    partition with quorum
    Last updated: Tue Nov 10 18:01:02 2020
    Last change: Tue Nov 10 18:00:45 2020 by root via crm_attribute on sechana
    
    2 nodes configured
    6 resources configured
    
    Online: [ prihana sechana ]
    
    Full list of resources:
    
     clusterfence   (stonith:fence_aws):    Started prihana
     Clone Set: SAPHanaTopology_HDB_00-clone [SAPHanaTopology_HDB_00]
         Started: [ prihana sechana ]
     Master/Slave Set: SAPHana_HDB_00-master [SAPHana_HDB_00]
         Masters: [ sechana ]
         Slaves: [ prihana ]
     hana-oip       (ocf::heartbeat:aws-vpc-move-ip):       Started sechana
    
    Daemon Status:
      corosync: active/enabled
      pacemaker: active/enabled
      pcsd: active/enabled
    [root@prihana ~]