Why new two node cluster not behaving right?Linux HA cluster w/Xen, Heartbeat, Pacemaker. domU does not failover to secondary nodeHow to make active/passive jboss resource in pacemakerHow do I set the pacemaker cluster name with pcs?How to integrate the fence_cisco_ucs.py(python) script with the pacemaker-1.1.10 and corosync-2.3.3Pacemaker error on failback with drbdPacemaker and Stonith : passive node won't bring up resourcesSLES 12 High Availability Cluster - scsi persistant reservation fencingUnable to communicate with pacemaker host while authorisingcorosync systemd resource does not reflect service statusHow to create resource for service using Corosync/Pacemaker

What can cause the front wheel to lock up when going over a small bump?

How bad would a partial hash leak be, realistically?

What does the "c." listed under weapon length mean?

Is this half mask suitable for spray painting?

Can you really not move between grapples/shoves?

Building a road to escape Earth's gravity by making a pyramid on Antartica

Average spam confidence

Why does the Schrödinger equation work so well for the Hydrogen atom despite the relativistic boundary at the nucleus?

Phone number to a lounge, or lounges generally

Strange symbol for two functions

My coworkers think I had a long honeymoon. Actually I was diagnosed with cancer. How do I talk about it?

Implement Homestuck's Catenative Doomsday Dice Cascader

Turing patterns

PL/SQL function to receive a number and return its binary format

Secure offsite backup, even in the case of hacker root access

Why don’t airliners have temporary liveries?

How can drunken, homicidal elves successfully conduct a wild hunt?

What risks are there when you clear your cookies instead of logging off?

Traffic law UK, pedestrians

Smooth switching between 12v batteries, with toggle switch

Why is the application of an oracle function not a measurement?

Do simulator games use a realistic trajectory to get into orbit?

Does an ice chest packed full of frozen food need ice?

Do the English have an ancient (obsolete) verb for the action of the book opening?



Why new two node cluster not behaving right?


Linux HA cluster w/Xen, Heartbeat, Pacemaker. domU does not failover to secondary nodeHow to make active/passive jboss resource in pacemakerHow do I set the pacemaker cluster name with pcs?How to integrate the fence_cisco_ucs.py(python) script with the pacemaker-1.1.10 and corosync-2.3.3Pacemaker error on failback with drbdPacemaker and Stonith : passive node won't bring up resourcesSLES 12 High Availability Cluster - scsi persistant reservation fencingUnable to communicate with pacemaker host while authorisingcorosync systemd resource does not reflect service statusHow to create resource for service using Corosync/Pacemaker






.everyoneloves__top-leaderboard:empty,.everyoneloves__mid-leaderboard:empty,.everyoneloves__bot-mid-leaderboard:empty height:90px;width:728px;box-sizing:border-box;








2















I’m trying to create cluster of two nodes, but it seems to behave a little strange following this guide, but I was unable to do, for example:



[root@afnA ~]# pcs property set stonith-enabled=false
Error: Unable to update cib
Call cib_replace failed (-62): Timer expired


only thing I find in logs are continued corosync events:



Nov 06 01:30:54 corosync [TOTEM ] Retransmit List: 96 97 
Nov 06 01:30:56 corosync [TOTEM ] Retransmit List: 96 97
Nov 06 01:30:57 corosync [TOTEM ] Retransmit List: 96 97
Nov 06 01:30:59 corosync [TOTEM ] Retransmit List: 96 97
Nov 06 01:31:01 corosync [TOTEM ] Retransmit List: 96 97
...


Let me known if more info would help!



pcs cluster report



CentOS 6.7 w/:
pacemaker-1.1.12-8.el6.x86_64
pcs-0.9.139-9.el6_7.1.x86_64
ccs-0.16.2-81.el6.x86_64
resource-agents-3.9.5-24.el6.x86_64
cman-3.0.12.1-73.el6.1.x86_64
corosync-1.4.7-2.el6.x86_64

[root@afnB ~]# pacemakerd --features
Pacemaker 1.1.11 (Build: 97629de) Supporting v3.0.9: generated-manpages agent-manpages ascii-docs ncurses libqb-logging libqb-ipc nagios corosync-plugin cman acls

[root@afnB ~]# corosync-quorumtool -l
Nodeid Name
1 afnA.mxi.tdcfoo
2 afnB.mxi.tdcfoo
[root@afnB ~]# corosync-quorumtool -s
Version: 1.4.7
Nodes: 2
Ring ID: 8
Quorum type: quorum_cman
Quorate: Yes
[root@afnB ~]# pcs status
Cluster name: afn-cluster
WARNING: no stonith devices and stonith-enabled is not false
Last updated: Fri Nov 6 01:35:30 2015
Last change: Fri Nov 6 01:29:37 2015
Stack: cman
Current DC: afna - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured
0 Resources configured

Online: [ afna afnb ]

Full list of resources:


[root@afnB ~]# cat /etc/cluster/cluster.conf



<cluster config_version="1" name="afn-cluster">
<fence_daemon/>
<clusternodes>
<clusternode name="afna" nodeid="1">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="afna"/>
</method>
</fence>
</clusternode>
<clusternode name="afnb" nodeid="2">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="afnb"/>
</method>
</fence>
</clusternode>
</clusternodes>
<cman/>
<fencedevices>
<fencedevice agent="fence_pcmk" name="pcmk"/>
</fencedevices>
<rm>
<failoverdomains/>
<resources/>
</rm>
</cluster>


[root@afnB ~]# grep -v '#' /etc/corosync/corosync.conf



compatibility: whitetank

totem
version: 2

secauth: off
threads: 0

window_size: 150

interface
ringnumber: 0
bindnetaddr: 10.45.69.0
mcastaddr: 239.255.15.1
mcastport: 5405
ttl: 1



logging
fileline: off
to_stderr: no
to_logfile: yes
logfile: /var/log/cluster/corosync.log
to_syslog: yes
debug: off
timestamp: on
logger_subsys
subsys: AMF
debug: off











share|improve this question
























  • Firewalling. It's always firewalling. (disclaimer: may not always be firewalling)

    – womble
    Nov 6 '15 at 1:11











  • Nope have no FW between nodes :)

    – Steffen Winther Sørensen
    Nov 6 '15 at 10:30











  • Seem unicast fixed the issue ;) This was done by changing cman configuration to udpu transport in cluster.conf like this: <cman transport="udpu" port="5405" two_node="1" expected_votes="1” /> Assuming multicast in open vswitch is the culprit as multicasting stopped after 180s f.ex. with omping, also see here

    – Steffen Winther Sørensen
    Nov 6 '15 at 10:32


















2















I’m trying to create cluster of two nodes, but it seems to behave a little strange following this guide, but I was unable to do, for example:



[root@afnA ~]# pcs property set stonith-enabled=false
Error: Unable to update cib
Call cib_replace failed (-62): Timer expired


only thing I find in logs are continued corosync events:



Nov 06 01:30:54 corosync [TOTEM ] Retransmit List: 96 97 
Nov 06 01:30:56 corosync [TOTEM ] Retransmit List: 96 97
Nov 06 01:30:57 corosync [TOTEM ] Retransmit List: 96 97
Nov 06 01:30:59 corosync [TOTEM ] Retransmit List: 96 97
Nov 06 01:31:01 corosync [TOTEM ] Retransmit List: 96 97
...


Let me known if more info would help!



pcs cluster report



CentOS 6.7 w/:
pacemaker-1.1.12-8.el6.x86_64
pcs-0.9.139-9.el6_7.1.x86_64
ccs-0.16.2-81.el6.x86_64
resource-agents-3.9.5-24.el6.x86_64
cman-3.0.12.1-73.el6.1.x86_64
corosync-1.4.7-2.el6.x86_64

[root@afnB ~]# pacemakerd --features
Pacemaker 1.1.11 (Build: 97629de) Supporting v3.0.9: generated-manpages agent-manpages ascii-docs ncurses libqb-logging libqb-ipc nagios corosync-plugin cman acls

[root@afnB ~]# corosync-quorumtool -l
Nodeid Name
1 afnA.mxi.tdcfoo
2 afnB.mxi.tdcfoo
[root@afnB ~]# corosync-quorumtool -s
Version: 1.4.7
Nodes: 2
Ring ID: 8
Quorum type: quorum_cman
Quorate: Yes
[root@afnB ~]# pcs status
Cluster name: afn-cluster
WARNING: no stonith devices and stonith-enabled is not false
Last updated: Fri Nov 6 01:35:30 2015
Last change: Fri Nov 6 01:29:37 2015
Stack: cman
Current DC: afna - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured
0 Resources configured

Online: [ afna afnb ]

Full list of resources:


[root@afnB ~]# cat /etc/cluster/cluster.conf



<cluster config_version="1" name="afn-cluster">
<fence_daemon/>
<clusternodes>
<clusternode name="afna" nodeid="1">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="afna"/>
</method>
</fence>
</clusternode>
<clusternode name="afnb" nodeid="2">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="afnb"/>
</method>
</fence>
</clusternode>
</clusternodes>
<cman/>
<fencedevices>
<fencedevice agent="fence_pcmk" name="pcmk"/>
</fencedevices>
<rm>
<failoverdomains/>
<resources/>
</rm>
</cluster>


[root@afnB ~]# grep -v '#' /etc/corosync/corosync.conf



compatibility: whitetank

totem
version: 2

secauth: off
threads: 0

window_size: 150

interface
ringnumber: 0
bindnetaddr: 10.45.69.0
mcastaddr: 239.255.15.1
mcastport: 5405
ttl: 1



logging
fileline: off
to_stderr: no
to_logfile: yes
logfile: /var/log/cluster/corosync.log
to_syslog: yes
debug: off
timestamp: on
logger_subsys
subsys: AMF
debug: off











share|improve this question
























  • Firewalling. It's always firewalling. (disclaimer: may not always be firewalling)

    – womble
    Nov 6 '15 at 1:11











  • Nope have no FW between nodes :)

    – Steffen Winther Sørensen
    Nov 6 '15 at 10:30











  • Seem unicast fixed the issue ;) This was done by changing cman configuration to udpu transport in cluster.conf like this: <cman transport="udpu" port="5405" two_node="1" expected_votes="1” /> Assuming multicast in open vswitch is the culprit as multicasting stopped after 180s f.ex. with omping, also see here

    – Steffen Winther Sørensen
    Nov 6 '15 at 10:32














2












2








2


1






I’m trying to create cluster of two nodes, but it seems to behave a little strange following this guide, but I was unable to do, for example:



[root@afnA ~]# pcs property set stonith-enabled=false
Error: Unable to update cib
Call cib_replace failed (-62): Timer expired


only thing I find in logs are continued corosync events:



Nov 06 01:30:54 corosync [TOTEM ] Retransmit List: 96 97 
Nov 06 01:30:56 corosync [TOTEM ] Retransmit List: 96 97
Nov 06 01:30:57 corosync [TOTEM ] Retransmit List: 96 97
Nov 06 01:30:59 corosync [TOTEM ] Retransmit List: 96 97
Nov 06 01:31:01 corosync [TOTEM ] Retransmit List: 96 97
...


Let me known if more info would help!



pcs cluster report



CentOS 6.7 w/:
pacemaker-1.1.12-8.el6.x86_64
pcs-0.9.139-9.el6_7.1.x86_64
ccs-0.16.2-81.el6.x86_64
resource-agents-3.9.5-24.el6.x86_64
cman-3.0.12.1-73.el6.1.x86_64
corosync-1.4.7-2.el6.x86_64

[root@afnB ~]# pacemakerd --features
Pacemaker 1.1.11 (Build: 97629de) Supporting v3.0.9: generated-manpages agent-manpages ascii-docs ncurses libqb-logging libqb-ipc nagios corosync-plugin cman acls

[root@afnB ~]# corosync-quorumtool -l
Nodeid Name
1 afnA.mxi.tdcfoo
2 afnB.mxi.tdcfoo
[root@afnB ~]# corosync-quorumtool -s
Version: 1.4.7
Nodes: 2
Ring ID: 8
Quorum type: quorum_cman
Quorate: Yes
[root@afnB ~]# pcs status
Cluster name: afn-cluster
WARNING: no stonith devices and stonith-enabled is not false
Last updated: Fri Nov 6 01:35:30 2015
Last change: Fri Nov 6 01:29:37 2015
Stack: cman
Current DC: afna - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured
0 Resources configured

Online: [ afna afnb ]

Full list of resources:


[root@afnB ~]# cat /etc/cluster/cluster.conf



<cluster config_version="1" name="afn-cluster">
<fence_daemon/>
<clusternodes>
<clusternode name="afna" nodeid="1">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="afna"/>
</method>
</fence>
</clusternode>
<clusternode name="afnb" nodeid="2">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="afnb"/>
</method>
</fence>
</clusternode>
</clusternodes>
<cman/>
<fencedevices>
<fencedevice agent="fence_pcmk" name="pcmk"/>
</fencedevices>
<rm>
<failoverdomains/>
<resources/>
</rm>
</cluster>


[root@afnB ~]# grep -v '#' /etc/corosync/corosync.conf



compatibility: whitetank

totem
version: 2

secauth: off
threads: 0

window_size: 150

interface
ringnumber: 0
bindnetaddr: 10.45.69.0
mcastaddr: 239.255.15.1
mcastport: 5405
ttl: 1



logging
fileline: off
to_stderr: no
to_logfile: yes
logfile: /var/log/cluster/corosync.log
to_syslog: yes
debug: off
timestamp: on
logger_subsys
subsys: AMF
debug: off











share|improve this question
















I’m trying to create cluster of two nodes, but it seems to behave a little strange following this guide, but I was unable to do, for example:



[root@afnA ~]# pcs property set stonith-enabled=false
Error: Unable to update cib
Call cib_replace failed (-62): Timer expired


only thing I find in logs are continued corosync events:



Nov 06 01:30:54 corosync [TOTEM ] Retransmit List: 96 97 
Nov 06 01:30:56 corosync [TOTEM ] Retransmit List: 96 97
Nov 06 01:30:57 corosync [TOTEM ] Retransmit List: 96 97
Nov 06 01:30:59 corosync [TOTEM ] Retransmit List: 96 97
Nov 06 01:31:01 corosync [TOTEM ] Retransmit List: 96 97
...


Let me known if more info would help!



pcs cluster report



CentOS 6.7 w/:
pacemaker-1.1.12-8.el6.x86_64
pcs-0.9.139-9.el6_7.1.x86_64
ccs-0.16.2-81.el6.x86_64
resource-agents-3.9.5-24.el6.x86_64
cman-3.0.12.1-73.el6.1.x86_64
corosync-1.4.7-2.el6.x86_64

[root@afnB ~]# pacemakerd --features
Pacemaker 1.1.11 (Build: 97629de) Supporting v3.0.9: generated-manpages agent-manpages ascii-docs ncurses libqb-logging libqb-ipc nagios corosync-plugin cman acls

[root@afnB ~]# corosync-quorumtool -l
Nodeid Name
1 afnA.mxi.tdcfoo
2 afnB.mxi.tdcfoo
[root@afnB ~]# corosync-quorumtool -s
Version: 1.4.7
Nodes: 2
Ring ID: 8
Quorum type: quorum_cman
Quorate: Yes
[root@afnB ~]# pcs status
Cluster name: afn-cluster
WARNING: no stonith devices and stonith-enabled is not false
Last updated: Fri Nov 6 01:35:30 2015
Last change: Fri Nov 6 01:29:37 2015
Stack: cman
Current DC: afna - partition with quorum
Version: 1.1.11-97629de
2 Nodes configured
0 Resources configured

Online: [ afna afnb ]

Full list of resources:


[root@afnB ~]# cat /etc/cluster/cluster.conf



<cluster config_version="1" name="afn-cluster">
<fence_daemon/>
<clusternodes>
<clusternode name="afna" nodeid="1">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="afna"/>
</method>
</fence>
</clusternode>
<clusternode name="afnb" nodeid="2">
<fence>
<method name="pcmk-redirect">
<device name="pcmk" port="afnb"/>
</method>
</fence>
</clusternode>
</clusternodes>
<cman/>
<fencedevices>
<fencedevice agent="fence_pcmk" name="pcmk"/>
</fencedevices>
<rm>
<failoverdomains/>
<resources/>
</rm>
</cluster>


[root@afnB ~]# grep -v '#' /etc/corosync/corosync.conf



compatibility: whitetank

totem
version: 2

secauth: off
threads: 0

window_size: 150

interface
ringnumber: 0
bindnetaddr: 10.45.69.0
mcastaddr: 239.255.15.1
mcastport: 5405
ttl: 1



logging
fileline: off
to_stderr: no
to_logfile: yes
logfile: /var/log/cluster/corosync.log
to_syslog: yes
debug: off
timestamp: on
logger_subsys
subsys: AMF
debug: off








linux failovercluster pacemaker corosync






share|improve this question















share|improve this question













share|improve this question




share|improve this question








edited May 20 at 19:17









chicks

3,08072033




3,08072033










asked Nov 6 '15 at 0:58









Steffen Winther SørensenSteffen Winther Sørensen

616




616












  • Firewalling. It's always firewalling. (disclaimer: may not always be firewalling)

    – womble
    Nov 6 '15 at 1:11











  • Nope have no FW between nodes :)

    – Steffen Winther Sørensen
    Nov 6 '15 at 10:30











  • Seem unicast fixed the issue ;) This was done by changing cman configuration to udpu transport in cluster.conf like this: <cman transport="udpu" port="5405" two_node="1" expected_votes="1” /> Assuming multicast in open vswitch is the culprit as multicasting stopped after 180s f.ex. with omping, also see here

    – Steffen Winther Sørensen
    Nov 6 '15 at 10:32


















  • Firewalling. It's always firewalling. (disclaimer: may not always be firewalling)

    – womble
    Nov 6 '15 at 1:11











  • Nope have no FW between nodes :)

    – Steffen Winther Sørensen
    Nov 6 '15 at 10:30











  • Seem unicast fixed the issue ;) This was done by changing cman configuration to udpu transport in cluster.conf like this: <cman transport="udpu" port="5405" two_node="1" expected_votes="1” /> Assuming multicast in open vswitch is the culprit as multicasting stopped after 180s f.ex. with omping, also see here

    – Steffen Winther Sørensen
    Nov 6 '15 at 10:32

















Firewalling. It's always firewalling. (disclaimer: may not always be firewalling)

– womble
Nov 6 '15 at 1:11





Firewalling. It's always firewalling. (disclaimer: may not always be firewalling)

– womble
Nov 6 '15 at 1:11













Nope have no FW between nodes :)

– Steffen Winther Sørensen
Nov 6 '15 at 10:30





Nope have no FW between nodes :)

– Steffen Winther Sørensen
Nov 6 '15 at 10:30













Seem unicast fixed the issue ;) This was done by changing cman configuration to udpu transport in cluster.conf like this: <cman transport="udpu" port="5405" two_node="1" expected_votes="1” /> Assuming multicast in open vswitch is the culprit as multicasting stopped after 180s f.ex. with omping, also see here

– Steffen Winther Sørensen
Nov 6 '15 at 10:32






Seem unicast fixed the issue ;) This was done by changing cman configuration to udpu transport in cluster.conf like this: <cman transport="udpu" port="5405" two_node="1" expected_votes="1” /> Assuming multicast in open vswitch is the culprit as multicasting stopped after 180s f.ex. with omping, also see here

– Steffen Winther Sørensen
Nov 6 '15 at 10:32











0






active

oldest

votes












Your Answer








StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "2"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);

StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);

else
createEditor();

);

function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);



);













draft saved

draft discarded


















StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fserverfault.com%2fquestions%2f734346%2fwhy-new-two-node-cluster-not-behaving-right%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown

























0






active

oldest

votes








0






active

oldest

votes









active

oldest

votes






active

oldest

votes















draft saved

draft discarded
















































Thanks for contributing an answer to Server Fault!


  • Please be sure to answer the question. Provide details and share your research!

But avoid


  • Asking for help, clarification, or responding to other answers.

  • Making statements based on opinion; back them up with references or personal experience.

To learn more, see our tips on writing great answers.




draft saved


draft discarded














StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fserverfault.com%2fquestions%2f734346%2fwhy-new-two-node-cluster-not-behaving-right%23new-answer', 'question_page');

);

Post as a guest















Required, but never shown





















































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown

































Required, but never shown














Required, but never shown












Required, but never shown







Required, but never shown







Popular posts from this blog

Wikipedia:Vital articles Мазмуну Biography - Өмүр баян Philosophy and psychology - Философия жана психология Religion - Дин Social sciences - Коомдук илимдер Language and literature - Тил жана адабият Science - Илим Technology - Технология Arts and recreation - Искусство жана эс алуу History and geography - Тарых жана география Навигация менюсу

Bruxelas-Capital Índice Historia | Composición | Situación lingüística | Clima | Cidades irmandadas | Notas | Véxase tamén | Menú de navegacióneO uso das linguas en Bruxelas e a situación do neerlandés"Rexión de Bruxelas Capital"o orixinalSitio da rexiónPáxina de Bruselas no sitio da Oficina de Promoción Turística de Valonia e BruxelasMapa Interactivo da Rexión de Bruxelas-CapitaleeWorldCat332144929079854441105155190212ID28008674080552-90000 0001 0666 3698n94104302ID540940339365017018237

What should I write in an apology letter, since I have decided not to join a company after accepting an offer letterShould I keep looking after accepting a job offer?What should I do when I've been verbally told I would get an offer letter, but still haven't gotten one after 4 weeks?Do I accept an offer from a company that I am not likely to join?New job hasn't confirmed starting date and I want to give current employer as much notice as possibleHow should I address my manager in my resignation letter?HR delayed background verification, now jobless as resignedNo email communication after accepting a formal written offer. How should I phrase the call?What should I do if after receiving a verbal offer letter I am informed that my written job offer is put on hold due to some internal issues?Should I inform the current employer that I am about to resign within 1-2 weeks since I have signed the offer letter and waiting for visa?What company will do, if I send their offer letter to another company