[Pgcluster-general] Recovery after one node failure
mitani at sraw.co.jp
mitani at sraw.co.jp
Tue Mar 25 18:22:30 UTC 2008
Hi Pavel,
Thank you for sending the patch.
I'm going to see it.
BTW,
At first, please start (as recovery) the Cluster DB when you re-start Node1.
It uses replication server of Node2.
After that, you can start replication server in Node1.
I recommend this startup sequence as restart node procedure.
Regards,
------------
At.Mitani
-- original message --
From: Pavel Shaydo<pshaydo at elverils.com>
To: <pgcluster-general at pgfoundry.org>
Sent: Tue, 25 Mar 2008 17:15:28 +0300
Subject: Re: [Pgcluster-general] Recovery after one node failure
>Hello all,
>
>first I'm forget to say that I'm using pgcluster-1.7.0rc12.
>
>It seems that I've found cause of the problem. Pgreplicate processes
>periodically sends pings to dbcluster processes. If dbcluster process
>becomes unavailable, then pgreplicate marks it as DB_TBL_ERROR and do
>not pings anymore. When dbcluster starting again it can synchronize
>itself using one of pgreplicate servers. This server marks it as
>DB_TBL_USE after recovery, but other pgreplicates knows nothing about
>this and not replicate queries to restored dbcluster.
>
>Patch that solves this problem for me attached.
>
>--
>Best regards,
>Pavel Shaydo
>
>
>-----Original Message-----
>From: Pavel Shaydo <pshaydo at elverils.com>
>Reply-To: pgcluster-general at pgfoundry.org
>To: Pgcluster-general <pgcluster-general at pgfoundry.org>
>Subject: [Pgcluster-general] Recovery after one node failure
>Date: Fri, 21 Mar 2008 15:52:30 +0300
>
>Hello all,
>
>we have following configuration:
>
>Node1: Node2:
>ClusterDB1 ClusterDB2
>Pgreplicate1 Pgreplicate2
>
>It works, if we inserting record into table at one node it appears
>at another, same for deletion. If we put Node1 down, insert some
>records into table at Node2 and when turn on Node1 again then Node1
>gets synchronized with Node2. After this if we inserting record into
>table at Node1 it appears at Node2, but not vice versa -- if we
>inserting record into table at Node2 it not replicated to Node1.
>After restarting pgreplicate on Node2 all works again.
>
>
>
>_______________________________________________
>Pgcluster-general mailing list
>Pgcluster-general at pgfoundry.org
>http://pgfoundry.org/mailman/listinfo/pgcluster-general
>
More information about the Pgcluster-general
mailing list