Connecting to the volfile server takes too long when the cluster has many nodes and glusterd on the mount server restarts #4442

shenwen2012 · 2024-12-19T07:03:56Z

Description of problem:
When a cluster has many nodes (for example 100) and a volume is mounted with `mount -t glusterfs localhost:volname /mnt`, restarting glusterd on the local node leaves the client unable to reconnect to glusterd for a long time. After the disconnect, the client tries to connect to glusterd on the other nodes, but its socket is bound to localhost, so none of those connections can succeed. It tries the other 99 nodes before it tries the local glusterd again, which is why reconnecting takes so long.
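The delay described above can be estimated with a small model. This is only a sketch: the function names are hypothetical, and it assumes the client walks every other server once, that each of those attempts fails, and that each failed attempt costs a fixed time that you supply yourself (the issue does not state the actual value).

```c
#include <stddef.h>

/* Hypothetical model, not glusterfs code: number of connection attempts
 * that fail before the client gets back to the local glusterd, assuming
 * it tries each of the other nodes exactly once. */
static size_t
failed_attempts_before_local(size_t node_count)
{
    return node_count > 0 ? node_count - 1 : 0;
}

/* Estimated delay before the local glusterd is tried again.
 * per_attempt_seconds is an assumed cost per failed attempt. */
static double
estimated_delay_seconds(size_t node_count, double per_attempt_seconds)
{
    return (double)failed_attempts_before_local(node_count) *
           per_attempt_seconds;
}
```

For a 100-node cluster this gives 99 failed attempts, so the total delay grows linearly with cluster size under these assumptions.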

The exact command to reproduce the issue:

The full output of the command that failed:

Expected results:

Mandatory info:
- The output of the gluster volume info command:

- The output of the gluster volume status command:

- The output of the gluster volume heal command:

- Provide logs present on following locations of client and server nodes:
/var/log/glusterfs/

- Is there any crash? Provide the backtrace and coredump:

Additional info:

- The operating system / glusterfs version:

Note: Please hide any confidential data which you don't want to share in public like IP address, file name, hostname or any other configuration


shenwen2012 · 2024-12-19T07:10:59Z

Proposed fix:
When glusterd goes down, the client switches to another glusterd server. Add a check before that switch: `if (strncmp(local_str, rpc_trans->myinfo.identifier, strlen(local_str)))`, with `local_str = "127.0.0.1"`.
If the mount server address is 127.0.0.1 or localhost, the client should not switch to another glusterd server, because only the connection to the local glusterd can succeed. The change modifies the code at this commit: c2d1fa5#diff-8e9a4103e4eb659a9277b69d6b824a195a919df31ea2a384807e3a2444680064R2832
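The check can be sketched in isolation as follows. This is a minimal illustration, not the actual patch: both helper names are hypothetical, and it assumes the identifier string begins with the local IP address (for example `127.0.0.1:1015`). Only the `strncmp` prefix comparison against `"127.0.0.1"` comes from the proposal itself.

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

/* Hypothetical helper, not part of glusterfs: true when the local end of
 * the connection is bound to the loopback address. The identifier is
 * assumed to start with the local IP, e.g. "127.0.0.1:1015". */
static bool
mounted_via_loopback(const char *identifier)
{
    static const char local_str[] = "127.0.0.1";

    assert(identifier != NULL);
    /* Prefix match, mirroring the strncmp() check proposed in the issue. */
    return strncmp(identifier, local_str, strlen(local_str)) == 0;
}

/* Hypothetical decision function: rotate to the next volfile server only
 * when the mount does not go through loopback. */
static bool
should_try_next_volfile_server(const char *identifier)
{
    return !mounted_via_loopback(identifier);
}
```

With this sketch, an identifier such as `127.0.0.1:1015` keeps the client on the local glusterd, while `192.168.1.20:1015` still allows failover. A plain prefix match like this does not cover the IPv6 loopback `::1`.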

When a volume is mounted through the localhost (127.0.0.1) address and glusterd goes down, the client should not switch to another glusterd server, because a socket bound to 127.0.0.1 cannot connect to another node's address. The volfile server round-robin therefore filters out 127.0.0.1.

Fixes: gluster#4442

Signed-off-by: tanshiping <[email protected]>

…lusterd-server change connect glusterd addr filter localhost (gluster#4442)

shenwen2012 mentioned this issue Dec 19, 2024

change connect glusterd addr filter localhost (#4442) shenwen2012/glusterfs#1

Merged

shenwen2012 closed this as completed in shenwen2012/glusterfs#1 Dec 20, 2024

shenwen2012 added a commit to shenwen2012/glusterfs that referenced this issue Dec 20, 2024

Merge pull request #1 from shenwen2012/filter-localhost-round-robin-g…

92131a2

…lusterd-server change connect glusterd addr filter localhost (gluster#4442)

shenwen2012 mentioned this issue Dec 20, 2024

change connect glusterd addr filter localhost (#4442) #4443

Open
