By: UKFast Support user 16 Feb 2022 at 9:46 a.m. CST

Hi Team, I have installed Cluster Manager 4.3 on a fresh install of Ubuntu, following the guide on the Gluu Docs. However, when I run Cluster Manager for the first time, it does not create the `auth.ini` file. Cluster Manager is also refusing connections, even though iptables rules and firewall rules are in place. Is it possible to get some assistance with this? Kind regards, UKFast Support

By Devrim Yatar staff 16 Feb 2022 at 11:51 a.m. CST

Hi, Let me test and see if I can replicate the issue. The `auth.ini` file is created after registering with Cluster Manager. Regards

By Devrim Yatar staff 16 Feb 2022 at 12:56 p.m. CST

Hi, I followed https://gluu.org/docs/cm/4.3/installation/#on-ubuntu-20 on a clean (new) Ubuntu 20 server. I did not face any issues. Regards.

By UKFast Support user 17 Feb 2022 at 3:07 a.m. CST

Hi, I have checked the `$HOME/.clustermgr4` directory and the `auth.ini` file is still not present. I have also tried a reinstall on another fresh machine, and it still does not create the file.

By Devrim Yatar staff 17 Feb 2022 at 4:02 a.m. CST

Hi, The `auth.ini` file is created after you sign up, that is, after you click the "Sign Up" button on the following screen: ![](https://gluu.org/docs/cm/4.3/img/Cluster_Manager-01.png)

By UKFast Support user 17 Feb 2022 at 4:09 a.m. CST

Hi, Thanks for getting back to me. I have tried to use the local SSH however this is asking me for a password. Can you confirm if this is to be expected and what the default password would be?

By UKFast Support user 17 Feb 2022 at 4:25 a.m. CST

Hi There, I have tested this using my admin user, which allowed me in. However, the tunnel it created is refusing connections. iptables is set to allow all at the moment. Is there anything in the config that may cause this? All config is default at the moment. Kind regards,

By Devrim Yatar staff 18 Feb 2022 at 3:01 a.m. CST

Hi, There is nothing special in the Cluster Manager configuration. This is all you need:

1) If you don't have one, create an SSH key on the Desktop Machine (DM) where you run the browser: `ssh-keygen -t rsa -b 4096 -m PEM`. You will have a file at `~/.ssh/id_rsa.pub`.
2) On the Cluster Manager machine (CM), create a user named `cluster` and write the content of the `id_rsa.pub` file created in Step 1 to that user's `~/.ssh/authorized_keys`.
3) On the DM, execute the following command: `ssh -L 5000:localhost:5000 cluster@address.of.cluster.manager`
4) On the DM, browse to http://localhost:5000

Regards
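
Once the tunnel from step 3 is up, a quick way to confirm that something answers on the forwarded port is a small HTTP probe run on the DM. This is a minimal sketch, not part of Cluster Manager; the helper name `probe_http` is made up for illustration, and the only value taken from the steps above is the URL http://localhost:5000.

```python
import http.client
import urllib.error
import urllib.request
from typing import Optional


def probe_http(url: str, timeout: float = 3.0) -> Optional[int]:
    """Return the HTTP status code served at `url`, or None if nothing answers."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as exc:
        # The server answered, just not with a success status.
        return exc.code
    except (OSError, http.client.HTTPException):
        # Refused connections, timeouts, DNS failures and broken replies end up here.
        return None


if __name__ == "__main__":
    # URL from step 4 above; run this while the ssh -L tunnel is open.
    status = probe_http("http://localhost:5000")
    if status is None:
        print("Nothing is answering on localhost:5000 (tunnel down or web server not running).")
    else:
        print(f"Got HTTP status {status} through the tunnel.")
```

A `None` result only says that no HTTP reply came back; it does not tell you whether the tunnel or the web server behind it is at fault.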

By UKFast Support user 18 Feb 2022 at 8:55 a.m. CST

Hi There, Thank you for all your help so far. We have managed to get a route into the device; however, we now have the following error showing:

An unhandled exception occured

Traceback (most recent call last):
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/engine/base.py", line 1802, in _execute_context
    self.dialect.do_execute(
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/engine/default.py", line 732, in do_execute
    cursor.execute(statement, parameters)
sqlite3.OperationalError: no such table: appconfig

The above exception was the direct cause of the following exception:

Traceback (most recent call last):
  File "/usr/local/lib/python3.8/dist-packages/flask/app.py", line 1514, in full_dispatch_request
    rv = self.preprocess_request()
  File "/usr/local/lib/python3.8/dist-packages/flask/app.py", line 1857, in preprocess_request
    rv = self.ensure_sync(before_func)()
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/application.py", line 143, in before_request
    appconfig = AppConfiguration.query.first()
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/orm/query.py", line 2810, in first
    return self.limit(1)._iter().first()
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/orm/query.py", line 2894, in _iter
    result = self.session.execute(
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/orm/session.py", line 1692, in execute
    result = conn._execute_20(statement, params or {}, execution_options)
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/engine/base.py", line 1614, in _execute_20
    return meth(self, args_10style, kwargs_10style, execution_options)
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/sql/elements.py", line 325, in _execute_on_connection
    return connection._execute_clauseelement(
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/engine/base.py", line 1481, in _execute_clauseelement
    ret = self._execute_context(
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/engine/base.py", line 1845, in _execute_context
    self._handle_dbapi_exception(
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/engine/base.py", line 2026, in _handle_dbapi_exception
    util.raise_(
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/util/compat.py", line 207, in raise_
    raise exception
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/engine/base.py", line 1802, in _execute_context
    self.dialect.do_execute(
  File "/usr/local/lib/python3.8/dist-packages/sqlalchemy/engine/default.py", line 732, in do_execute
    cursor.execute(statement, parameters)
sqlalchemy.exc.OperationalError: (sqlite3.OperationalError) no such table: appconfig
[SQL: SELECT appconfig.id AS appconfig_id, appconfig.replication_dn AS appconfig_replication_dn, appconfig.replication_pw AS appconfig_replication_pw, appconfig.last_test AS appconfig_last_test, appconfig.gluu_version AS appconfig_gluu_version, appconfig.use_ip AS appconfig_use_ip, appconfig.nginx_host AS appconfig_nginx_host, appconfig.log_purge AS appconfig_log_purge, appconfig.admin_email AS appconfig_admin_email, appconfig.monitoring AS appconfig_monitoring, appconfig.modify_hosts AS appconfig_modify_hosts, appconfig.nginx_ip AS appconfig_nginx_ip, appconfig.ldap_update_period AS appconfig_ldap_update_period, appconfig.object_class_base AS appconfig_object_class_base, appconfig.attribute_oid AS appconfig_attribute_oid, appconfig.external_load_balancer AS appconfig_external_load_balancer, appconfig.cache_host AS appconfig_cache_host, appconfig.cache_ip AS appconfig_cache_ip, appconfig.nginx_os AS appconfig_nginx_os, appconfig.use_ldap_cache AS appconfig_use_ldap_cache, appconfig.ldap_update_period_unit AS appconfig_ldap_update_period_unit, appconfig.nginx_os_type AS appconfig_nginx_os_type, appconfig.latest_version AS appconfig_latest_version, appconfig.offline AS appconfig_offline, appconfig.gluu_archive AS appconfig_gluu_archive, appconfig.ldap_cache_clean_period AS appconfig_ldap_cache_clean_period, appconfig.nginx_ssh_port AS appconfig_nginx_ssh_port FROM appconfig LIMIT ? OFFSET ?]
[parameters: (1, 0)]
(Background on this error at: https://sqlalche.me/e/14/e3q8)

Are you able to assist on this? Kind regards, Clifton Meldrum
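
The `no such table: appconfig` error means the SQLite file the application opened has no `appconfig` table, which is what you would expect if the database migrations never ran. A minimal sketch for checking this directly, assuming you know where the SQLite file lives: its path is not stated in this thread, so `DB_PATH` below is a placeholder, and the helper names are invented for illustration.

```python
import sqlite3
from pathlib import Path
from typing import List


def list_tables(db_path: str) -> List[str]:
    """Return the table names in the SQLite file at `db_path`, opened read-only."""
    # mode=ro avoids silently creating an empty database when the path is wrong.
    uri = Path(db_path).resolve().as_uri() + "?mode=ro"
    connection = sqlite3.connect(uri, uri=True)
    try:
        rows = connection.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    finally:
        connection.close()
    return [name for (name,) in rows]


def has_table(db_path: str, table: str) -> bool:
    """True when `table` exists in the database at `db_path`."""
    return table in list_tables(db_path)


if __name__ == "__main__":
    DB_PATH = "/path/to/clustermgr.db"  # placeholder, not a path confirmed by this thread
    try:
        print("appconfig present:", has_table(DB_PATH, "appconfig"))
    except sqlite3.OperationalError as exc:
        print("Could not open the database:", exc)
```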

By Devrim Yatar staff 21 Feb 2022 at 3:18 a.m. CST

Hi, Did you start Cluster Manager with the following command? `clustermgr4-cli start` If so, can you send the output of this command? If it is already running, please stop it with `clustermgr4-cli stop` and start it again to capture the output. Regards.

By UKFast Support user 25 Feb 2022 at 5:52 a.m. CST

Hi Mustafa, Please see the output below:

Creating clustermgr data directory
Upgrading Database
Traceback (most recent call last):
  File "/usr/local/bin/clusterapp.py", line 102, in <module>
    cli()
  File "/usr/local/lib/python3.8/dist-packages/click/core.py", line 1128, in __call__
    return self.main(*args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/flask/cli.py", line 601, in main
    return super().main(*args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/click/core.py", line 1053, in main
    rv = self.invoke(ctx)
  File "/usr/local/lib/python3.8/dist-packages/click/core.py", line 1653, in invoke
    cmd_name, cmd, args = self.resolve_command(ctx, args)
  File "/usr/local/lib/python3.8/dist-packages/click/core.py", line 1700, in resolve_command
    cmd = self.get_command(ctx, cmd_name)
  File "/usr/local/lib/python3.8/dist-packages/flask/cli.py", line 544, in get_command
    self._load_plugin_commands()
  File "/usr/local/lib/python3.8/dist-packages/flask/cli.py", line 540, in _load_plugin_commands
    self.add_command(ep.load(), ep.name)
  File "/usr/local/lib/python3.8/dist-packages/pkg_resources/__init__.py", line 2457, in load
    self.require(*args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/pkg_resources/__init__.py", line 2480, in require
    items = working_set.resolve(reqs, env, installer, extras=self.extras)
  File "/usr/local/lib/python3.8/dist-packages/pkg_resources/__init__.py", line 788, in resolve
    raise VersionConflict(dist, req).with_context(dependent_req)
pkg_resources.ContextualVersionConflict: (MarkupSafe 1.1.0 (/usr/lib/python3/dist-packages), Requirement.parse('MarkupSafe>=2.0'), {'Jinja2'})
Starting Celery Worker
celery multi v5.2.3 (dawn-chorus) > Starting nodes... > worker@RD-UAT-GLUU-CM: OK
Starting Celery Beat
Starting Gunicorn Web Server

By Devrim Yatar staff 28 Feb 2022 at 7:33 a.m. CST

Hi, The error string is not included in the previous message. Did you crop the output of `clustermgr4-cli start`? Regards.

By UKFast Support user 28 Feb 2022 at 10:57 a.m. CST

Hi Mustafa, The output above is what was shown when I started Cluster Manager. It wasn't cropped. Can you provide an example of what is supposed to be present? Kind regards, UKFast Support

By Devrim Yatar staff 01 Mar 2022 at 3:36 a.m. CST

Hi, I just installed Cluster Manager on a fresh Ubuntu 20. When I start it, I get the following:

```
root@ubuntu20cm:~# clustermgr4-cli start
Creating clustermgr data directory
Upgrading Database
INFO [alembic.runtime.migration] Context impl SQLiteImpl.
INFO [alembic.runtime.migration] Will assume non-transactional DDL.
INFO [alembic.runtime.migration] Running upgrade -> 5246a3f7a7e4, consolidated all server models
INFO [alembic.runtime.migration] Running upgrade 5246a3f7a7e4 -> fb4b3b50c872, adds nginx_host to appconfig
INFO [alembic.runtime.migration] Running upgrade fb4b3b50c872 -> d14788040223, added redis and stunnel flags to server
INFO [alembic.runtime.migration] Running upgrade d14788040223 -> 987b4b9f18bb, add log_purge column to appconfig
INFO [alembic.runtime.migration] Running upgrade 987b4b9f18bb -> 2a0c96ff52f6, add admin_email field
INFO [alembic.runtime.migration] Running upgrade 2a0c96ff52f6 -> 2926b8ffa2f0, empty message
INFO [alembic.runtime.migration] Running upgrade 2926b8ffa2f0 -> 8385d7dcdcdc, add filebeat column
INFO [alembic.runtime.migration] Running upgrade 8385d7dcdcdc -> 601d70820f5d, add modify_hosts to AppConfiguration
INFO [alembic.runtime.migration] Running upgrade 601d70820f5d -> 4fca7c65b3df, add nginx_ip column to appconfig
INFO [alembic.runtime.migration] Running upgrade 4fca7c65b3df -> b21895d83725, add ldap_update_period to AppConfiguration
INFO [alembic.runtime.migration] Running upgrade b21895d83725 -> 0e7453df1644, update keyrotation
INFO [alembic.runtime.migration] Running upgrade 0e7453df1644 -> 5cc68648b63e, add monitoring for servers
INFO [alembic.runtime.migration] Running upgrade 5cc68648b63e -> dd4a741b7bd0, empty message
INFO [alembic.runtime.migration] Running upgrade dd4a741b7bd0 -> ac40bca26010, empty message
INFO [alembic.runtime.migration] Running upgrade ac40bca26010 -> 55c791eac900, empty message
INFO [alembic.runtime.migration] Running upgrade 55c791eac900 -> 0c35f58a2e73, empty message
INFO [alembic.runtime.migration] Running upgrade 0c35f58a2e73 -> 22f6c85b4b24, empty message
INFO [alembic.runtime.migration] Running upgrade 22f6c85b4b24 -> ab705f5bfabf, add cache_host and cache_ip
INFO [alembic.runtime.migration] Running upgrade ab705f5bfabf -> 3ca18f178674, add nginx_os
INFO [alembic.runtime.migration] Running upgrade 3ca18f178674 -> 804b57c2b8ae, empty message
INFO [alembic.runtime.migration] Running upgrade 804b57c2b8ae -> 791f94bdd1f3, empty message
INFO [alembic.runtime.migration] Running upgrade 791f94bdd1f3 -> 59f615b01c3e, add nginx_os_type to AppConfiguration
INFO [alembic.runtime.migration] Running upgrade 59f615b01c3e -> 198c383101dd, update AppConfiguration
INFO [alembic.runtime.migration] Running upgrade 198c383101dd -> 555f03405e07, add CacheServer
INFO [alembic.runtime.migration] Running upgrade 555f03405e07 -> f24e4a10b19b, add ldap_cache_clean_period to AppConfiguration
INFO [alembic.runtime.migration] Running upgrade f24e4a10b19b -> 325c760b3ec7, add ssh port
INFO [alembic.runtime.migration] Running upgrade 325c760b3ec7 -> 86b22e4bf65d, empty message
Starting Celery Worker
celery multi v5.2.3 (dawn-chorus)
> Starting nodes...
> worker@ubuntu20cm: OK
Starting Celery Beat
Starting Gunicorn Web Server
```

By Devrim Yatar staff 01 Mar 2022 at 3:37 a.m. CST

Here are the commands I used to install Cluster Manager:

```
root@ubuntu20cm:~# cat .bash_history
apt install python3-pip python3-dev libssl-dev libffi-dev
apt install build-essential
pip3 install https://github.com/GluuFederation/redislite/archive/master.zip
pip3 install https://github.com/GluuFederation/cluster-mgr/archive/4.3.zip
clustermgr4-cli start
```

By Devrim Yatar staff 01 Mar 2022 at 3:44 a.m. CST

Hi, I think I figured out your issue:

```
pkg_resources.ContextualVersionConflict: (MarkupSafe 1.1.0 (/usr/lib/python3/dist-packages), Requirement.parse('MarkupSafe>=2.0'), {'Jinja2'})
```

Somehow MarkupSafe version 1.1.0 was installed on your server. Could you upgrade it to `2.1.0` with the following command:

```
pip3 install --upgrade MarkupSafe==2.1.0
```

Then

```
clustermgr4-cli stop
clustermgr4-cli start
```

Regards
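
To see which MarkupSafe version the Python interpreter actually picks up before and after the upgrade, a small check with the standard library's `importlib.metadata` can help. This is a sketch under assumptions: the helper names are invented, the version comparison only handles plain dotted numbers such as `2.1.0`, and it must be run with the same `python3` that Cluster Manager uses.

```python
from importlib import metadata
from typing import Optional, Tuple


def version_tuple(version: str) -> Tuple[int, ...]:
    """Turn '2.1.0' into (2, 1, 0); only plain dotted numeric versions are handled."""
    return tuple(int(part) for part in version.split("."))


def meets_minimum(installed: str, minimum: str) -> bool:
    """Compare two dotted versions, padding the shorter one with zeros."""
    a, b = version_tuple(installed), version_tuple(minimum)
    width = max(len(a), len(b))
    a += (0,) * (width - len(a))
    b += (0,) * (width - len(b))
    return a >= b


def installed_version(distribution: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("MarkupSafe")
    if found is None:
        print("MarkupSafe is not installed for this interpreter.")
    else:
        # 'MarkupSafe>=2.0' is the requirement quoted in the error above.
        print(f"MarkupSafe {found}; satisfies >=2.0: {meets_minimum(found, '2.0')}")
```

The path in the error (`/usr/lib/python3/dist-packages`) differs from the pip location in the tracebacks (`/usr/local/lib/python3.8/dist-packages`), so checking what the interpreter resolves is more telling than checking what pip last installed.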

By UKFast Support user 01 Mar 2022 at 7:09 a.m. CST

Hi Mustafa, Thanks for getting back to us. This has indeed solved the issue, thank you so much for your help. Kind regards, UKFast Support

By UKFast Support user 01 Mar 2022 at 9:18 a.m. CST

Hi Team, We are now getting the following errors:

Traceback (most recent call last):
  File "/usr/local/lib/python3.8/dist-packages/flask/app.py", line 1516, in full_dispatch_request
    rv = self.dispatch_request()
  File "/usr/local/lib/python3.8/dist-packages/flask/app.py", line 1502, in dispatch_request
    return self.ensure_sync(self.view_functions[rule.endpoint])(**req.view_args)
  File "/usr/local/lib/python3.8/dist-packages/flask_login/utils.py", line 272, in decorated_view
    return func(*args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/views/index.py", line 489, in multi_master_replication
    rep_status = get_opendj_replication_status()
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/core/utils.py", line 266, in get_opendj_replication_status
    installer = Installer(
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/core/clustermgr_installer.py", line 68, in __init__
    self.settings()
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/core/clustermgr_installer.py", line 79, in settings
    if ('Ubuntu' in self.server_os) or ('Debian' in self.server_os):
TypeError: argument of type 'NoneType' is not iterable
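
The last frame shows why the request fails: `self.server_os` is `None`, and Python's `in` operator cannot search inside `None`. The sketch below reproduces that failure mode and shows one defensive way to write such a check. It is purely illustrative: `is_debian_family` is a hypothetical helper, not Cluster Manager code, and it does not explain why the OS value was missing in the first place.

```python
from typing import Optional


def is_debian_family(server_os: Optional[str]) -> bool:
    """True when the OS string names Ubuntu or Debian; False when it is unknown (None)."""
    if server_os is None:
        # An undetected OS is treated as "not Debian-like" instead of raising.
        return False
    return "Ubuntu" in server_os or "Debian" in server_os


if __name__ == "__main__":
    try:
        # Same shape as the failing line in the traceback, with the value set to None.
        "Ubuntu" in None  # type: ignore[operator]
    except TypeError as exc:
        print("Unguarded check fails:", exc)

    print("Guarded check with None:", is_debian_family(None))
    print("Guarded check with 'Ubuntu 20':", is_debian_family("Ubuntu 20"))
```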

By Devrim Yatar staff 01 Mar 2022 at 10:48 a.m. CST

Hi, Try these:

1. Stop Cluster Manager.
2. Remove the folder `/root/.clustermgr4`.
3. Start Cluster Manager.

By UKFast Support user 02 Mar 2022 at 3:31 a.m. CST

Hi Mustafa, I have done the above, but this seems to have made things worse. Please see below:

admin@local Gluu Cluster Manager4.3-01 Dashboard Cluster Replication Monitoring Logging Management Key Rotation Settings Operations

Installing Gluu Server on RD-UAT-GLUU-CL1
Step 1 Perpare Server
Step 2 Install Gluu Container
Step 3 Run setup.py
Step 4 Post Installation

Perpare Server
Install Gluu Container
Run setup.py
Post Installation

Connecting LDAP Server: RD-UAT-GLUU-CL1
Downloading certificates from primary server and uploading to this server
Running tar -zcf /tmp/certs_B84A.tgz /opt/gluu-server/etc/certs/ /opt/gluu-server/install/community-edition-setup/output/scim-rp.jks /opt/gluu-server/etc/gluu/conf/passport-config.json
tar: Removing leading `/' from member names
tar: Removing leading `/' from hard link targets
tar: /opt/gluu-server/install/community-edition-setup/output/scim-rp.jks: Cannot stat: No such file or directory
tar: Exiting with failure status due to previous errors
Downloading remote file /tmp/certs_B84A.tgz to local /tmp/certs_176A.tgz
File /tmp/certs_B84A.tgz was downloaded as /tmp/certs_176A.tgz.
Uploading local file /tmp/certs_176A.tgz to remote server as /tmp/certs.tgz
File /tmp/certs_176A.tgz was uploaded as /tmp/certs.tgz.
Running tar -zxf /tmp/certs.tgz -C /
Manuplating keys
Running ssh -q -o IdentityFile=/etc/gluu/keys/gluu-console -o UserKnownHostsFile=/dev/null -o Port=60022 -o StrictHostKeyChecking=no -o PubkeyAuthentication=yes root@localhost "/opt/jre/bin/keytool -delete -alias UAT-RDGLUU-NGINX_httpd -keystore /opt/jre/jre/lib/security/cacerts -storepass changeit"
Running ssh -q -o IdentityFile=/etc/gluu/keys/gluu-console -o UserKnownHostsFile=/dev/null -o Port=60022 -o StrictHostKeyChecking=no -o PubkeyAuthentication=yes root@localhost "/opt/jre/bin/keytool -delete -alias UAT-RDGLUU-NGINX_httpd -keystore /opt/jre/jre/lib/security/cacerts -storepass changeit"
keytool error: java.lang.Exception: Alias <UAT-RDGLUU-NGINX_httpd> does not exist
Running ssh -q -o IdentityFile=/etc/gluu/keys/gluu-console -o UserKnownHostsFile=/dev/null -o Port=60022 -o StrictHostKeyChecking=no -o PubkeyAuthentication=yes root@localhost "/opt/jre/bin/keytool -import -trustcacerts -alias UAT-RDGLUU-NGINX_httpd -file /etc/certs/httpd.crt -keystore /opt/jre/jre/lib/security/cacerts -storepass changeit -noprompt"
Warning: use -cacerts option to access cacerts keystore
Certificate was added to keystore
Running ssh -q -o IdentityFile=/etc/gluu/keys/gluu-console -o UserKnownHostsFile=/dev/null -o Port=60022 -o StrictHostKeyChecking=no -o PubkeyAuthentication=yes root@localhost "/opt/jre/bin/keytool -delete -alias UAT-RDGLUU-NGINX_opendj -keystore /opt/jre/jre/lib/security/cacerts -storepass changeit"
keytool error: java.lang.Exception: Alias <UAT-RDGLUU-NGINX_opendj> does not exist
Running ssh -q -o IdentityFile=/etc/gluu/keys/gluu-console -o UserKnownHostsFile=/dev/null -o Port=60022 -o StrictHostKeyChecking=no -o PubkeyAuthentication=yes root@localhost "/opt/jre/bin/keytool -import -trustcacerts -alias UAT-RDGLUU-NGINX_opendj -file /etc/certs/opendj.crt -keystore /opt/jre/jre/lib/security/cacerts -storepass changeit -noprompt"
Warning: use -cacerts option to access cacerts keystore
Certificate was added to keystore
Downloading custom schema filesfrom primary server and upload to this server
Creating base dn for o=metric backend

An Unhandled Exception Raised

Traceback (most recent call last):
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/tasks/server.py", line 213, in task_install_gluu_server
    install_gluu_server(task_id, server_id)
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/tasks/server.py", line 823, in install_gluu_server
    if not ldapc.connect():
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/core/ldap_functions.py", line 59, in connect
    return self.conn.bind()
  File "/usr/local/lib/python3.8/dist-packages/ldap3/core/connection.py", line 589, in bind
    self.open(read_server_info=False)
  File "/usr/local/lib/python3.8/dist-packages/ldap3/strategy/sync.py", line 57, in open
    BaseStrategy.open(self, reset_usage, read_server_info)
  File "/usr/local/lib/python3.8/dist-packages/ldap3/strategy/base.py", line 146, in open
    raise exception_history[0][0]
ldap3.core.exceptions.LDAPSocketOpenError: socket connection error while opening: [Errno 111] Connection refused

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/lib/python3.8/dist-packages/celery/app/trace.py", line 451, in trace_task
    R = retval = fun(*args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/application.py", line 25, in __call__
    return TaskBase.__call__(self, *args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/celery/app/trace.py", line 734, in __protected_call__
    return self.run(*args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/tasks/server.py", line 215, in task_install_gluu_server
    raise Exception(traceback.format_exc())
Exception: Traceback (most recent call last):
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/tasks/server.py", line 213, in task_install_gluu_server
    install_gluu_server(task_id, server_id)
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/tasks/server.py", line 823, in install_gluu_server
    if not ldapc.connect():
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/core/ldap_functions.py", line 59, in connect
    return self.conn.bind()
  File "/usr/local/lib/python3.8/dist-packages/ldap3/core/connection.py", line 589, in bind
    self.open(read_server_info=False)
  File "/usr/local/lib/python3.8/dist-packages/ldap3/strategy/sync.py", line 57, in open
    BaseStrategy.open(self, reset_usage, read_server_info)
  File "/usr/local/lib/python3.8/dist-packages/ldap3/strategy/base.py", line 146, in open
    raise exception_history[0][0]
ldap3.core.exceptions.LDAPSocketOpenError: socket connection error while opening: [Errno 111] Connection refused

By Devrim Yatar staff 02 Mar 2022 at 3:44 a.m. CST

Hi, It seems the installation of the primary server was not completed successfully. SSH into the primary server (manually), uninstall it, and then re-install it via Cluster Manager.

By UKFast Support user 02 Mar 2022 at 10:37 a.m. CST

Hi Mustafa, Thanks for confirming. There seems to be an issue with the GLUU install, whenever the container restarts it seems to shut off all processes and these do not come back up. Is there a fix for this?

By UKFast Support user 02 Mar 2022 at 10:45 a.m. CST

Hi Mustafa, We are still getting the below error:

Traceback (most recent call last):
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/tasks/server.py", line 213, in task_install_gluu_server
    install_gluu_server(task_id, server_id)
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/tasks/server.py", line 823, in install_gluu_server
    if not ldapc.connect():
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/core/ldap_functions.py", line 59, in connect
    return self.conn.bind()
  File "/usr/local/lib/python3.8/dist-packages/ldap3/core/connection.py", line 589, in bind
    self.open(read_server_info=False)
  File "/usr/local/lib/python3.8/dist-packages/ldap3/strategy/sync.py", line 57, in open
    BaseStrategy.open(self, reset_usage, read_server_info)
  File "/usr/local/lib/python3.8/dist-packages/ldap3/strategy/base.py", line 146, in open
    raise exception_history[0][0]
ldap3.core.exceptions.LDAPSocketOpenError: socket connection error while opening: [Errno 110] Connection timed out

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "/usr/local/lib/python3.8/dist-packages/celery/app/trace.py", line 451, in trace_task
    R = retval = fun(*args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/application.py", line 25, in __call__
    return TaskBase.__call__(self, *args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/celery/app/trace.py", line 734, in __protected_call__
    return self.run(*args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/tasks/server.py", line 215, in task_install_gluu_server
    raise Exception(traceback.format_exc())
Exception: Traceback (most recent call last):
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/tasks/server.py", line 213, in task_install_gluu_server
    install_gluu_server(task_id, server_id)
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/tasks/server.py", line 823, in install_gluu_server
    if not ldapc.connect():
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/core/ldap_functions.py", line 59, in connect
    return self.conn.bind()
  File "/usr/local/lib/python3.8/dist-packages/ldap3/core/connection.py", line 589, in bind
    self.open(read_server_info=False)
  File "/usr/local/lib/python3.8/dist-packages/ldap3/strategy/sync.py", line 57, in open
    BaseStrategy.open(self, reset_usage, read_server_info)
  File "/usr/local/lib/python3.8/dist-packages/ldap3/strategy/base.py", line 146, in open
    raise exception_history[0][0]
ldap3.core.exceptions.LDAPSocketOpenError: socket connection error while opening: [Errno 110] Connection timed out

By UKFast Support user 02 Mar 2022 at 11:17 a.m. CST

Hi There, It also looks like Cluster Manager is changing each Gluu server's hostname to the hostname of the load balancer. Can you confirm why this is happening?

By UKFast Support user 02 Mar 2022 at 11:29 a.m. CST

Hi There, This error is also showing on the log screen:

Traceback (most recent call last):
  File "/usr/local/lib/python3.8/dist-packages/flask/app.py", line 1516, in full_dispatch_request
    rv = self.dispatch_request()
  File "/usr/local/lib/python3.8/dist-packages/flask/app.py", line 1502, in dispatch_request
    return self.ensure_sync(self.view_functions[rule.endpoint])(**req.view_args)
  File "/usr/local/lib/python3.8/dist-packages/flask_login/utils.py", line 272, in decorated_view
    return func(*args, **kwargs)
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/views/logserver.py", line 115, in index
    return render_template("log_index.html", form=form, logs=logs,
  File "/usr/local/lib/python3.8/dist-packages/flask/templating.py", line 147, in render_template
    return _render(
  File "/usr/local/lib/python3.8/dist-packages/flask/templating.py", line 128, in _render
    rv = template.render(context)
  File "/usr/local/lib/python3.8/dist-packages/jinja2/environment.py", line 1291, in render
    self.environment.handle_exception()
  File "/usr/local/lib/python3.8/dist-packages/jinja2/environment.py", line 925, in handle_exception
    raise rewrite_traceback_stack(source=source)
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/templates/log_index.html", line 42, in top-level template code
    <button type="button" class="btn btn-primary copy-btn" data-clipboard-action="copy" data-clipboard-target="#read_more_txt_{{ index }}">
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/templates/base.html", line 224, in top-level template code
    {% block content %}{% endblock %}
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/templates/log_index.html", line 142, in block 'content'
    <li><a href="{{ url_for_prev_page(page) }}">Newer</a></li>
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/application.py", line 121, in url_for_prev_page
    args = {k: v for k, v in request.values}
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/application.py", line 121, in <dictcomp>
    args = {k: v for k, v in request.values}
ValueError: too many values to unpack (expected 2)
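
The `ValueError` in the last frame is what Python raises when a comprehension iterates a mapping directly and tries to unpack each key into two names. The sketch below reproduces that with a plain `dict` standing in for `request.values`. That `request.values` yields only keys when iterated in the installed version is an assumption consistent with the error, not something confirmed in this thread, and the function names are invented for illustration.

```python
from typing import Dict, Mapping


def copy_args_buggy(values: Mapping[str, str]) -> Dict[str, str]:
    """Same shape as the failing line: iterating a mapping yields keys only."""
    # Each key string is unpacked character by character into (k, v).
    return {k: v for k, v in values}


def copy_args_fixed(values: Mapping[str, str]) -> Dict[str, str]:
    """Iterate key/value pairs explicitly."""
    return {k: v for k, v in values.items()}


if __name__ == "__main__":
    query = {"page": "2"}  # stand-in for the request parameters
    try:
        copy_args_buggy(query)
    except ValueError as exc:
        print("Buggy version:", exc)
    print("Fixed version:", copy_args_fixed(query))
```

Note that the buggy form does not always fail loudly: a key that happens to be exactly two characters long, such as `id`, is silently split into `i` and `d`.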

By Devrim Yatar staff 03 Mar 2022 at 2:22 a.m. CST

Hi,

> There seems to be an issue with the GLUU install, whenever the container restarts it seems to shut off all processes and these do not come back up.

Yes, it restarts the container to load the new configuration. Cluster Manager makes some configuration changes on the server, and there is no backup. Since it installs a new server, there is no need to take a backup. You got this error:

```
    if not ldapc.connect():
  File "/usr/local/lib/python3.8/dist-packages/clustermgr/core/ldap_functions.py", line 59, in connect
    return self.conn.bind()
  File "/usr/local/lib/python3.8/dist-packages/ldap3/core/connection.py", line 589, in bind
    self.open(read_server_info=False)
  File "/usr/local/lib/python3.8/dist-packages/ldap3/strategy/sync.py", line 57, in open
    BaseStrategy.open(self, reset_usage, read_server_info)
  File "/usr/local/lib/python3.8/dist-packages/ldap3/strategy/base.py", line 146, in open
    raise exception_history[0][0]
ldap3.core.exceptions.LDAPSocketOpenError: socket connection error while opening: [Errno 110] Connection timed out
```

Cluster Manager can't make an LDAP connection to the server on port 1636. This may result from:

1. The firewall does not allow connections on port 1636
2. The server is not installed

Can you check whether OpenDJ is running properly and listening on port 1636 on all interfaces (`netstat -plunt | grep 1636`)? If so, fix the firewall.
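
The two LDAP failures in this thread carry different error numbers, and on Linux they point in different directions: `[Errno 111] Connection refused` usually means the host was reached but nothing was listening on the port, while `[Errno 110] Connection timed out` usually means the packets got no answer at all, which is typical of a firewall drop or an unreachable host. A minimal sketch for telling them apart from the Cluster Manager machine follows; `diagnose_tcp` is a hypothetical helper and the hostname in the demo is a placeholder.

```python
import errno
import socket


def diagnose_tcp(host: str, port: int, timeout: float = 3.0) -> str:
    """Try a plain TCP connection and classify the outcome."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return "open"
    except ConnectionRefusedError:
        # Host reachable, but no process accepts connections on this port.
        return "refused"
    except socket.timeout:
        # No reply within `timeout` seconds.
        return "timed out"
    except OSError as exc:
        if exc.errno == errno.ETIMEDOUT:
            return "timed out"
        return f"error: {exc}"


if __name__ == "__main__":
    # Placeholder hostname; 1636 is the LDAPS port discussed above.
    print(diagnose_tcp("gluu-node.example.org", 1636))
```

This only tests TCP reachability. An "open" result does not prove that the LDAPS bind itself would succeed.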

By UKFast Support user 03 Mar 2022 at 3:05 a.m. CST

Hi There, I checked this morning: the LDAP connection is open and it is listening on port 1636, and LDAP is also showing green in Cluster Manager. However, the issues are the following:

1) It changes the hostname of the server to the hostname of the load balancer, and I cannot see why it is doing this.
2) When the Gluu server reboots, Casa, oxd, oxAuth and Passport all go down and will not come back up, even when trying to start the services manually.

By Devrim Yatar staff 07 Mar 2022 at 2:50 a.m. CST

Hi, The hostname of all nodes in Cluster Manager should be the hostname of the load balancer. This is how a Gluu cluster works. Did you install the second node and establish replication?

By UKFast Support user 07 Mar 2022 at 3:30 a.m. CST

Hi Mustafa, This seems to cause issues getting to the Gluu host. I have managed to get Cluster Manager installed and get replication working, however the second host in the cluster seems to keep getting a 404 error. Kind regards, UKFast Support

By Mohib Zico staff 07 Mar 2022 at 7:03 a.m. CST

Hello UKFast Support,

>> however the second host in the cluster seems to keep getting a 404 error

Are you getting the error from oxAuth or Identity? (Hint: if we look at the browser URL, what do you see?)

By Devrim Yatar staff 07 Mar 2022 at 9:18 a.m. CST

Hi, I think it is better to have a demo. Please schedule a call at https://gluu.org/book-support so that we can demonstrate setting up a cluster with Cluster Manager. Regards.

By UKFast Support user 08 Mar 2022 at 5:36 a.m. CST

Hi Mustafa, I have booked this in for Thursday at 15:00 GMT. Kind regards, UKFast Support

By Devrim Yatar staff 08 Mar 2022 at 11:32 a.m. CST

Hi, Zico and I will be in the meeting. Regards

By Mohib Zico staff 13 Mar 2022 at 11:10 p.m. CDT

Hello Clifton, Do you have a Gmail address? I would like to share the meeting recording privately on YouTube.

By UKFast Support user 14 Mar 2022 at 5:52 a.m. CDT

Hi Mohib, I apologise, I do not; however, you can send this to my work email, clifton.meldrum@ans.co.uk. It looks like the services on Gluu, for example Identity, stop and cannot be restarted, even manually.

By UKFast Support user 18 Mar 2022 at 2:29 a.m. CDT

Hi Team, Is it possible to get an update on the above? Casa is installed; however, it refuses to work in the browser. I have restarted the service, with no result. Any assistance would be amazing. Kind regards, Clifton Meldrum

By Mohib Zico staff 18 Mar 2022 at 2:58 a.m. CDT

Hello Clifton,

>> Casa is installed however it refuses to work on the browser, i have restarted the service for this to no results.

Did you install it with Cluster Manager as well?

By Devrim Yatar staff 18 Mar 2022 at 3:38 a.m. CDT

Hi Clifton, I made some fixes for oxd and Casa. Did you install the latest version of Cluster Manager? Regards.

By UKFast Support user 18 Mar 2022 at 4:23 a.m. CDT

Hi Mustafa, Thanks for getting in touch. I did install oxd and Casa using Cluster Manager. I installed Cluster Manager last week. Can you confirm when the fixes were released? Kind regards, Clifton Meldrum

By Devrim Yatar staff 18 Mar 2022 at 12:46 p.m. CDT

Hi, The fix was made 4 days ago; this is the commit: https://github.com/GluuFederation/cluster-mgr/commit/c8f68bcbbbe3475aa4e8fb56cffe795372a2dcb8 You should have **4.3-post2**. Regards.

By UKFast Support user 21 Mar 2022 at 7:35 a.m. CDT

Hi Mustafa, Thanks for confirming. This seems to have resolved the Casa issue; however, LDAP keeps failing. It works initially when the container is booted up, but then fails. Do you have any information that may help with this? Kind regards, Clifton Meldrum

By Devrim Yatar staff 21 Mar 2022 at 9:25 a.m. CDT

Hi, Which node, primary or secondary? Please send the log files located under `/opt/opendj/logs`. Regards.

By UKFast Support user 22 Mar 2022 at 4:48 a.m. CDT

Hi Mustafa, Please find the logs below:

```
[21/Mar/2022:10:37:46 +0000] category=PROTOCOL severity=NOTICE msgID=276 msg=Started listening for new connections on Administration Connector 0.0.0.0 port 4444
[21/Mar/2022:10:37:46 +0000] category=PROTOCOL severity=NOTICE msgID=276 msg=Started listening for new connections on LDAPS Connection Handler 0.0.0.0 port 1636
[21/Mar/2022:11:19:39 +0000] category=JVM severity=NOTICE msgID=21 msg=Installation Directory: /opt/opendj
[21/Mar/2022:11:19:39 +0000] category=JVM severity=NOTICE msgID=23 msg=Instance Directory: /opt/opendj
[21/Mar/2022:11:19:39 +0000] category=JVM severity=NOTICE msgID=17 msg=JVM Information: 11.0.13+8-LTS by Amazon.com Inc., 64-bit architecture, 2174746624 bytes heap size
[21/Mar/2022:11:19:39 +0000] category=JVM severity=NOTICE msgID=18 msg=JVM Host: uat-rdl2.resilience.gov.uk, running Linux 5.4.0-100-generic amd64, 4090540032 bytes physical memory size, number of processors available 2
[21/Mar/2022:11:19:39 +0000] category=JVM severity=NOTICE msgID=19 msg=JVM Arguments: "-Xms512m", "-Xmx2074m", "-XX:+UseCompressedOops", "-Dorg.opends.server.scriptName=start-ds"
[21/Mar/2022:11:19:40 +0000] category=BACKEND severity=NOTICE msgID=513 msg=The database backend site containing 2 entries has started
[21/Mar/2022:11:19:40 +0000] category=BACKEND severity=NOTICE msgID=513 msg=The database backend metric containing 6 entries has started
[21/Mar/2022:11:19:41 +0000] category=BACKEND severity=NOTICE msgID=513 msg=The database backend userRoot containing 187 entries has started
[21/Mar/2022:11:19:41 +0000] category=EXTENSIONS severity=NOTICE msgID=221 msg=DIGEST-MD5 SASL mechanism using a server fully qualified domain name of: localhost
[21/Mar/2022:11:19:41 +0000] category=CORE severity=NOTICE msgID=135 msg=The Directory Server has started successfully
[21/Mar/2022:11:19:41 +0000] category=CORE severity=NOTICE msgID=139 msg=The Directory Server has sent an alert notification generated by class org.opends.server.core.DirectoryServer (alert type org.opends.server.DirectoryServerStarted, alert ID org.opends.messages.core-135): The Directory Server has started successfully
[21/Mar/2022:11:19:41 +0000] category=PROTOCOL severity=NOTICE msgID=276 msg=Started listening for new connections on Administration Connector 0.0.0.0 port 4444
[21/Mar/2022:11:19:41 +0000] category=PROTOCOL severity=NOTICE msgID=276 msg=Started listening for new connections on LDAPS Connection Handler 0.0.0.0 port 1636
```
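
Log lines in this shape are regular enough to scan with a short script, for example to list the severities that appear or the ports the server reports listening on. The sketch below assumes only the line layout visible in the excerpt above (`[timestamp] category=... severity=... msgID=... msg=...`); the function names are invented, and the log file name is not given in this thread, so the script reads from standard input.

```python
import re
import sys
from typing import Dict, Iterable, List, Optional

# Layout taken from the excerpt above; other OpenDJ log formats may differ.
LINE_RE = re.compile(
    r"^\[(?P<timestamp>[^\]]+)\]\s+"
    r"category=(?P<category>\S+)\s+"
    r"severity=(?P<severity>\S+)\s+"
    r"msgID=(?P<msg_id>\S+)\s+"
    r"msg=(?P<message>.*)$"
)
PORT_RE = re.compile(r"Started listening for new connections on .* port (\d+)$")


def parse_line(line: str) -> Optional[Dict[str, str]]:
    """Split one log line into its fields, or return None if it does not match."""
    match = LINE_RE.match(line.strip())
    return match.groupdict() if match else None


def listening_ports(lines: Iterable[str]) -> List[int]:
    """Collect the ports from 'Started listening' messages, in order of appearance."""
    ports = []
    for line in lines:
        entry = parse_line(line)
        if entry is None:
            continue
        found = PORT_RE.search(entry["message"])
        if found:
            ports.append(int(found.group(1)))
    return ports


if __name__ == "__main__":
    # Usage: python3 this_script.py < some-opendj-log-file
    entries = [e for e in map(parse_line, sys.stdin) if e is not None]
    print("Severities seen:", sorted({e["severity"] for e in entries}))
    print("Listening ports:", listening_ports(
        f"[{e['timestamp']}] category={e['category']} severity={e['severity']} "
        f"msgID={e['msg_id']} msg={e['message']}" for e in entries
    ))
```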

By Mohib Zico staff 22 Mar 2022 at 10:37 a.m. CDT

I am re-assigning this to @Mobarak Hosen.Shakil to create a testbed and do a quick QA on the latest code Mustafa provided.

By Devrim Yatar staff 23 Mar 2022 at 4:56 a.m. CDT

Hi, I looked at the OpenDJ logs and they look good. It is running properly. Regards.

By UKFast Support user 25 Mar 2022 at 6:15 a.m. CDT

Hi There, Thanks for confirming. I have solved the issue by forcing a restart of OpenDJ. It seems that after a system reboot OpenDJ starts but then errors; forcing a restart of OpenDJ brought the service back into working order. Thank you so much for your help on this. I will now close the ticket. Kind regards, Clifton Meldrum