
Commit fbbec70

Merge remote-tracking branch 'upstream/master' into dot-issue
2 parents c46efbb + 06d048d commit fbbec70

437 files changed

Lines changed: 21015 additions & 11893 deletions


.claude/instructions.md

Lines changed: 50 additions & 0 deletions
New file (50 lines added):

# ClickHouse Development Instructions

## Running Stateless Tests

Stateless tests are located in `tests/queries/0_stateless/`.

### Prerequisites

1. Build ClickHouse: `cd build && ninja clickhouse-server`
2. Start the server: `./build/programs/clickhouse server --config-file ./programs/server/config.xml`
3. Wait for server to be ready: `./build/programs/clickhouse client -q "SELECT 1"`

### Running Tests

Run tests with the correct port environment variables (default config uses TCP=9000, HTTP=8123):

```bash
CLICKHOUSE_PORT_TCP=9000 CLICKHOUSE_PORT_HTTP=8123 ./tests/clickhouse-test <test_name>
```

### Useful Flags

- `--no-random-settings` - Disable settings randomization (useful for deterministic debugging)
- `--no-random-merge-tree-settings` - Disable MergeTree settings randomization
- `--record` - Automatically update `.reference` files when stdout differs

### Test File Extensions

- `.sql` - SQL test (most common)
- `.sql.j2` - Jinja2-templated SQL test
- `.sh` - Shell script test
- `.py` - Python test
- `.expect` - Expect script test
- `.reference` - Expected output (compared against stdout)
- `.gen.reference` - Generated reference for `.j2` tests

### Database Name Normalization

The test runner creates a temporary database with a random name (e.g., `test_abc123`) for each test.
After test execution, the random database name is replaced with `default` in stdout/stderr files before comparison with `.reference`.
This means `.reference` files should use `default` for database names, NOT `${CLICKHOUSE_DATABASE}` or the actual random name.

### Test Tags

Tests can have tags in the first line as a comment: `-- Tags: no-fasttest, no-parallel`
Common tags: `disabled`, `no-fasttest`, `no-parallel`, `no-random-settings`, `no-random-merge-tree-settings`, `long`

### Random Settings Limits

Tests can specify limits for randomized settings: `-- Random settings limits: max_threads=(1, 4); ...`

### Stopping the Server

Find and kill the server process:

```bash
pgrep -f "clickhouse server" # Get PIDs
kill <pid1> <pid2> # Stop processes
```
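The database-name normalization step described above can be sketched as follows. This is a minimal illustration under stated assumptions: the function name and the plain substring replacement are mine, not the actual test-runner implementation.

```python
def normalize_database_name(output: str, random_db: str) -> str:
    """Replace the per-test random database name (e.g. test_abc123)
    with 'default' so that stdout can be compared against .reference."""
    return output.replace(random_db, "default")


stdout = "CREATE TABLE test_abc123.events (x UInt8)"
print(normalize_database_name(stdout, "test_abc123"))
# CREATE TABLE default.events (x UInt8)
```

This is why `.reference` files must contain `default` rather than `${CLICKHOUSE_DATABASE}`: the comparison happens after the substitution.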

.github/copilot-instructions.md

Lines changed: 46 additions & 0 deletions
@@ -211,3 +211,49 @@ STYLE & CONDUCT
 - Avoid changing scope: review what’s in the PR; suggest follow-ups separately.
 - If you are not reasonably confident a finding is a real issue or meaningful risk, **do not mention it**.
 - When performing a code review, **ignore `/.github/workflows/*` files**.
+
+RUNNING STATELESS TESTS
+
+Stateless tests are located in `tests/queries/0_stateless/`.
+
+**Prerequisites:**
+1. Build ClickHouse: `cd build && ninja clickhouse-server`
+2. Start the server: `./build/programs/clickhouse server --config-file ./programs/server/config.xml`
+3. Wait for server to be ready: `./build/programs/clickhouse client -q "SELECT 1"`
+
+**Running tests** (default config uses TCP=9000, HTTP=8123):
+```bash
+CLICKHOUSE_PORT_TCP=9000 CLICKHOUSE_PORT_HTTP=8123 ./tests/clickhouse-test <test_name>
+```
+
+**Useful flags:**
+- `--no-random-settings` - Disable settings randomization (useful for deterministic debugging)
+- `--no-random-merge-tree-settings` - Disable MergeTree settings randomization
+- `--record` - Automatically update `.reference` files when stdout differs
+
+**Test file extensions:**
+- `.sql` - SQL test (most common)
+- `.sql.j2` - Jinja2-templated SQL test
+- `.sh` - Shell script test
+- `.py` - Python test
+- `.expect` - Expect script test
+- `.reference` - Expected output (compared against stdout)
+- `.gen.reference` - Generated reference for `.j2` tests
+
+**Database name normalization:**
+The test runner creates a temporary database with a random name (e.g., `test_abc123`) for each test.
+After test execution, the random database name is replaced with `default` in stdout/stderr files before comparison with `.reference`.
+This means `.reference` files should use `default` for database names, NOT `${CLICKHOUSE_DATABASE}` or the actual random name.
+
+**Test tags:**
+Tests can have tags in the first line as a comment: `-- Tags: no-fasttest, no-parallel`
+Common tags: `disabled`, `no-fasttest`, `no-parallel`, `no-random-settings`, `no-random-merge-tree-settings`, `long`
+
+**Random settings limits:**
+Tests can specify limits for randomized settings: `-- Random settings limits: max_threads=(1, 4); ...`
+
+**Stopping the server:**
+```bash
+pgrep -f "clickhouse server" # Get PIDs
+kill <pid1> <pid2> # Stop processes
+```

base/poco/Net/src/TCPServerDispatcher.cpp

Lines changed: 2 additions & 2 deletions
@@ -111,8 +111,8 @@ void TCPServerDispatcher::run()
         if (!_stopped)
         {
             std::unique_ptr<TCPServerConnection> pConnection(_pConnectionFactory->createConnection(pCNf->socket()));
-            poco_check_ptr(pConnection.get());
-            pConnection->start();
+            if (pConnection)
+                pConnection->start();
         }
         /// endConnection() should be called after destroying TCPServerConnection,
         /// otherwise currentConnections() could become zero while some connections are yet still alive.

ci/jobs/functional_tests.py

Lines changed: 30 additions & 20 deletions
@@ -284,7 +284,9 @@ def main():
        stages.remove(JobStages.COLLECT_COVERAGE)
    else:
        stages.remove(JobStages.COLLECT_LOGS)
-    if is_coverage or info.is_local_run:
+    if is_coverage or info.is_local_run or is_bugfix_validation:
+        # For bugfix validation, we intentionally skip the check error stage (checks FATAL messages):
+        # regular test failures are assumed to be sufficient to validate the test
        stages.remove(JobStages.CHECK_ERRORS)
    if info.is_local_run:
        if JobStages.COLLECT_LOGS in stages:
@@ -542,7 +544,11 @@ def start():

    if JobStages.RETRIES in stages and test_result and test_result.is_failure():
        # retry all failed tests and mark original failed either as success on retry or failed on retry
-        failed_tests = [t.name for t in test_result.results if t.is_failure()]
+        failed_tests = [
+            t.name
+            for t in test_result.results
+            if t.is_failure() and t.name and t.name[0].isdigit()
+        ]
        if len(failed_tests) > 10:
            results.append(
                Result(
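The tightened retry filter above keeps only entries whose names look like real stateless tests, which start with a digit (e.g. `00001_select_1`), and skips synthetic entries. A minimal sketch with a hypothetical stand-in result type (`FakeResult` is mine, not the CI `Result` class):

```python
from dataclasses import dataclass


@dataclass
class FakeResult:
    """Hypothetical stand-in for a CI test-result entry."""
    name: str
    failed: bool

    def is_failure(self) -> bool:
        return self.failed


results = [
    FakeResult("00001_select_1", True),     # real test name -> retried
    FakeResult("Check errors", True),       # helper step -> not retried
    FakeResult("02500_json_parse", False),  # passed -> not retried
]

failed_tests = [
    r.name for r in results if r.is_failure() and r.name and r.name[0].isdigit()
]
print(failed_tests)
# ['00001_select_1']
```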
@@ -620,24 +626,28 @@ def start():
            test_result.extend_sub_results(results[-1].results)
            results[-1].results = []

-        # invert result status for bugfix validation
-        if is_bugfix_validation:
-            has_failure = False
-            for r in results[-1].results:
-                r.set_label("xfail")
-                if r.status == Result.StatusExtended.FAIL:
-                    r.status = Result.StatusExtended.OK
-                    has_failure = True
-                elif r.status == Result.StatusExtended.OK:
-                    r.status = Result.StatusExtended.FAIL
-            if not has_failure:
-                print("Failed to reproduce the bug")
-                results[-1].set_failed().set_info("Failed to reproduce the bug")
-            else:
-                results[-1].set_success()
-
-        if not results[-1].is_ok():
-            results[-1].set_info("Found errors added into Tests results")
+        # invert result status for bugfix validation
+        if is_bugfix_validation and test_result:
+            has_failure = False
+            for r in test_result.results:
+                r.set_label("xfail")
+                if r.status == Result.StatusExtended.FAIL:
+                    r.status = Result.StatusExtended.OK
+                    has_failure = True
+                elif r.status == Result.StatusExtended.OK:
+                    r.status = Result.StatusExtended.FAIL
+            if not has_failure:
+                print("Failed to reproduce the bug")
+                test_result.set_failed().set_info("Failed to reproduce the bug")
+            else:
+                # For bugfix validation, the expected behavior is:
+                # - At least one test must fail (bug reproduced)
+                # - The overall Tests result is treated as success in that case
+                test_result.set_success()
+
+            # For bugfix validation, "Check errors" (latest in the list) is only a helper step and
+            # must not affect the overall job result.
+            results[-1].set_success()

    if JobStages.COLLECT_LOGS in stages:
        print("Collect logs")
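The status-inversion logic for bugfix validation can be modeled in isolation. This sketch uses plain strings instead of `Result.StatusExtended` (an assumption for illustration, not the CI code):

```python
OK, FAIL = "OK", "FAIL"


def invert_for_bugfix_validation(statuses):
    """A FAIL proves the bug was reproduced, so it becomes OK; an
    unexpected OK becomes FAIL. Returns the inverted statuses and
    whether at least one original failure (a reproduction) was seen."""
    inverted = [OK if s == FAIL else FAIL if s == OK else s for s in statuses]
    reproduced = FAIL in statuses
    return inverted, reproduced


inverted, reproduced = invert_for_bugfix_validation([OK, FAIL, OK])
print(inverted, reproduced)
# ['FAIL', 'OK', 'FAIL'] True
```

If no test fails, `reproduced` is false and the job reports "Failed to reproduce the bug", matching the diff above.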

ci/jobs/integration_test_job.py

Lines changed: 1 addition & 1 deletion
@@ -445,7 +445,7 @@ def main():
            has_error = True
            error_info.append(test_result_sequential.info)

-    # Collect logs before rerun
+    # Collect logs before re-run
    attached_files = []
    if not info.is_local_run:
        failed_suits = []

ci/jobs/scripts/workflow_hooks/filter_job.py

Lines changed: 1 addition & 0 deletions
@@ -56,6 +56,7 @@ def should_skip_job(job_name):
    global _info_cache
    if _info_cache is None:
        _info_cache = Info()
+        print(f"INFO: PR labels: {_info_cache.pr_labels}")

    changed_files = _info_cache.get_kv_data("changed_files")
    if not changed_files:
Lines changed: 18 additions & 0 deletions
New file (18 lines added):

import re
from ci.praktika.info import Info

if __name__ == "__main__":
    info = Info()
    if info.pr_number == 0:
        # Extract original PR number from backport merge commits
        # Example: "Merge pull request #92596 from ClickHouse/backport/25.12/92538" -> extract 92538
        commit_message = info.commit_message
        match = re.search(r"backport/[^/]+/(\d+)", commit_message)
        if match:
            try:
                pr_number = int(match.group(1))
                info.set_parent_pr_number(pr_number)
            except ValueError as e:
                print(
                    f"Failed to get PR number from commit message [{commit_message}]: {e}"
                )
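The regular expression in the script above can be exercised directly; the example commit message is the one given in the script's own comment:

```python
import re

# The backport branch pattern from the new CI helper script
pattern = re.compile(r"backport/[^/]+/(\d+)")

msg = "Merge pull request #92596 from ClickHouse/backport/25.12/92538"
match = pattern.search(msg)
print(match.group(1) if match else None)
# 92538

# A non-backport merge commit yields no match, so no parent PR number is set
assert pattern.search("Merge branch 'master' into dot-issue") is None
```

`[^/]+` absorbs the release branch component (`25.12` here), and the capture group takes the trailing digits, the original PR number.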

ci/praktika/_environment.py

Lines changed: 20 additions & 16 deletions
@@ -43,7 +43,6 @@ class _Environment(MetaClasses.Serializable):
    JOB_CONFIG: Optional[Job.Config] = None
    TRACEBACKS: List[str] = dataclasses.field(default_factory=list)
    WORKFLOW_JOB_DATA: Dict[str, Any] = dataclasses.field(default_factory=dict)
-    WORKFLOW_STATUS_DATA: Dict[str, Any] = dataclasses.field(default_factory=dict)
    JOB_KV_DATA: Dict[str, Any] = dataclasses.field(default_factory=dict)
    COMMIT_AUTHORS: List[str] = dataclasses.field(default_factory=list)
    WORKFLOW_CONFIG: Optional[Dict[str, Any]] = None
@@ -72,17 +71,14 @@ def from_env(cls) -> "_Environment":
        EVENT_TIME = ""
        COMMIT_MESSAGE = ""

-        assert Path(
-            Settings.WORKFLOW_JOB_FILE
-        ).is_file(), f"File not found: {Settings.WORKFLOW_JOB_FILE}"
-        with open(Settings.WORKFLOW_JOB_FILE, "r", encoding="utf8") as f:
-            WORKFLOW_JOB_DATA = json.load(f)
-
-        assert Path(
-            Settings.WORKFLOW_STATUS_FILE
-        ).is_file(), f"File not found: {Settings.WORKFLOW_STATUS_FILE}"
-        with open(Settings.WORKFLOW_STATUS_FILE, "r", encoding="utf8") as f:
-            WORKFLOW_STATUS_DATA = json.load(f)
+        if Path(Settings.WORKFLOW_JOB_FILE).is_file():
+            with open(Settings.WORKFLOW_JOB_FILE, "r", encoding="utf8") as f:
+                WORKFLOW_JOB_DATA = json.load(f)
+        else:
+            print(
+                f"NOTE: Workflow job file [{Settings.WORKFLOW_JOB_FILE}] does not exist"
+            )
+            WORKFLOW_JOB_DATA = {}

        if EVENT_FILE_PATH:
            with open(EVENT_FILE_PATH, "r", encoding="utf-8") as f:
@@ -238,7 +234,6 @@ def from_env(cls) -> "_Environment":
                "parent_pr_number": LINKED_PR_NUMBER,
            },
            WORKFLOW_JOB_DATA=WORKFLOW_JOB_DATA,
-            WORKFLOW_STATUS_DATA=WORKFLOW_STATUS_DATA,
            WORKFLOW_CONFIG=None,
        )

@@ -281,9 +276,18 @@ def get(cls):
        if Path(cls.file_name_static()).is_file():
            return cls.from_fs("environment")
        else:
-            env = cls.from_workflow_data()
-            env.dump()
-            return env
+            try:
+                env = cls.from_workflow_data()
+                env.dump()
+                return env
+            except FileNotFoundError as e:
+                # For workflows without Config job
+                print(
+                    f"NOTE: Workflow context file [{Settings.WORKFLOW_STATUS_FILE}] does not exist - read context from GH event"
+                )
+                env = cls.from_env()
+                env.dump()
+                return env

    def set_job_name(self, job_name):
        self.JOB_NAME = job_name
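The change above replaces hard `assert ... is_file()` checks with a lenient fallback. The same pattern in a self-contained sketch (the function name is mine, not the praktika API):

```python
import json
from pathlib import Path


def load_workflow_job_data(path: str) -> dict:
    """Return the parsed JSON if the file exists, otherwise an empty
    dict - mirroring the change from an assertion to a guarded read."""
    p = Path(path)
    if p.is_file():
        with open(p, "r", encoding="utf8") as f:
            return json.load(f)
    print(f"NOTE: Workflow job file [{path}] does not exist")
    return {}


print(load_workflow_job_data("/nonexistent/workflow_job.json"))
# NOTE: Workflow job file [/nonexistent/workflow_job.json] does not exist
# {}
```

Downstream code must then tolerate an empty dict instead of assuming the key is present, which is exactly what the `get_job_url` guard in `ci/praktika/info.py` below does.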

ci/praktika/info.py

Lines changed: 2 additions & 0 deletions
@@ -163,6 +163,8 @@ def get_secret(self, name):
        return self.workflow.get_secret(name)

    def get_job_url(self):
+        if not self.env.WORKFLOW_JOB_DATA:
+            return ""
        return f"{self.env.RUN_URL}/job/{self.env.WORKFLOW_JOB_DATA['check_run_id']}"

    def get_job_report_url(self, latest=False):
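The guard can be illustrated standalone; this is a hypothetical free-function version of the method, with a made-up URL:

```python
def get_job_url(run_url: str, workflow_job_data: dict) -> str:
    """Without workflow job data there is no check_run_id, so return
    an empty URL instead of raising KeyError on the missing key."""
    if not workflow_job_data:
        return ""
    return f"{run_url}/job/{workflow_job_data['check_run_id']}"


print(get_job_url("https://example.com/run/1", {"check_run_id": 42}))
# https://example.com/run/1/job/42
```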

ci/praktika/native_jobs.py

Lines changed: 19 additions & 15 deletions
@@ -304,6 +304,25 @@ def _check_db(workflow):
        )
    env.dump()

+    _GH_Auth(workflow)
+
+    # refresh PR data
+    if env.PR_NUMBER > 0:
+        title, body, labels = GH.get_pr_title_body_labels()
+        print(f"NOTE: PR title: {title}")
+        print(f"NOTE: PR labels: {labels}")
+        if title:
+            if title != env.PR_TITLE:
+                print("PR title has been changed")
+                env.PR_TITLE = title
+            if env.PR_BODY != body:
+                print("PR body has been changed")
+                env.PR_BODY = body
+            if env.PR_LABELS != labels:
+                print("PR labels have been changed")
+                env.PR_LABELS = labels
+            env.dump()
+
    if workflow.enable_report:
        print("Push pending CI report")
        HtmlRunnerHooks.push_pending_ci_report(workflow)
@@ -511,21 +530,6 @@ def check_affected_jobs():
            )
        )

-    # refresh PR data
-    if env.PR_NUMBER > 0:
-        title, body, labels = GH.get_pr_title_body_labels()
-        if title:
-            if title != env.PR_TITLE:
-                print("PR title has been changed")
-                env.PR_TITLE = title
-            if env.PR_BODY != body:
-                print("PR body has been changed")
-                env.PR_BODY = body
-            if env.PR_LABELS != labels:
-                print("PR labels have been changed")
-                env.PR_LABELS = labels
-            env.dump()
-
    if workflow.enable_slack_feed:
        if env.PR_NUMBER:
            commit_authors = set()
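The relocated refresh block compares cached PR metadata against the values just fetched from GitHub and overwrites what changed. A hypothetical sketch using a plain dict in place of the env object (names are mine, for illustration only):

```python
def refresh_pr_metadata(env: dict, title: str, body: str, labels: list) -> list:
    """Overwrite cached PR metadata with freshly fetched values and
    report which fields changed. An empty title is treated as a failed
    fetch, so the cached data is kept untouched."""
    changed = []
    if title:
        if title != env["PR_TITLE"]:
            env["PR_TITLE"] = title
            changed.append("title")
        if body != env["PR_BODY"]:
            env["PR_BODY"] = body
            changed.append("body")
        if labels != env["PR_LABELS"]:
            env["PR_LABELS"] = labels
            changed.append("labels")
    return changed


env = {"PR_TITLE": "old", "PR_BODY": "b", "PR_LABELS": ["ci"]}
print(refresh_pr_metadata(env, "new title", "b", ["ci", "backport"]))
# ['title', 'labels']
```

Moving this refresh into `_check_db` (run earlier in the workflow) means later jobs see up-to-date PR labels, which is what the label-based job filtering relies on.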
