TWSD-736 OS10.4.2.128 System crash

Review Request #924 — Created June 18, 2025 and submitted

jasonchang
APV10
rel_apv_10_7
mingji, shuinvy, timlai, timsu

The APV crashes with a segmentation fault when handling an excessive volume of TLSv1.3 traffic.

Root Cause:
Based on analysis of the previous core dump, the issue stems from a NULL pointer access to rp in ./usr/src/sys/click/app/ssl/tlsv13_server.c:1873. This results in a segmentation fault, ultimately causing the APV system to crash.

Fix Details:
To address this, a fail-safe check has been implemented: if rp is NULL, the function returns ACTION_ERROR, which subsequently triggers ssl_error() to reset the connection gracefully.

Result:
After applying the fail-safe mechanism to tlsv13_state_send_helloretryrequest(), a test was conducted using the same topology—approximately 30,000 RPS sustained for 100 seconds. The APV ran without any crashes or failures during the test.

timlai
  1. Ship It!
  2. 
      
mingji
  1. Ship It!
  2. 
      
shuinvy
  1. Ship It!
  2. 
      
jasonchang
Review request changed

Status: Closed (submitted)

Loading...