[RN] Understanding ONNX(Open Neural Network Exchange) -5: Fixing a Memory Leak

This post shares what I tried in order to resolve a memory leak that appeared while using ONNX in a React Native environment.

💡 [Note] If you want to know more about ONNX, the earlier posts in this series may help.

Topic	Link
Understanding ONNX(Open Neural Network Exchange) -1: Using It with React Native	https://adjh54.tistory.com/203
Understanding ONNX(Open Neural Network Exchange) -2: Loading an ONNX Model	https://adjh54.tistory.com/205
Understanding ONNX(Open Neural Network Exchange) -3: ONNX Model Summary	https://adjh54.tistory.com/309
Understanding ONNX(Open Neural Network Exchange) -4: Things to Watch Out for When Deploying the App	https://adjh54.tistory.com/327

1) ONNX(Open Neural Network Exchange)

💡 ONNX(Open Neural Network Exchange)

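- An open-source project that lets deep learning models be moved between different frameworks. ONNX converts a model into an intermediate representation format and provides the libraries needed to run it.
- In short, it lets you call and run an ML model from a variety of platform environments (Java, JS, C, C#, C++), regardless of the environment the model was built in, and get the results back.

💡 What is ONNX Runtime?

- It is the engine that runs ONNX models. ONNX Runtime uses optimized kernels for fast inference.
- ONNX Runtime also supports CPUs, GPUs, and deep learning accelerators (DNNL, NNAPI, OpenVINO), so it provides an optimized runtime environment for running ONNX models.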

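2) ONNX(Open Neural Network Exchange) Memory Leak: Checking the Symptoms

1. Checking the symptoms

💡 Checking the symptoms

- I confirmed that memory leaks even from simply running the ONNX models themselves.

- As shown below, I only had the usual flow in mind: reference the model file > load the model > store it in state > run the model. This is the typical flow, and it is also the one suggested on the official site.

- However, when this flow ran for a long period of time, I confirmed that memory was leaking.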

// Reference the model file
const HS_EMOTION_MODEL = require("../../../assets/models/hsemotion_q.ort");
// Load the model file
let _hemotionModel: InferenceSession | null = null;
const _hemotionAssets = await Asset.loadAsync(HS_EMOTION_MODEL);
const _hemotionOnnxModelUri: string | null = _hemotionAssets[0].localUri;
if (_hemotionOnnxModelUri !== null) {
  await InferenceSession.create(_hemotionOnnxModelUri)
    .then((_loadSession: InferenceSession | null) => {
      console.log("[+] Load Hsemotion Model....");
      _hemotionModel = _loadSession;
      _loadSession = null;
    })
    .catch((error) => {
      console.error(`[-] hsemotion Load Error: ${error}`);
    });
}
// Store the models in state (so they can be reused)
setInitModel({
  isLoading: true,
  isTensorReady: _tensorReady,
  faceMeshModel: _faceMeshModel,
  hsemotionModel: _hemotionModel,
  fsanetModel: _fsanetModel,
  hPoseModel: _hposeModel,
});
// Run the model
await initModel.hPoseModel
  .run(feed, initModel.hPoseModel.outputNames)
  .then((fetches: InferenceSession.OnnxValueMapType | null) => {
    if (fetches) {
      // `result` is derived from `fetches`; the derivation is omitted in this excerpt
      resultScore = result;
    }
  })
  .catch((err) => {
    console.error(`Error in _hposeModel.run(): ${err}`);
  });

💡 Memory grew whenever ONNX run() was executed repeatedly, and when the ONNX models were run over and over as shown below, memory usage rose sharply.

Point in time	Memory usage (cumulative)	Change
At app launch	550MB
Learning session started and models loaded	1.1GB
After 10 min	1.4GB	300MB
After 20 min	2GB	500MB
After 30 min	2.6GB	600MB
After 40 min	3.2GB	600MB
After 60 min
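
As a rough cross-check of the table above, the sketch below recomputes the growth between readings and the average growth per minute. It assumes 1GB = 1000MB and uses only the rows from model load onward that have a reading. The names `Sample`, `deltas`, and `growthPerMinute` are made up for this illustration. Because the table's figures are rounded, the recomputed differences need not match its Change column exactly.

```typescript
// Cumulative readings from the table, in MB (assuming 1GB = 1000MB).
// The 60-minute row has no reading in the table, so it is left out.
interface Sample {
  minute: number; // minutes since the models were loaded
  usedMb: number; // cumulative memory usage
}

const samples: Sample[] = [
  { minute: 0, usedMb: 1100 },
  { minute: 10, usedMb: 1400 },
  { minute: 20, usedMb: 2000 },
  { minute: 30, usedMb: 2600 },
  { minute: 40, usedMb: 3200 },
];

// Growth between each pair of consecutive readings.
function deltas(data: Sample[]): number[] {
  return data.slice(1).map((s, i) => s.usedMb - data[i].usedMb);
}

// Average growth per minute between the first and the last reading.
function growthPerMinute(data: Sample[]): number {
  const first = data[0];
  const last = data[data.length - 1];
  return (last.usedMb - first.usedMb) / (last.minute - first.minute);
}

console.log(deltas(samples)); // 300, 600, 600, 600
console.log(growthPerMinute(samples)); // 52.5
```

At roughly 50MB per minute, growth of this kind would plausibly become a problem on a mobile device well within an hour, which is why the leak could not be ignored.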

2. Testing the symptoms

💡 Testing the symptoms

- To check whether running an ONNX model by itself makes memory grow, I forced the model to run three times in a row inside the loop that executes repeatedly.

const _configTensor2 = tf.zeros([1, 64, 64, 3]).dataSync() as Float32Array;

// [STEP8] Run the model on the converted data.
const feed = { "input": new Tensor(_configTensor2, [1, 64, 64, 3]) }

// 1
// Note: run() is invoked twice in each block; the first result is unused.
if (initModel.fsanetModel) {
    const fetches: any = await initModel.fsanetModel.run(feed, initModel.fsanetModel.outputNames)
    await initModel.fsanetModel.run(feed, initModel.fsanetModel.outputNames)
        .then((fetches: any) => {
            console.log("result 1111 :: ", fetches)
        })
        .catch((err) => console.error(`Error in modelCalcHandler.fsanetEstimate(): ${err}`));
} else console.error("[+] FSA-NET model does not exist ")

// 2
if (initModel.fsanetModel) {
    const fetches: any = await initModel.fsanetModel.run(feed, initModel.fsanetModel.outputNames)
    await initModel.fsanetModel.run(feed, initModel.fsanetModel.outputNames)
        .then((fetches: any) => {
            console.log("result 2222 :: ", fetches)
        })
        .catch((err) => console.error(`Error in modelCalcHandler.fsanetEstimate(): ${err}`));
} else console.error("[+] FSA-NET model does not exist ")

// 3
if (initModel.fsanetModel) {
    const fetches: any = await initModel.fsanetModel.run(feed, initModel.fsanetModel.outputNames)
    await initModel.fsanetModel.run(feed, initModel.fsanetModel.outputNames)
        .then((fetches: any) => {
            console.log("result 33333 :: ", fetches)
        })
        .catch((err) => console.error(`Error in modelCalcHandler.fsanetEstimate(): ${err}`));
} else console.error("[+] FSA-NET model does not exist ")
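
The three copy-pasted blocks above could also be written as a loop. The following is only a sketch: `SessionLike`, `runRepeatedly`, and `stubSession` are hypothetical names, with `SessionLike` modelled on the `run(feed, outputNames)` call used above, and the stub stands in for a real model so the snippet runs on its own.

```typescript
// Hypothetical minimal shape of a session, modelled on how run() and
// outputNames are used in the test code above.
interface SessionLike {
  outputNames: readonly string[];
  run(
    feeds: Record<string, unknown>,
    outputs: readonly string[],
  ): Promise<Record<string, unknown>>;
}

// Runs the session `times` times in sequence and returns how many runs succeeded.
async function runRepeatedly(
  session: SessionLike | null,
  feeds: Record<string, unknown>,
  times: number,
): Promise<number> {
  if (!session) {
    console.error("[-] Model is not loaded");
    return 0;
  }
  let succeeded = 0;
  for (let i = 1; i <= times; i++) {
    try {
      const fetches = await session.run(feeds, session.outputNames);
      console.log(`result ${i} ::`, Object.keys(fetches));
      succeeded++;
    } catch (err) {
      console.error(`run() failed on iteration ${i}: ${err}`);
    }
  }
  return succeeded;
}

// Stub session (an assumption) so the sketch runs without a model file.
const stubSession: SessionLike = {
  outputNames: ["output"],
  run: async () => ({ output: new Float32Array(3) }),
};

runRepeatedly(stubSession, { input: new Float32Array(4) }, 3).then((count) =>
  console.log(`successful runs: ${count}`),
);
```

Keeping the run count in one parameter makes it easier to raise the load when looking for memory growth.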

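💡 For every model, memory growth was measured as shown below.

Test case	Memory growth (over time)
Dummy values for pre/post-processing, only FSA_NET_MODEL run	300MB increase (60 min run)
Dummy values for pre/post-processing, only HS_EMOTION_MODEL run	500MB increase (60 min run)
Dummy values for pre/post-processing, only HPOSE_MODEL run	200MB increase (60 min run)
Dummy values for pre/post-processing, only PFLD_MODEL and IRIS_LANDMARK_MODEL run	300MB increase (40 min run)
Dummy values for every step	No change

💡 [Note] To test each model separately as described above, the following arbitrary values were substituted.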

// AModel
const AModel = [[131, 83, 250, 153, 0.9998713731765747]];               // face detection result

// BModel
const BModel = [[219.87143433094025, 124.16538900136948], [213.9596370458603, 134.74061065912247], [210.16076481342316, 145.29385885596275], [207.91317796707153, 156.43775495886803], [208.19643414020538, 167.74818700551987], [213.10527896881104, 178.31780016422272], [223.9328352212906, 187.88467752933502], [239.2291305065155, 195.48890852928162], [257.02195143699646, 200.3173063993454], [273.5507217645645, 202.1791274547577], [289.4326922893524, 200.53489792346954], [305.3161082267761, 196.30744844675064], [318.89204955101013, 189.40269303321838], [329.8438069820404, 180.58710831403732], [340.1512382030487, 170.82163536548615], [349.0013060569763, 160.27033418416977], [354.786860704422, 149.28040277957916], [256.0004470348358, 113.4042843580246], [272.26986265182495, 112.45204101502895], [288.01743960380554, 114.94263865053654], [300.9962019920349, 119.50891670584679], [309.98334562778473, 124.96102982759476], [327.9712972640991, 126.51233154535294], [337.9849818944931, 123.10183063149452], [348.044472694397, 121.04811911284924], [358.06571221351624, 120.64467880129814], [366.04695296287537, 123.1012515425682], [314.23951959609985, 133.42918854951859], [312.88633024692535, 141.0312992632389], [311.53150177001953, 148.52897334098816], [310.1744530200958, 156.16735690832138], [289.7999612092972, 160.99236232042313], [294.89745473861694, 163.1061463356018], [299.63417625427246, 165.03788667917252], [305.0948259830475, 165.69098043441772], [310.5486658811569, 165.7969856262207], [259.760427236557, 127.7814254462719], [271.30396938323975, 122.97273027896881], [284.3070504665375, 124.18803730607033], [293.3167097568512, 133.4372775554657], [282.28687024116516, 134.06236073374748], [269.735830783844, 133.01006495952606], [327.7824697494507, 137.4561319053173], [336.93103766441345, 130.05731788277626], [347.6162431240082, 129.7964451611042], [352.7049300670624, 136.69400984048843], [348.3410505056381, 139.86253821849823], [338.04297721385956, 139.7152991592884], 
[263.7366530895233, 171.6152698993683], [276.40012180805206, 168.72529697418213], [289.2413911819458, 167.70523411035538], [294.5124385356903, 170.3420494198799], [301.1139509677887, 170.48875498771667], [307.2676537036896, 175.34768497943878], [309.7126064300537, 181.3397232890129], [300.34187710285187, 183.8431839942932], [291.2339893579483, 184.29179096221924], [284.8437008857727, 183.27935200929642], [278.5584954023361, 181.47342443466187], [270.68491554260254, 177.64672273397446], [267.43452525138855, 171.97408616542816], [285.67847883701324, 173.08595514297485], [291.6082022190094, 174.7633249759674], [297.28900170326233, 176.07047003507614], [307.51601135730743, 180.38451838493347], [293.7081334590912, 177.66386741399765], [287.8274646997452, 176.34880661964417], [282.1051505804062, 174.49449092149734]]

// CModel
const irisJsonArr = { "leftIrisArr": [34.81892013549805, 34.393226623535156, -4.061603546142578, 40.069766998291016, 33.682064056396484, -4.119124412536621, 33.99586868286133, 29.492755889892578, -4.129571437835693, 29.435707092285156, 35.164794921875, -4.007841110229492, 35.63016891479492, 39.16047286987305, -4.06866979598999], "rightIrisArr": [34.81892013549805, 34.393226623535156, -4.061603546142578, 40.069766998291016, 33.682064056396484, -4.119124412536621, 33.99586868286133, 29.492755889892578, -4.129571437835693, 29.435707092285156, 35.164794921875, -4.007841110229492, 35.63016891479492, 39.16047286987305, -4.06866979598999] }

// DModel
const resultFsanet = [26.119766235351562, -27.402212142944336, -2.7349319458007812]

// EModel
_resultHsemotion = { "arousalArr": [0.15133555233478546], "emotionCode": "SUP", "valenceArr": [-0.04291853681206703] }
const _gazeEstimateResult = { "ear": 0.4030975866672722, "iris_radius": 3.075392723083496, "left_phi": -0.25268025514207865, "left_theta": -0.25268025514207865 }

// FModel
_atntn = 70

3) ONNX(Open Neural Network Exchange) Memory Leak: How to Fix It

1. ONNX Runtime Tensor memory

💡 ONNX Runtime Tensor memory

- The steps above confirmed that memory leaks for every model when it is run through ONNX Runtime.
- The biggest problem here was that GC (garbage collection) was not being performed on the memory held inside ONNX Runtime.

// Reference the model file
const HS_EMOTION_MODEL = require("../../../assets/models/hsemotion_q.ort");
// Load the model file
let _hemotionModel: InferenceSession | null = null;
const _hemotionAssets = await Asset.loadAsync(HS_EMOTION_MODEL);
const _hemotionOnnxModelUri: string | null = _hemotionAssets[0].localUri;
if (_hemotionOnnxModelUri !== null) {
  await InferenceSession.create(_hemotionOnnxModelUri)
    .then((_loadSession: InferenceSession | null) => {
      console.log("[+] Load Hsemotion Model....");
      _hemotionModel = _loadSession;
      _loadSession = null;
    })
    .catch((error) => {
      console.error(`[-] hsemotion Load Error: ${error}`);
    });
}
// Store the models in state (so they can be reused)
setInitModel({
  isLoading: true,
  isTensorReady: _tensorReady,
  faceMeshModel: _faceMeshModel,
  hsemotionModel: _hemotionModel,
  fsanetModel: _fsanetModel,
  hPoseModel: _hposeModel,
});
// Run the model
await initModel.hPoseModel
  .run(feed, initModel.hPoseModel.outputNames)
  .then((fetches: InferenceSession.OnnxValueMapType | null) => {
    if (fetches) {
      // `result` is derived from `fetches`; the derivation is omitted in this excerpt
      resultScore = result;
    }
  })
  .catch((err) => {
    console.error(`Error in _hposeModel.run(): ${err}`);
  });

2. Solution

💡 Solution

- The main issue was that, when ONNX's run() method is called, the input and output values contain ONNX Tensor values whose memory keeps accumulating.

hPoseEstimate: async (data1: Float32Array): Promise<number> => {
    let resultScore = 0;
    let inputTensor1: Tensor | null = null;
    let inputTensor2: Tensor | null = null;
    let tfHelperTensor: tf.Tensor | null = null;
    try {
        if (initModel.hPoseModel) {
            // [STEP1] Use tf.ones to build an Int32Array
            tfHelperTensor = tf.tidy(() => tf.ones([1, 1]));
            const data2 = new Int32Array(tfHelperTensor.dataSync());
            // [STEP2] Create the ONNX Tensors
            inputTensor1 = new Tensor(data1.slice(), [1, 10, 8]);
            inputTensor2 = new Tensor(data2.slice(), [1, 1]);
            const feed: { args_0: Tensor | null; args_1: Tensor | null } = {
                args_0: inputTensor1,
                args_1: inputTensor2,
            };
            // [STEP3] Run the model
            // @ts-ignore
            await initModel.hPoseModel.run(feed, initModel.hPoseModel.outputNames)
                .then((fetches: InferenceSession.OnnxValueMapType | null) => {
                    if (fetches) {
                        const result = tf.tidy(() => {
                            const output = fetches!.output_1;
                            const [, data2] = output.data;
                            return Math.floor((data2 as number) * 100);
                        });
                        resultScore = result;
                        // [STEP4] Clear the ONNX output
                        for (const key in fetches) {
                            (fetches as any)[key] = null;
                        }
                        // ✅ Also set the output itself to null (optional but recommended)
                        (fetches as any).output_1 = null;
                        fetches = null;
                    }
                    feed.args_0 = null;
                    feed.args_1 = null;
                })
                .catch((err) => {
                    console.error(`Error in _hposeModel.run(): ${err}`);
                });
        }
    } catch (error) {
        console.log(`[-] hPoseEstimate error :: ${error}`);
    } finally {
        // [STEP5] Dispose the TensorFlow tensor
        tfHelperTensor?.dispose();
        tfHelperTensor = null;
        // [STEP6] Drop the ONNX Tensor references
        inputTensor1 = null;
        inputTensor2 = null;
    }
    return resultScore;
},
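
One side observation on [STEP1]: the TensorFlow.js helper tensor there is only used to obtain a single-element Int32Array containing 1. As a sketch (the helper name `makeOnesInt32` is hypothetical and not part of the original code), the same array can be built directly, so there is no TensorFlow tensor to dispose afterwards.

```typescript
// Builds an Int32Array filled with ones without going through TensorFlow.js.
// For length 1 this yields the same contents as
// new Int32Array(tf.ones([1, 1]).dataSync()) in the code above.
function makeOnesInt32(length: number): Int32Array {
  return new Int32Array(length).fill(1);
}

const onesInput = makeOnesInt32(1);
console.log(Array.from(onesInput)); // a single element: 1
```

If this variant were used, [STEP5] would no longer be needed for this function, because no tf.Tensor would have been created.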

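3. Solution in detail

💡 Solution in detail

1. Clearing the input values

- The input is the feed object, whose values are ONNX Tensor instances. Because this is where the problem occurs, null is explicitly assigned to each entry to clear it.

2. Clearing the output values

- The output, fetches, is also made up of ONNX Tensor values internally. Here too, the parameter is typed as InferenceSession.OnnxValueMapType | null, and every element is set to null to clear it.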

/**
* How to clear the INPUT values
*/
// The feed type explicitly allows null.
const feed: { args_0: Tensor | null; args_1: Tensor | null } = {
    args_0: inputTensor1,
    args_1: inputTensor2,
};
// Run ONNX
await initModel.hPoseModel.run(feed, initModel.hPoseModel.outputNames)
// Clear the values after the run
feed.args_0 = null;
feed.args_1 = null;
/**
* How to clear the OUTPUT values
*/
await initModel.hPoseModel.run(feed, initModel.hPoseModel.outputNames)
  .then((fetches: InferenceSession.OnnxValueMapType | null) => {
      if (fetches) {
          const result = tf.tidy(() => {
              const output = fetches!.output_1;
              const [, data2] = output.data;
              return Math.floor((data2 as number) * 100);
          });
          resultScore = result;
          // [STEP4] Clear the ONNX output
          for (const key in fetches) {
              (fetches as any)[key] = null;
          }
          // Also set the output itself to null (optional but recommended)
          (fetches as any).output_1 = null;
          fetches = null;
      }
      feed.args_0 = null;
      feed.args_1 = null;
  })
  .catch((err) => {
      console.error(`Error in _hposeModel.run(): ${err}`);
  });
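
The two clearing steps can also be wrapped in one helper so that they run even when run() throws. In the snippets above the feed is cleared inside .then(), which is skipped on failure. This is a minimal sketch under assumptions: `TensorMap`, `SessionLike`, `clearMap`, and `runAndRelease` are hypothetical names, and a stub session replaces the real model so the code runs on its own. It follows the null-assignment approach described above and does not claim anything about ONNX Runtime internals.

```typescript
// Hypothetical stand-ins for the ONNX types so the sketch is self-contained.
type TensorMap = Record<string, unknown>;

interface SessionLike {
  outputNames: readonly string[];
  run(feeds: TensorMap, outputs: readonly string[]): Promise<TensorMap>;
}

// Sets every entry of a feeds/fetches map to null.
function clearMap(map: TensorMap | null): void {
  if (!map) return;
  for (const key of Object.keys(map)) {
    map[key] = null;
  }
}

// Runs once, reads a plain value out of the outputs, then clears inputs and
// outputs in `finally`, so clearing also happens when run() or `read` throws.
async function runAndRelease<T>(
  session: SessionLike,
  feeds: TensorMap,
  read: (fetches: TensorMap) => T,
): Promise<T> {
  let fetches: TensorMap | null = null;
  try {
    fetches = await session.run(feeds, session.outputNames);
    return read(fetches);
  } finally {
    clearMap(fetches);
    clearMap(feeds);
  }
}

// Stub session (an assumption) that returns a fixed two-element output.
const stubPoseSession: SessionLike = {
  outputNames: ["output_1"],
  run: async () => ({ output_1: new Float32Array([0.25, 0.75]) }),
};

const demoFeed: TensorMap = {
  args_0: new Float32Array(80),
  args_1: new Int32Array(1),
};

runAndRelease(stubPoseSession, demoFeed, (fetches) =>
  Math.floor((fetches.output_1 as Float32Array)[1] * 100),
).then((score) => console.log(score)); // 75
```

The `read` callback should return a primitive or a copy, never the tensor itself, because the output entries are set to null as soon as it returns.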

Thank you for reading. 😀
