FLIRL-Net: Fuzzy Logic-Driven Important Relationship Learning for Scene Graph Generation | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article FLIRL-Net: Fuzzy Logic-Driven Important Relationship Learning for Scene Graph Generation Jin Wang, Zilong Yang, Jialing Xu, Sixu Wu This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-7500066/v1 This work is licensed under a CC BY 4.0 License Status: Published Journal Publication published 10 Mar, 2026 Read the published version in Multimedia Systems → Version 1 posted 10 You are reading this latest preprint version Abstract Scene Graph Generation (SGG) is a fundamental task in computer vision. It entails classifying entities in images and predicting visual relationships among them. Existing scene graph generation research targets the identification of all possible relationships within an image. However, this approach may suffer from inefficiencies and generate redundant information, hindering the further processing of relationship data in downstream tasks. In contrast, identifying important relationships is more aligned with the practical requiements of downstream tasks. To address this issue, we introduce the Fuzzy Logic-Driven Important Relationship Learning Network (FLIRL-Net). Initially, we employ fuzzy logic to compute an importance score for each relationship. This computation accounts for factors affecting relationship significance and user requirements. Subsequently, we design the Relationship Label Loss Importance Weighting (RLIW) module. This module utilizes these importance scores as weights to adjust relationship sample losses. Such adjustments direct the model's focus towards important relationships. Thirdly, we develop the Importance-Based Entity Pair Feature PoolFormer (IEPFP) module to enhance the model's recognition of important entity pairs. Additionally, We propose metrics Important Relationship Recall (IR@K) and Important Relationship Precision (IP@K). These metrics assess the model's effectiveness in identifying important relationships. Experimental results demonstrate that our model excels at identifying important relationships in the VG150 dataset and effectively minimizes unimportant relationship outputs. Scene Graph Generation Important Relationships Fuzzy Logic Importance Weighting Entity Pair Feature PoolFormer Full Text Additional Declarations No competing interests reported. Cite Share Download PDF Status: Published Journal Publication published 10 Mar, 2026 Read the published version in Multimedia Systems → Version 1 posted Editorial decision: Revision requested 17 Nov, 2025 Reviews received at journal 14 Nov, 2025 Reviews received at journal 31 Oct, 2025 Reviewers agreed at journal 21 Oct, 2025 Reviewers agreed at journal 21 Oct, 2025 Reviewers agreed at journal 21 Oct, 2025 Reviewers invited by journal 21 Oct, 2025 Editor assigned by journal 17 Oct, 2025 Submission checks completed at journal 10 Sep, 2025 First submitted to journal 31 Aug, 2025 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-7500066","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":533966816,"identity":"a87a5f33-8655-41aa-a2fd-2bbbdefb8f3b","order_by":0,"name":"Jin Wang","email":"","orcid":"","institution":"Nantong University","correspondingAuthor":false,"prefix":"","firstName":"Jin","middleName":"","lastName":"Wang","suffix":""},{"id":533966817,"identity":"889d6304-88ea-47b0-98e6-cffd84da53d6","order_by":1,"name":"Zilong Yang","email":"","orcid":"","institution":"Nantong University","correspondingAuthor":false,"prefix":"","firstName":"Zilong","middleName":"","lastName":"Yang","suffix":""},{"id":533966818,"identity":"17773ac6-506d-47c4-bb41-1784ae8e4ed0","order_by":2,"name":"Jialing Xu","email":"","orcid":"","institution":"Nantong University","correspondingAuthor":false,"prefix":"","firstName":"Jialing","middleName":"","lastName":"Xu","suffix":""},{"id":533966819,"identity":"0ad949a6-7453-4a96-943d-5b97f669b781","order_by":3,"name":"Sixu Wu","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAxklEQVRIiWNgGAWjYBACNmb+jw8//JCoZ2NvPkCcFj52BmNjyR6bBD6eYwnEaZHjZzCT4GFLS5CTyDEg1mEMaRISPIfz2BhyPt54w2Anp9tAWMthiwKLw8VsDGc3W85hSDY2O0BQC2PjDaAtjG2MvdukeRgOJG4jrIWZAegXoBZmnmfEamFjAnk/sY2Nh41YLTzMoEA2BuowtpxjQIRf5PvPMIKiUk5+/uOHN95U2MkR1IICJHiIjBpkLaTqGAWjYBSMghEBAPBqN1FCcGo7AAAAAElFTkSuQmCC","orcid":"","institution":"Nantong University","correspondingAuthor":true,"prefix":"","firstName":"Sixu","middleName":"","lastName":"Wu","suffix":""}],"badges":[],"createdAt":"2025-08-31 10:53:14","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-7500066/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-7500066/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1007/s00530-026-02272-3","type":"published","date":"2026-03-10T15:59:23+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":94820931,"identity":"7e9c52b3-61bd-4c83-bcf8-d1c64301e0ae","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"json","order_by":0,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":6195,"visible":true,"origin":"","legend":"","description":"","filename":"7ca21ab7683a41de9f338737c3e680cc.json","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/172da2619db2d7e76f939c4b.json"},{"id":94820932,"identity":"e939b209-b840-47a8-b750-c30b3b931a22","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"xml","order_by":1,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":140729,"visible":true,"origin":"","legend":"","description":"","filename":"7ca21ab7683a41de9f338737c3e680cc1enriched.xml","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/04ce5b89ef92949b13d183ae.xml"},{"id":94820934,"identity":"f0294ac6-91e1-4df7-8e8f-10968ee36ffb","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"pdf","order_by":2,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":7373154,"visible":true,"origin":"","legend":"","description":"","filename":"FLIRLNetFuzzyLogicDrivenImportantRelationshipLearningforSceneGraphGeneration.pdf","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/03fd1e33ddad03a5f9529f45.pdf"},{"id":94820941,"identity":"0777e595-87b8-414a-b158-15f555c9bbbf","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"pdf","order_by":3,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":7373154,"visible":true,"origin":"","legend":"","description":"","filename":"FLIRLNetFuzzyLogicDrivenImportantRelationshipLearningforSceneGraphGeneration.pdf","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/4068310d44055e3060fad365.pdf"},{"id":94820933,"identity":"b0787b02-067e-4294-960c-b85e6ce0c368","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"png","order_by":4,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":994150,"visible":true,"origin":"","legend":"","description":"","filename":"Fig0.png","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/363372bb6c30a454eca340fe.png"},{"id":94820935,"identity":"dbef76d8-54f5-4b51-9346-3c83634fc73b","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"png","order_by":5,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":320529,"visible":true,"origin":"","legend":"","description":"","filename":"Fig1.png","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/c0d1711b91a6d16bf6c357e8.png"},{"id":94820939,"identity":"408cc248-6f48-46f0-9717-31f6592cb278","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"png","order_by":6,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":187481,"visible":true,"origin":"","legend":"","description":"","filename":"Fig2.png","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/56848a07c3383d7d77d39212.png"},{"id":94820936,"identity":"e489bae4-42a1-4ca6-b7d0-a6fa4f12f678","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"png","order_by":7,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":161337,"visible":true,"origin":"","legend":"","description":"","filename":"Fig3.png","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/fb0e689bdbc771cc1ef1bbe8.png"},{"id":94820940,"identity":"21285bd4-e7eb-4b22-ab1c-01c502c1bf28","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"png","order_by":8,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":108687,"visible":true,"origin":"","legend":"","description":"","filename":"Fig4.png","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/871cd4c8b95423ebdecfebda.png"},{"id":94820942,"identity":"4aff3db9-a4f1-4c01-bb97-179b94955f3c","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"png","order_by":9,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":146192,"visible":true,"origin":"","legend":"","description":"","filename":"Fig5.png","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/b60c9e5f304404b3d3e8d6c4.png"},{"id":94820946,"identity":"61ddfda6-946a-4920-8c61-49e65451b81c","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"png","order_by":10,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":2520046,"visible":true,"origin":"","legend":"","description":"","filename":"Fig6.png","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/3c3ac7e9a4b379b68c466c80.png"},{"id":94820945,"identity":"c487559b-781b-444e-85d2-f910964de3df","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"png","order_by":11,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":393870,"visible":true,"origin":"","legend":"","description":"","filename":"Fig7.png","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/4af4aa42a041e35eb0c57dca.png"},{"id":94820937,"identity":"6df37028-d0e6-46ee-affc-2ad927754b85","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"png","order_by":12,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":2752546,"visible":true,"origin":"","legend":"","description":"","filename":"Fig8.png","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/60601527d948859cc379b51b.png"},{"id":94826328,"identity":"3df2478b-10ab-4829-9724-de87c3f0d43a","added_by":"auto","created_at":"2025-10-31 06:51:24","extension":"cls","order_by":13,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":55857,"visible":true,"origin":"","legend":"","description":"","filename":"snjnl.cls","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/57a43243cef3525344ff55e0.cls"},{"id":94825759,"identity":"7e4189b5-f27a-47f7-b5a2-24a8ece9a1b2","added_by":"auto","created_at":"2025-10-31 06:50:41","extension":"bst","order_by":14,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":64166,"visible":true,"origin":"","legend":"","description":"","filename":"snmathphysnum.bst","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/bda141cb12853d85f48c0541.bst"},{"id":94820943,"identity":"76350157-3e95-4db8-b84f-15e0ebe383fd","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"xml","order_by":24,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":150312,"visible":true,"origin":"","legend":"","description":"","filename":"7ca21ab7683a41de9f338737c3e680cc1structuring.xml","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/4fab77c78d8cfc07ff452afc.xml"},{"id":94820947,"identity":"59f2fbec-1036-4e3b-868c-c907e054dd31","added_by":"auto","created_at":"2025-10-31 06:05:55","extension":"html","order_by":25,"title":"","display":"","copyAsset":false,"role":"acdc-reference","size":170450,"visible":true,"origin":"","legend":"","description":"","filename":"earlyproof.html","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1/c8223ef0545f832a5522f141.html"},{"id":104739528,"identity":"37070fca-13bd-45d9-ac5c-6273e9d493a4","added_by":"auto","created_at":"2026-03-16 16:08:35","extension":"pdf","order_by":1,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":7523837,"visible":true,"origin":"","legend":"","description":"","filename":"FLIRLNetFuzzyLogicDrivenImportantRelationshipLearningforSceneGraphGeneration.pdf","url":"https://assets-eu.researchsquare.com/files/rs-7500066/v1_covered_09c74193-9943-4bc2-a72b-aed8274aed84.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"FLIRL-Net: Fuzzy Logic-Driven Important Relationship Learning for Scene Graph Generation","fulltext":[],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":false,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":true,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":true,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"multimedia-systems","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"mmsj","sideBox":"Learn more about [Multimedia Systems](http://link.springer.com/journal/530)","snPcode":"530","submissionUrl":"https://submission.nature.com/new-submission/530/3","title":"Multimedia Systems","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"em","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false},"keywords":"Scene Graph Generation, Important Relationships, Fuzzy Logic, Importance Weighting, Entity Pair Feature PoolFormer","lastPublishedDoi":"10.21203/rs.3.rs-7500066/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-7500066/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eScene Graph Generation (SGG) is a fundamental task in computer vision. It entails classifying entities in images and predicting visual relationships among them. Existing scene graph generation research targets the identification of all possible relationships within an image. However, this approach may suffer from inefficiencies and generate redundant information, hindering the further processing of relationship data in downstream tasks. In contrast, identifying important relationships is more aligned with the practical requiements of downstream tasks. To address this issue, we introduce the Fuzzy Logic-Driven Important Relationship Learning Network (FLIRL-Net). Initially, we employ fuzzy logic to compute an importance score for each relationship. This computation accounts for factors affecting relationship significance and user requirements. Subsequently, we design the Relationship Label Loss Importance Weighting (RLIW) module. This module utilizes these importance scores as weights to adjust relationship sample losses. Such adjustments direct the model's focus towards important relationships. Thirdly, we develop the Importance-Based Entity Pair Feature PoolFormer (IEPFP) module to enhance the model's recognition of important entity pairs. Additionally, We propose metrics Important Relationship Recall (IR@K) and Important Relationship Precision (IP@K). These metrics assess the model's effectiveness in identifying important relationships. Experimental results demonstrate that our model excels at identifying important relationships in the VG150 dataset and effectively minimizes unimportant relationship outputs.\u003c/p\u003e","manuscriptTitle":"FLIRL-Net: Fuzzy Logic-Driven Important Relationship Learning for Scene Graph Generation","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2025-10-31 06:05:50","doi":"10.21203/rs.3.rs-7500066/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Revision requested","date":"2025-11-17T11:55:32+00:00","index":"","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-11-15T00:13:51+00:00","index":"hide","fulltext":""},{"type":"editorInvitedReview","content":"","date":"2025-10-31T06:34:29+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"207925565419558395191518380844392193015","date":"2025-10-22T02:23:46+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"232145197160934490169316637420197285412","date":"2025-10-22T00:12:27+00:00","index":"hide","fulltext":""},{"type":"reviewerAgreed","content":"236433385839234684584885106917412240698","date":"2025-10-21T11:22:27+00:00","index":"hide","fulltext":""},{"type":"reviewersInvited","content":"","date":"2025-10-21T08:36:44+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2025-10-17T14:46:53+00:00","index":"","fulltext":""},{"type":"checksComplete","content":"","date":"2025-09-10T11:13:19+00:00","index":"","fulltext":""},{"type":"submitted","content":"Multimedia Systems","date":"2025-08-31T10:42:31+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"multimedia-systems","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"mmsj","sideBox":"Learn more about [Multimedia Systems](http://link.springer.com/journal/530)","snPcode":"530","submissionUrl":"https://submission.nature.com/new-submission/530/3","title":"Multimedia Systems","twitterHandle":"","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"em","reportingPortfolio":"Springer Hybrid","inReviewEnabled":true,"inReviewRevisionsEnabled":false}}],"origin":"","ownerIdentity":"250c7f87-a337-4f1d-8e5f-3373914cca0b","owner":[],"postedDate":"October 31st, 2025","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[],"tags":[],"updatedAt":"2026-03-16T16:04:25+00:00","versionOfRecord":{"articleIdentity":"rs-7500066","link":"https://doi.org/10.1007/s00530-026-02272-3","journal":{"identity":"multimedia-systems","isVorOnly":false,"title":"Multimedia Systems"},"publishedOn":"2026-03-10 15:59:23","publishedOnDateReadable":"March 10th, 2026"},"versionCreatedAt":"2025-10-31 06:05:50","video":"","vorDoi":"10.1007/s00530-026-02272-3","vorDoiUrl":"https://doi.org/10.1007/s00530-026-02272-3","workflowStages":[]},"version":"v1","identity":"rs-7500066","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-7500066","identity":"rs-7500066","version":["v1"]},"buildId":"XKTyCvWXoU3ODBz1xrDgd","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.