https://github.com/OpenAutoCoder/Agentless/blob/main/README_swebench.md:
Localization, repair, and patch validation
Turning the project codebase into a tree-like structure
### Repository Structure ###
django/
setup.py
shortcuts.py
__main__.py
__init__.py
utils/
_os.py
decorators.py
lorem_ipsum.py
dateparse.py
termcolors.py
module_loading.py
timezone.py
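As a rough illustration of this step, the sketch below renders a list of relative file paths as an indented tree. The function name `render_tree`, the four-space indent, and the alphabetical ordering are assumptions made for the example; the project's own formatter may order and indent entries differently.

```python
def render_tree(paths):
    """Render relative file paths (using '/' separators) as an indented tree."""
    # Build a nested dict: each key is a path component, each value its children.
    tree = {}
    for path in paths:
        node = tree
        for part in path.split("/"):
            node = node.setdefault(part, {})

    lines = []

    def walk(node, depth):
        # Entries are sorted by name; anything with children is shown as a folder.
        for name in sorted(node):
            children = node[name]
            suffix = "/" if children else ""
            lines.append("    " * depth + name + suffix)
            walk(children, depth + 1)

    walk(tree, 0)
    return "\n".join(lines)


print(render_tree([
    "django/shortcuts.py",
    "django/__init__.py",
    "django/utils/_os.py",
]))
```

Only names and nesting are kept, which keeps the prompt short but is also why source-level detail has to come from a separate retrieval step.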
We prompt the LLM to localize and rank the top N most suspicious files that likely need editing to resolve the issue:
Please look through the following GitHub problem description and Repository structure and provide a list of files that one would need to edit to fix the problem.\\n\\n
### GitHub Problem Description ###
### Repository Structure ###
django/
setup.py
shortcuts.py
__main__.py
__init__.py
utils/
...
###\\n\\n
Please only provide the full path and return at most 5 files.\\nThe returned files should be separated by new lines ordered by most to least important and wrapped with ```\\nFor example:\\n```\\nfile1.py\\nfile2.py\\n```
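Because the prompt asks for a newline-separated list wrapped in a code fence, the reply has to be parsed before it can be used. A minimal sketch of such a parser follows; `parse_file_list` and its fallback behavior (treating an unfenced reply as the list itself) are assumptions for illustration, not the project's actual parsing code.

```python
import re

FENCE = "`" * 3  # the triple-backtick marker the prompt asks the model to use


def parse_file_list(reply, max_files=5):
    """Extract up to max_files paths from the first fenced block in reply."""
    pattern = re.escape(FENCE) + r"[^\n]*\n(.*?)" + re.escape(FENCE)
    match = re.search(pattern, reply, re.DOTALL)
    # Assumption: if the model forgot the fence, treat the whole reply as the list.
    body = match.group(1) if match else reply

    files = []
    for line in body.splitlines():
        line = line.strip()
        if line and line not in files:  # drop blanks and duplicates, keep order
            files.append(line)
    return files[:max_files]
```

Keeping the original order matters here, since the prompt asks the model to rank files from most to least important.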
"django/core/files/storage.py", "django/conf/global_settings.py", "django/core/files/uploadhandler.py", "django/core/files/temp.py"
Since our repository structure format does not contain detailed source code information, we additionally use embedding-based retrieval to find the files whose code snippets are most relevant to the issue description.
To narrow the search, we first prompt the LLM to list the folders that are irrelevant to the issue, and exclude those folders from retrieval. The prompt is:
Please look through the following GitHub problem description and Repository structure and provide a list of folders that are irrelevant to fixing the problem.\\nNote that irrelevant folders are those that do not need to be modified and are safe to ignored when trying to solve this problem.\\n\\n
### GitHub Problem Description
### Repository Structure
django/
setup.py
shortcuts.py
__main__.py
__init__.py
utils/
...
###\\n\\n
Please only provide the full path.\\nRemember that any subfolders will be considered as irrelevant if you provide the parent folder.\\nPlease ensure that the provided irrelevant folders do not include any important files needed to fix the problem\\nThe returned folders should be separated by new lines and wrapped with ```\\nFor example:\\n```\\nfolder1/\\nfolder2/folder3/\\nfolder4/folder5/\\n
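Once the irrelevant folders are known, the candidate files for retrieval are those that do not sit under any of them. A minimal sketch, assuming paths use `/` separators; `filter_files` and the folder names in the usage lines are hypothetical:

```python
def filter_files(files, irrelevant_folders):
    """Keep only files that are not inside any irrelevant folder."""
    # Normalize to a trailing slash so "django/utils" does not match
    # "django/utils_extra.py"; subfolders are excluded along with the parent.
    prefixes = [folder.rstrip("/") + "/" for folder in irrelevant_folders]
    return [
        path
        for path in files
        if not any(path.startswith(prefix) for prefix in prefixes)
    ]


candidates = filter_files(
    ["django/core/files/storage.py", "django/utils/timezone.py"],
    ["django/utils/"],  # hypothetical LLM answer
)
print(candidates)
```

Matching on the folder prefix mirrors the prompt's rule that naming a parent folder also marks its subfolders as irrelevant.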
"django/setup.py", "django/**init**.py", "django/**main**.py", "django/shortcuts.py", "django/core/signing.py", "django/core/validators.py", "django/core/exceptions.py", "django/core/signals.py", "django/core/wsgi.py", "django/core/paginator.py", "django/core/**init**.py", "django/core/files/utils.py", "django/core/files/storage.py", "django/core/files/**init**.py", "django/core/files/uploadedfile.py", "django/core/files/locks.py", "django/core/files/move.py", "django/core/files/images.py", "django/core/files/base.py", "django/core/files/uploadhandler.py", "django/core/files/temp.py", "django/conf/**init**.py", "django/conf/global_settings.py"
Embedding model retrieval
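The ranking idea can be sketched as follows: embed the issue text and each file's content, then sort files by cosine similarity to the issue. The `embed` function here is a toy bag-of-words stand-in, used only so the example runs without external services; a real setup would call an actual embedding model, and all names below are assumptions.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding: token counts. A stand-in for a real embedding model."""
    return Counter(re.findall(r"[a-z0-9_]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_files(issue, file_contents):
    """Return file paths ordered from most to least similar to the issue."""
    query = embed(issue)
    scored = [(cosine(query, embed(text)), path)
              for path, text in file_contents.items()]
    # Highest similarity first; ties broken by path for a stable order.
    scored.sort(key=lambda item: (-item[0], item[1]))
    return [path for _, path in scored]
```

The result is a full ranking of the candidate files, which is the shape of the list shown next.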
"django/core/files/storage.py", "django/conf/global_settings.py", "django/core/files/uploadedfile.py", "django/core/files/uploadhandler.py", "django/core/files/temp.py", "django/core/files/__init__.py", "django/core/files/move.py", "django/setup.py", "django/core/files/base.py", "django/core/__init__.py", "django/core/files/locks.py", "django/core/exceptions.py", "django/core/files/images.py", "django/conf/__init__.py", "django/__init__.py", "django/core/files/utils.py", "django/core/validators.py", "django/core/signing.py", "django/core/paginator.py", "django/core/signals.py", "django/__main__.py", "django/core/wsgi.py", "django/shortcuts.py"
Merge the LLM-predicted suspicious file locations with the embedding-based retrieved files to obtain a final list of relevant files
{"instance_id": "django__django-10914", "found_files": ["django/core/files/storage.py", "django/conf/global_settings.py", "django/core/files/uploadhandler.py", "django/core/files/uploadedfile.py"], "additional_artifact_loc_file": {}, "file_traj": {}}
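The merge rule is not spelled out above. One rule that reproduces the `found_files` of this instance is to take the top 3 entries of each list and deduplicate them while preserving order. The sketch below implements that rule; the cutoff of 3 and the name `merge_locations` are assumptions inferred from the example, not a confirmed description of the tool.

```python
def merge_locations(llm_files, retrieved_files, top_n=3):
    """Union of the top_n entries of both lists, LLM picks first, no duplicates."""
    merged = []
    for path in llm_files[:top_n] + retrieved_files[:top_n]:
        if path not in merged:
            merged.append(path)
    return merged


llm_files = [
    "django/core/files/storage.py",
    "django/conf/global_settings.py",
    "django/core/files/uploadhandler.py",
    "django/core/files/temp.py",
]
retrieved_files = [
    "django/core/files/storage.py",
    "django/conf/global_settings.py",
    "django/core/files/uploadedfile.py",
    "django/core/files/uploadhandler.py",
]
print(merge_locations(llm_files, retrieved_files))
```

With these inputs the function returns the four files listed in `found_files`, in the same order.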