codespell: add codespell config, workflow and some typos fixed #2470
yarikoptic wants to merge 8 commits into zenodo:master
| "id": "cc-by-sa" | ||
| }, | ||
| "description": "<p><strong>Project Specification</strong></p>\n\n<p>The aim of this openlab summer student project is to enhance ZENODO digital repository service with several preservation-oriented features, such as preservation meter and badge to indicate the suitability of a document for long-term reservation. The project will be developed in the Python programming language, using Flask/HTML5/jQuery/TwitterBootstrap technologies for the user interface and SQLAlchemy/MySQL for persistence.</p>\n\n<p><strong>Abstract</strong></p>\n\n<p>Digital Preservation consists mainly in storing digital information, mostly digital-born content, and making sure that it remains available and accessible in the future. This tasks has many challenges such as making sure that the files are in a known and acessible format, that they are not corrupt, lost or unretrievable. The digital preservation challenges apply, noticeably, on digital repositories such as Zenodo. Zenodo aims to provide a secure and trusty way of storing data for the long tail of science. This is to say, storing and connecting information that is normally not available on the main publications, such as the used datasets for a given study or the produced software for a specific paper. The goal of this work was to develop a Preservation Meter that allowed the users to know how suitable the files on their submitted records are in terms of preservation.This was accomplished by using a simple and intuitive visual representation of such suitability by means of a progress bar, where a completely filled bar means the file is very likely to be well preserved. The overall goals of the project were completed and the implementation of this work was integrated on the Zenodo repository as a plugin.</p>", | ||
| "description": "<p><strong>Project Specification</strong></p>\n\n<p>The aim of this openlab summer student project is to enhance ZENODO digital repository service with several preservation-oriented features, such as preservation meter and badge to indicate the suitability of a document for long-term reservation. The project will be developed in the Python programming language, using Flask/HTML5/jQuery/TwitterBootstrap technologies for the user interface and SQLAlchemy/MySQL for persistence.</p>\n\n<p><strong>Abstract</strong></p>\n\n<p>Digital Preservation consists mainly in storing digital information, mostly digital-born content, and making sure that it remains available and accessible in the future. This tasks has many challenges such as making sure that the files are in a known and accessible format, that they are not corrupt, lost or unretrievable. The digital preservation challenges apply, noticeably, on digital repositories such as Zenodo. Zenodo aims to provide a secure and trusty way of storing data for the long tail of science. This is to say, storing and connecting information that is normally not available on the main publications, such as the used datasets for a given study or the produced software for a specific paper. The goal of this work was to develop a Preservation Meter that allowed the users to know how suitable the files on their submitted records are in terms of preservation.This was accomplished by using a simple and intuitive visual representation of such suitability by means of a progress bar, where a completely filled bar means the file is very likely to be well preserved. The overall goals of the project were completed and the implementation of this work was integrated on the Zenodo repository as a plugin.</p>", |
| "id": "cc-by-sa" | ||
| }, | ||
| "description": "<p> Abstract</p>\n\n<p>WinCC OA is a SCADA (Supervisory Control and Data Acquisition) system tool that is used to develop the Control System applications. As most of the control systems used in CERN are developed in WinCC OA, it is better useful to understand how the applications developed by EN/ICE are actually used by the different operators .It becomes more and more important to monitor users’ behavior and analyzing it. The final goal of this project is to develop a generic WinCC OA component to collect data about user interaction which will take advantage of (1) the internal mechanisms already present in WinCC OA to monitor some user interactions such as the internal UI data points; and (2) the commonalities of applications through the use of the standard frameworks JCOP, UNICOS and CPC. The final component developed provides the capability of storing as well as displaying user interaction data on a single timeline.</p>\n\n<p>Kewords: WinCC OA, SCADA, JCOP, UNICOS, CPC</p>", | ||
| "description": "<p> Abstract</p>\n\n<p>WinCC OA is a SCADA (Supervisory Control and Data Acquisition) system tool that is used to develop the Control System applications. As most of the control systems used in CERN are developed in WinCC OA, it is better useful to understand how the applications developed by EN/ICE are actually used by the different operators .It becomes more and more important to monitor users’ behavior and analyzing it. The final goal of this project is to develop a generic WinCC OA component to collect data about user interaction which will take advantage of (1) the internal mechanisms already present in WinCC OA to monitor some user interactions such as the internal UI data points; and (2) the commonalities of applications through the use of the standard frameworks JCOP, UNICOS and CPC. The final component developed provides the capability of storing as well as displaying user interaction data on a single timeline.</p>\n\n<p>Keywords: WinCC OA, SCADA, JCOP, UNICOS, CPC</p>", |
| "id": "cc-by-sa" | ||
| }, | ||
| "description": "<p><strong>Project Specification</strong></p>\n\n<p>The project concerns various C++11 features - their performance and reliability. The report summarizes the tesults from four micro-benchmarks designed for this project and run with three different compilers (GCC, ICC, Clang) and tries to make an evaluation based on the results.</p>\n\n<p><strong>Abstract</strong></p>\n\n<p>As C++11 gained almost full support by compilers, it is interesting to see whether we can leverage some of the features to improve performance and reliability of C++ code. This work is focused on four selected problems: time measurement techniques, for-loops efficiency, asynchronous tasks and parallel mode of STL algorithms. For each of them a micro-benchmark is made. All the benchmarks are fully automatized to generate results from running binaries compiled by three compilers: GCC, ICC and Clang with -O2, -O3 and -Ofast options. In order to evaluate vectorization and multithreading, profiling tools such as perf and Intel Vtune are used.</p>", | ||
| "description": "<p><strong>Project Specification</strong></p>\n\n<p>The project concerns various C++11 features - their performance and reliability. The report summarizes the tesults from four micro-benchmarks designed for this project and run with three different compilers (GCC, ICC, Clang) and tries to make an evaluation based on the results.</p>\n\n<p><strong>Abstract</strong></p>\n\n<p>As C++11 gained almost full support by compilers, it is interesting to see whether we can leverage some of the features to improve performance and reliability of C++ code. This work is focused on four selected problems: time measurement techniques, for-loops efficiency, asynchronous tasks and parallel mode of STL algorithms. For each of them a micro-benchmark is made. All the benchmarks are fully automated to generate results from running binaries compiled by three compilers: GCC, ICC and Clang with -O2, -O3 and -Ofast options. In order to evaluate vectorization and multithreading, profiling tools such as perf and Intel Vtune are used.</p>", |
Here automatized was replaced with automated... I guess it could be added to the skips if really desired.
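A minimal sketch of what such a skip could look like, assuming the codespell config lives in a `.codespellrc` file at the repo root (the actual config file and its existing entries in this PR are not shown here, so treat the entry below as hypothetical):

```ini
[codespell]
# Hypothetical entry: leave "automatized" untouched instead of rewriting it to "automated"
ignore-words-list = automatized
```

With an entry like this, the word would no longer be reported or rewritten by `codespell -w`.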
=== Do not change lines below ===
{
"chain": [],
"cmd": "codespell -w -i 3 -C 2",
"exit": 0,
"extra_inputs": [],
"inputs": [],
"outputs": [],
"pwd": "."
}
^^^ Do not change lines above ^^^
=== Do not change lines below ===
{
"chain": [],
"cmd": "codespell -w",
"exit": 0,
"extra_inputs": [],
"inputs": [],
"outputs": [],
"pwd": "."
}
^^^ Do not change lines above ^^^
4d1dd24 to 2aa4813
zenodo/modules/fixtures/data/records.json looks like @lnielsen in possibly the author's intention in that file was |
The sharp eye of @musvaage spotted it in zenodo/zenodo#2470. And indeed, a GitHub search gives 460 hits ATM (https://github.com/search?q=offsers&type=code), and they look like legit typos. The typo is really close to the offser->offset one, but upon a quick GitHub search I never found offser to be a typo for offer. I think that the spurious "s" in offsers is mechanical, and thus I did not bother adding a second variant for the "offser" typo to be offer.
…ore where intentional typo made
=== Do not change lines below ===
{
"chain": [],
"cmd": "codespell -w",
"exit": 0,
"extra_inputs": [],
"inputs": [],
"outputs": [],
"pwd": "."
}
^^^ Do not change lines above ^^^
nice catch!
cf: #2476 (comment)
More specifically, developer input on the purpose of modifying the description fields should be forthcoming! Indeed, is there any 'public exposure' of such zenodo hosted "Abstract"(s) aside from those appearing on the specified URLs? If not, there would appear to be NO reason to modify those description fields. I am happy to submit a PR reverting specific changes implemented in My sympathies at this point are to scrap this PR. A new PR fixing exclusively the typos exposed herein that occur in the .html file and the .py files might be made. That approach would obviously not modify Adding codespell content to this repo seems to me to be over the top. |
Really not sure what exactly the motivation is to scrap this PR, since it fixes "legit" typos in a number of files/locations. If you would like to keep |
cf: #2476 (comment)
To elaborate on my previous comment to this current PR.
IMHO, the proposed workflow isn't justified owing to the frequency of new textual content being added to this repo. |
oh well, I guess we have different opinions about such automations - I tend to automate anything which could save humans some time at the cost of some negligible compute (in particular the free one). Feel welcome to scrap this PR then. |
As a Member would you voice an opinion on whether changes to records.json in Factually I believe those changes to records.json should be reverted. Separately might be clarified the Maintainer/Member consensus on whether or not to add this OP's proposed workflow. |
|
Unfortunately, I will have to close this PR to prevent unnecessary work from being done in this repository for the time being. See my detailed response at #2519 (comment) |
The workflow would detect new typos if they are introduced. No magical fixing would happen behind your back.
Thanks for Zenodo!